Already a member?
Sign in
Oracle Olap Terminology-D
The aim of this page is to provide a glossary of common OLAP and data warehousing terms and expressions. This list is for all terms beginning with D

A common way of introducing data warehousing is to refer to the characteristics of a data warehouse as set forth by William Inmon:
Subject OrientedData warehouses are designed to help you analyze data. For example, to learn more about your company's sales data, you can build a data warehouse that concentrates on sales. Using this data warehouse, you can answer questions such as "Who was our best customer for this item last year?" This ability to define a data warehouse by subject matter, sales in this case, makes the data warehouse subject oriented.
IntegratedIntegration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. They must resolve such problems as naming conflicts and inconsistencies among units of measure. When they achieve this, they are said to be integrated.
Time VariantA data warehouse's focus on change over time is what is meant by the term time variant. In order to discover trends in business, analysts need large amounts of data. This is very much in contrast toonline transaction processing (OLTP)systems, where performance requirements demand that historical data be moved to an archive.
ata warehouses and their architectures vary depending upon the specifics of an organization's situation. Three common architectures are:
Data Warehouse Architecture (Basic)The image below shows a simple architecture for a data warehouse. End users directly access data derived from several source systems through the data warehouse.
In the image above, the metadata and raw data of a traditional OLTP system is present, as is an additional type of data, summary data. Summaries are very valuable in data warehouses because they pre-compute long operations in advance. For example, a typical data warehouse query is to retrieve something such as August sales. A summary in an Oracle database is called amaterialized view.
Data Warehouse Architecture (with a Staging Area)You need to clean and process your operational data before putting it into the warehouse, as shown in the image above. You can do this programmatically, although most data warehouses use astaging areainstead. A staging area simplifies building summaries and general warehouse management. The image below illustrates this typical architecture.
A dimension is a structure that categorizes data in order to enable users to answer business questions. Commonly used dimensions are customers, products, and time. For example, each sales channel of a clothing retailer might gather and store data regarding sales and reclamations of their Cloth assortment. The retail chain management can build a data warehouse to analyze the sales of its products across all stores over time and help answer questions such as:
The data in the retailer's data warehouse system has two important components: dimensions and facts. The dimensions are products, customers, promotions, channels, and time. One approach for identifying your dimensions is to review your reference tables, such as a product table that contains everything about a product, or a promotion table containing all information about promotions. The facts are sales (units sold) and profits. A data warehouse contains facts about the sales of each product at on a daily basis.
Within the OLAP Option the fact information is stored in what is called a Cube and the dimensional information is stored in a Dimension. In our example, each sales transaction record is uniquely defined as for each customer, for each product, for each sales channel, for each promotion, and for each day (time).
Typically members within a dimension are organized according to hierarchies and each dimension can have as many hierarchies as required to fully describe all the possible relationships. Each hierarchy represents a natural 1:n relationship between members or groups of members (the levels of a hierarchy). When traversing a hierarchy, going up a level in the hierarchy is called rolling up the data and going down a level in the hierarchy is called drilling down the data. In the retailer example:

Data analysis typically starts at higher levels in the dimensional hierarchy and gradually drills down if the situation warrants such analysis.
A single dimension can contain multiple hierarchies. For example, suppose our retailer wants to track the sales of certain items over time. Most analysis over time typically involves looking at data rolled up over calendar time periods as well as over fiscal time periods. Therefore, a typical time dimension will contain at least two time hierarchies:

Star Schema Dimension Table
A star schemaprovides one table or view with columns containing member id's representing all levels of a hierarchy for each dimension. Each row in the table specifies a branch in the hierarchy. Additional columns identify additional attributes for each level, such as long and short descriptions. In the case of a time dimension, additional attributes will be required to provide information on end date and time span for each level.

Snowflake Schema Dimension Table A snowflake schema provides separate tables or views for each levels of a hierarchy. Each row in the table specifies a level in the hierarchy with an additional column to link to each parent across the various hierarchies. The same basic requirements apply as for star schemas in terms of uniqueness.

Other more complex schemas can be created. The basic snowflake schema can be taken a stage further by moving the various attributes, such as descriptions etc, to separate tables. This follows a more 3NF approach to data storage and although it looks more complicated it can easily be managed within AWM's mapping editor.

Also see the entries for Snowflake, Star, Ragged Hierarchy, Skip Level Hierarchy

Data source
A relational table, view, synonym, or other database object that provides detail data for cubes and cube dimensions.
Data from flat/text files on the operating system, even Excel Spreadsheets, can be acessed using Oracle Database's 'external tables' feature (which makes flat files look as if they are tables which can be selected from using SQL just like other objects inside the Database)
Data from other non-Oracle databases can either be directly accessed using Oracle Database Gateways, or staged and cleansed first inside Oracle using ETL/ELT tools like Oracle Warehouse Builder, or Oracle Data Integrator.
Thus Oracle OLAP systems can be generated from heterogeneous data sources, and not only from data already present inside Oracle Database.
Data Mart
Datamarts are systems designed for a particular line of business. The image below llustrates an example where purchasing, sales, and inventories are separated. In this example, a financial analyst might want to analyze historical data for purchases and sales.Data warehouse
A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but can include data from other sources. Data warehouses separate analysis workload from transaction workload and enable an organization to consolidate data from several sources.In addition to a relational database, a data warehouse environment can include an extraction, transportation, transformation, and loading (ETL) solution, online analytical processing (OLAP) and data mining capabilities, client analysis tools, and other applications that manage the process of gathering data and delivering it to business users.A common way of introducing data warehousing is to refer to the characteristics of a data warehouse as set forth by William Inmon:
- Subject Oriented
- Integrated
- Nonvolatile
- Time Variant
Subject OrientedData warehouses are designed to help you analyze data. For example, to learn more about your company's sales data, you can build a data warehouse that concentrates on sales. Using this data warehouse, you can answer questions such as "Who was our best customer for this item last year?" This ability to define a data warehouse by subject matter, sales in this case, makes the data warehouse subject oriented.
IntegratedIntegration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. They must resolve such problems as naming conflicts and inconsistencies among units of measure. When they achieve this, they are said to be integrated.
NonvolatileNonvolatile means that,once entered into the data warehouse, data should not change. This is logical because the purpose of a data warehouse is to enable you to analyze what has occurred.
Time VariantA data warehouse's focus on change over time is what is meant by the term time variant. In order to discover trends in business, analysts need large amounts of data. This is very much in contrast toonline transaction processing (OLTP)systems, where performance requirements demand that historical data be moved to an archive.
ata warehouses and their architectures vary depending upon the specifics of an organization's situation. Three common architectures are:
- Data Warehouse Architecture (Basic)
- Data Warehouse Architecture (with a Staging Area)
- Data Warehouse Architecture (with a Staging Area and Data Marts)
Data Warehouse Architecture (with a Staging Area)You need to clean and process your operational data before putting it into the warehouse, as shown in the image above. You can do this programmatically, although most data warehouses use astaging areainstead. A staging area simplifies building summaries and general warehouse management. The image below illustrates this typical architecture.
Data Warehouse Architecture (with a Staging Area and Data Marts)Although the architecture in previous example is quite common, you may want to customize your warehouse's architecture for different groups within your organization. You can do this by addingdatamarts,which are systems designed for a particular line of business. The image below illustrates an example where purchasing, sales, and inventories are separated. In this example, a financial analyst might want to analyze historical data for purchases and sales.

Database standard form
An analytic workspace that has been constructed with a specific set of objects, such as hierarchy dimensions, level dimensions, parent relations, and level relations. Each object must be defined with a set of properties that identify its role and its relationships with other objects in the analytic workspace. Standard form is required for an analytic workspace to be accessible to OLAP tools, however, it is not a prerequisite for multidimensional analysis.Decimal
The DECIMAL data type is smaller at 8 bytes for each value and thus takes up less disk space than NUMBER data types. All calculations are done in the CPU Floating Point Unit, which is many times faster than integer arithmetic. However, floating point calculations produce slightly differently results on different platforms, typically at the seventh decimal place. DECIMAL is the best choice when the analytic workspace will be used heavily for computations.Denormalized
Permit redundancy in a table. Contrast with normalize.Derived measure
See calculated measure.Descendant
A dimension member at a lower level of aggregation than a particular member. For example, in a Time dimension, the day 06-July-07 is the descendant of year 2007. The member immediately below is the child. In a dimension hierarchy, the data values of the descendants roll up into the data values of the ancestors. Contrast withancestor. See alsoaggregation,child,hierarchy,level.Detail data
Data at the lowest level, which is acquired from another source. Contrast withaggregation.Dimension
The term dimension is commonly used in two ways:- A general term for any characteristic that is used to specify the members of a data set. The three most common dimensions in sales-oriented data warehouses are time, geography, and product. Most dimensions have hierarchies.
- An object defined in a database to enable queries to navigate dimensions. In Oracle Database 10g, a dimension is a database object that defines hierarchical (parent/child) relationships between pairs of column sets. In Oracle Express, a dimension is a database object that consists of a list of values.
A dimension is a structure that categorizes data in order to enable users to answer business questions. Commonly used dimensions are customers, products, and time. For example, each sales channel of a clothing retailer might gather and store data regarding sales and reclamations of their Cloth assortment. The retail chain management can build a data warehouse to analyze the sales of its products across all stores over time and help answer questions such as:
- What is the effect of promoting one product on the sale of a related product that is not promoted?
- What are the sales of a product before and after a promotion?
- How does a promotion affect the various distribution channels?
The data in the retailer's data warehouse system has two important components: dimensions and facts. The dimensions are products, customers, promotions, channels, and time. One approach for identifying your dimensions is to review your reference tables, such as a product table that contains everything about a product, or a promotion table containing all information about promotions. The facts are sales (units sold) and profits. A data warehouse contains facts about the sales of each product at on a daily basis.
Within the OLAP Option the fact information is stored in what is called a Cube and the dimensional information is stored in a Dimension. In our example, each sales transaction record is uniquely defined as for each customer, for each product, for each sales channel, for each promotion, and for each day (time).
Typically members within a dimension are organized according to hierarchies and each dimension can have as many hierarchies as required to fully describe all the possible relationships. Each hierarchy represents a natural 1:n relationship between members or groups of members (the levels of a hierarchy). When traversing a hierarchy, going up a level in the hierarchy is called rolling up the data and going down a level in the hierarchy is called drilling down the data. In the retailer example:
- Within the time dimension, months roll up to quarters, quarters roll up to years, and years roll up to all years.
- Within the product dimension, products roll up to subcategories, subcategories roll up to categories, and categories roll up to all products.
- Within the customer dimension, customers roll up to city. Then cities roll up to state. Then states roll up to country. Then countries roll up to subregion. Finally, subregions roll up to region, as shown below
Data analysis typically starts at higher levels in the dimensional hierarchy and gradually drills down if the situation warrants such analysis.
A single dimension can contain multiple hierarchies. For example, suppose our retailer wants to track the sales of certain items over time. Most analysis over time typically involves looking at data rolled up over calendar time periods as well as over fiscal time periods. Therefore, a typical time dimension will contain at least two time hierarchies:
- Calendar
- Fiscal
Dimension key
See dimension member.Dimension member
One element in the list that composes a cube dimension. For example, a Time dimension might have dimension members for days, months, quarters, and years.Dimension Table
Dimension tables describe the business entities of an enterprise, represented as hierarchical, categorical information such as time, departments, locations, and products. Dimension tables are sometimes called lookup or reference tables. Typically, these types of tables are divided into three basic categories, that are related to the mapping options within Analytic Workspace schema:
Star Schema Dimension Table
A star schemaprovides one table or view with columns containing member id's representing all levels of a hierarchy for each dimension. Each row in the table specifies a branch in the hierarchy. Additional columns identify additional attributes for each level, such as long and short descriptions. In the case of a time dimension, additional attributes will be required to provide information on end date and time span for each level.
Snowflake Schema Dimension Table A snowflake schema provides separate tables or views for each levels of a hierarchy. Each row in the table specifies a level in the hierarchy with an additional column to link to each parent across the various hierarchies. The same basic requirements apply as for star schemas in terms of uniqueness.
Other more complex schemas can be created. The basic snowflake schema can be taken a stage further by moving the various attributes, such as descriptions etc, to separate tables. This follows a more 3NF approach to data storage and although it looks more complicated it can easily be managed within AWM's mapping editor.
Also see the entries for Snowflake, Star, Ragged Hierarchy, Skip Level Hierarchy
Dimension Value
One element in the list that makes up a dimension. For example, a computer company might have dimension values in the product dimension called LAPPC and DESKPC. Values in the geography dimension might include Boston and Paris. Values in the time dimension might include MAY96 and JAN97.
Drill
To navigate from one item to a set of related items. Drilling typically involves navigating up and down through the levels in a hierarchy.Drilling down expands the view to include child values that are associated with parent values in the hierarchy. Drilling up collapses the list of descendant values that are associated with a parent value in the hierarchy. The image below shows drilling within Discoverer Report via the dimension on the row edge.
Kevin.Lancaster |
Latest page update: made by Kevin.Lancaster
, Jul 2 2008, 4:39 AM EDT
(about this update
About This Update
Edited by Kevin.Lancaster
2 words added 2 words deleted view changes - complete history) |
|
Keyword tags:
OLAP terminology
More Info: links to this page
|
| Started By | Thread Subject | Replies | Last Post | |
|---|---|---|---|---|
| keith_laker | Phrases and Terminology | 0 | Jan 21 2008, 9:47 AM EST by keith_laker | |
|
Thread started: Jan 21 2008, 9:47 AM EST
Watch
If you can think of any additional phrases or terminology you would like to see added to this page, feel free to make updates or make a posting and I will research the subject for and create an entry for you.
Regards Keith Laker Oracle EMEA Consulting OLAP Blog: http://oracleOLAP.blogspot.com/ OLAP Wiki: http://wiki.oracle.com/page/Oracle+OLAP+Option DM Blog: http://oracledmt.blogspot.com/ OWB Blog : http://blogs.oracle.com/warehousebuilder/ OWB Wiki : http://wiki.oracle.com/page/Oracle+Warehouse+Builder DW on OTN : http://www.oracle.com/technology/products/bi/db/11g/index.html
0
out of
2 found this valuable.
Do you?
Keyword tags:
OLAP terminology
|
||||
