OLAP cube

OLAP cube
OLAP Cube

An OLAP cube (for online analytical processing) is a data structure that allows fast analysis of data.[1] It can also be defined as the capability of manipulating and analyzing data from multiple perspectives. The arrangement of data into cubes overcomes some limitations of relational databases.[2]

Contents

Background

OLAP cubes can be thought of as extensions to the two-dimensional array of a spreadsheet. For example a company might wish to analyze some financial data by product, by time-period, by city, by type of revenue and cost, and by comparing actual data with a budget. These additional methods of analyzing the data are known as dimensions.[3] Because there can be more than three dimensions in an OLAP system the term hypercube is sometimes used.

Functionality

The OLAP cube consists of numeric facts called measures which are categorized by dimensions. The cube metadata (structure) may be created from a star schema or snowflake schema of tables in a relational database. Measures are derived from the records in the fact table and dimensions are derived from the dimension tables.

Pivot

A financial analyst might want to view or "pivot" the data in various ways, such as displaying all the cities down the page and all the products across a page. This could be for a specified period, version and type of expenditure. Having seen the data in this particular way the analyst might then immediately wish to view it in another way. The cube could effectively be re-oriented so that the data displayed now has periods across the page and type of cost down the page. Because this re-orientation involves re-summarizing very large amounts of data, this new view of the data has to be generated efficiently to avoid wasting the analyst's time, i.e. within seconds, rather than the hours a relational database and conventional report-writer might have taken.[4]

Hierarchy

Each of the elements of a dimension could be summarized using a hierarchy.[5] The hierarchy is a series of parent-child relationships, typically where a parent member represents the consolidation of the members which are its children. Parent members can be further aggregated as the children of another parent.[6]

For example May 2005 could be summarized into Second Quarter 2005 which in turn would be summarized in the Year 2005. Similarly the cities could be summarized into regions, countries and then global regions; products could be summarized into larger categories; and cost headings could be grouped into types of expenditure. Conversely the analyst could start at a highly summarized level, such as the total difference between the actual results and the budget, and drill down into the cube to discover which locations, products and periods had produced this difference.

OLAP operations

The analyst can understand the meaning contained in the databases using multi-dimensional analysis. By aligning the data content with the analyst's mental model, the chances of confusion and erroneous interpretations are reduced. The analyst can navigate through the database and screen for a particular subset of the data, changing the data's orientations and defining analytical calculations.[6] The user-initiated process of navigating by calling for page displays interactively, through the specification of slices via rotations and drill down/up is sometimes called "slice and dice". Common operations include slice and dice, drill down, roll up, and pivot.

OLAP slicing

Slice: A slice is a subset of a multi-dimensional array corresponding to a single value for one or more members of the dimensions not in the subset.[6] The picture shows a slicing operation: The sales figures of all sales regions and all product categories of the company in the year 2004 are "sliced" out the data cube.


OLAP dicing

Dice: The dice operation is a slice on more than two dimensions of a data cube (or more than two consecutive slices).[7] The picture shows a dicing operation: The new cube shows the sales figures of a limited number of product categories, the time and region dimensions cover the same range as before.


OLAP-functionalities
OLAP Drill-up and drill-down

Drill Down/Up: Drilling down or up is a specific analytical technique whereby the user navigates among levels of data ranging from the most summarized (up) to the most detailed (down).[6] The picture shows a drilling operation: Theres a better understanding of the sales figures of the product category "Outdoor-Schutzausrüstung" since you now see the sales figures for the single products of this category.


Roll-up: A roll-up involves computing all of the data relationships for one or more dimensions. To do this, a computational relationship or formula might be defined.[6]


OLAP pivoting

Pivot: This operation is also called rotate operation. It rotates the data in order to provide an alternative presentation of data - the report or page display takes a different dimensional orientation.[6] The picture shows a pivoting operation: The whole cube is rotated, giving another perspective on the data.


Linking cubes and sparsity

The commercial OLAP products have different methods of creating and of linking cubes and hypercubes (see Types of OLAP).

Linking cubes is a method of overcoming sparsity. Sparsity arises when not every cell in the cube is filled with data and so valuable processing time is taken by effectively adding up zeros. For example revenues may be available for each customer and product but cost data may not be available with this amount of analysis. Instead of creating a sparse cube, it is sometimes better to create another separate, but linked, cube in which a sub-set of the data can be analyzed into great detail. The linking ensures that the data in the cubes remain consistent.

Variance in products

The data in cubes may be updated at times, perhaps by different people. Techniques are therefore often needed to lock parts of the cube while one of the users is writing to it and to recalculate the cube's totals. Other facilities may allow an alert that shows previously calculated totals are no longer valid after the new data have been added, but some products only calculate the totals when they are needed.

Technical definition

In database theory, an OLAP cube is[8] an abstract representation of a projection of an RDBMS relation. Given a relation of order N, consider a projection that subtends X, Y, and Z as the key and W as the residual attribute. Characterizing this as a function,

f : (X,Y,Z) → W,

the attributes X, Y, and Z correspond to the axes of the cube, while the W value into which each ( X, Y, Z ) triple maps corresponds to the data element that populates each cell of the cube.

Insofar as two-dimensional output devices cannot readily characterize four dimensions, it is more practical to project "slices" of the data cube (we say project in the classic vector analytic sense of dimensional reduction, not in the SQL sense, although the two are conceptually similar), perhaps

g : (X,Y) → W

which may suppress a primary key, but still have some semantic significance, perhaps a slice of the triadic functional representation for a given Z value of interest.

The motivation[8] behind OLAP displays harks back to the cross-tabbed report paradigm of 1980s DBMS. One may wish for a spreadsheet-style display, where values of X populate row $1; values of Y populate column $A; and values of g : ( X, Y ) → W populate the individual cells "southeast of" $B2, so to speak, $B2 itself included. While one can certainly use the DML (Data Manipulation Language) of traditional SQL to display ( X, Y, W ) triples, this output format is not nearly as convenient as the cross-tabbed alternative: certainly, the former requires one to hunt linearly for a given ( X, Y ) pair in order to determine the corresponding W value, while the latter enables one to more conveniently scan for the intersection of the proper X column with the proper Y row.

See also

References

  1. ^ Codd E.F., Codd S.B., and Salley C.T. (1993). "Providing OLAP (On-line Analytical Processing) to User-Analysts: An IT Mandate". Codd & Date, Inc. http://www.fpm.com/refer/codd.html. Retrieved 2008-03-05. 
  2. ^ Hari Mailvaganam (2007). "Introduction to OLAP - Slice, Dice and Drill". DWreview. http://www.dwreview.com/OLAP/Introduction_OLAP.html. Retrieved 2008-03-05. 
  3. ^ "Cybertec releases OLAP cubes for PostgreSQL". PostgreSQL. 2006-10-02. http://www.postgresql.org/about/news.653. Retrieved 2008-03-05. 
  4. ^ "Computer Encyclopedia: multidimensional views". Answers.com. http://www.answers.com/topic/multidimensional-views?cat=technology. Retrieved 2008-03-05. 
  5. ^ "Oracle9i Data Warehousing Guide hierarchy". Lorentz Center. http://www.lorentzcenter.nl/awcourse/oracle/server.920/a96520/glossary.htm#432038. Retrieved 2008-03-05. 
  6. ^ a b c d e f "OLAP and OLAP Server Definitions". The OLAP Council. 1995. http://www.olapcouncil.org/research/glossaryly.htm. Retrieved 2008-03-18. 
  7. ^ "Glossary of Data Mining Terms". University of Alberta. 1999. http://www.cs.ualberta.ca/~zaiane/courses/cmput690/glossary.html. Retrieved 2008-03-17. 
  8. ^ a b Gray, Jim; Bosworth, Adam; Layman, Andrew; Priahesh, Hamid (1995-11-18). "Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals". Proc. 12th International Conference on Data Engineering. IEEE. pp. 152–159. http://research.microsoft.com/~gray/DataCube.doc. Retrieved 2008-11-09. 

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

  • OLAP-Cube — Ein OLAP Cube oder Data Cube oder Cube Operator oder OLAP Würfel ist ein in der Data Warehouse Theorie gebräuchlicher Begriff zur logischen Darstellung von Daten. Die Daten werden dabei als Elemente eines mehrdimensionalen Würfels (engl. cube)… …   Deutsch Wikipedia

  • OLAP cube — /oʊlæp ˈkjub/ (say ohlap kyoohb) noun a data structure that allows for the fast analysis of data, as in data warehousing, in which data is categorised in a number of different ways allowing for different views to be presented quickly. {O(n)l(ine) …  

  • OLAP — Online Analytical Processing (OLAP) wird neben dem Data Mining (auch Datenschürfung) zu den Methoden der analytischen Informationssysteme gezählt. OLAP wird weiterhin den hypothesengestützten Analysemethoden zugeordnet. Der Analyst muss vor der… …   Deutsch Wikipedia

  • Cube (OLAP) — Ein OLAP Cube oder Data Cube oder Cube Operator oder OLAP Würfel ist ein in der Data Warehouse Theorie gebräuchlicher Begriff zur logischen Darstellung von Daten. Die Daten werden dabei als Elemente eines mehrdimensionalen Würfels (engl. cube)… …   Deutsch Wikipedia

  • OLAP-Würfel — Schematisches Beispiel eines Dimensionswürfel mit drei Dimensionen (Data Cube) Ein OLAP Würfel oder Datenwürfel (englisch OLAP cube oder Data Cube), auch Cube Operator genannt, ist ein in der Data Warehouse Theorie gebräuchlicher Begriff zur… …   Deutsch Wikipedia

  • cube — /kjub / (say kyoohb) noun 1. solid bounded by six equal squares, the angle between any two adjacent faces being a right angle. 2. a piece of anything of this form. 3. the third power of a quantity: the cube of 4 is 4 × 4 × 4, or 64. 4. a small… …  

  • Cube — This article is about the geometric shape. For other uses, see Cube (disambiguation). Regular Hexahedron (Click here for rotating model) Type Platonic solid Elements F = 6, E = 12 V = 8 (χ = 2) …   Wikipedia

  • Cube (disambiguation) — A cube is a shape in three dimensions. It may also refer to: Contents 1 People 2 Math and science 3 Entertainment …   Wikipedia

  • Cube — (engl. Würfel) steht für: Cube (Computerspiel), Computerspiel Rubik s Cube, englische Bezeichnung für den Zauberwürfel von Ernő Rubik Cube (Fahrradhersteller), Fahrradhersteller aus Waldershof Nissan Cube, PKW Modell OLAP Würfel, Darstellungsform …   Deutsch Wikipedia

  • Cube (Homonymie) — Cette page d’homonymie répertorie les différents sujets et articles partageant un même nom …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”