Dimension table

Dimension table: In data warehousing, a dimension table is one of the set of companion tables to a fact table.

The fact table contains business facts or measures and foreign keys which refer to candidate keys (normally primary keys) in the dimension tables.

Contrary to fact tables, the dimension tables contain descriptive attributes (or fields) which are typically textual fields or discrete numbers that behave like text. These attributes are designed to serve two critical purposes: query constraining/filtering and query result set labeling.

Dimension attributes are supposed to be:

Verbose - labels consisting of full words,

Descriptive,

Complete - no missing values,

Discretely valued - only one value per row in dimensional table,

Quality assured - no misspelling, no impossible values.

Dimension table rows are uniquely identified by a single key field. It is recommended that the key field is a simple integer for the reason that key value is meaningless and is only used to be join fields between the fact and dimension tables.

The usage of surrogate dimension keys brings several advantages among:

Performance - join processing is much more efficient if a single field surrogate key is used,

Buffer from operational key management practices - prevents form situations when removed data rows might reappear when their natural keys might be reused or reassigned after a long period of dormancy,

Mapping to integrate disparate sources,

Handle unknown or not applicable connections,

Track changes in dimension attribute values.

Usage of surrogate keys also brings an additional costs due the burden put on the ETL system. Still, pipeline processing can be improved and ETL tools have built-in improved surrogate key processing.

The goal of dimension table is to create standardized conformed dimensions which can be shared across the enterprise's data warehouse environment and joining to multiple fact table representing various business processes.

Conformed dimensions are highly important to enterprise nature of DW/BI system for following reasons:

Consistency - every fact table is filtered consistently, result query answer are labeled consistently,

Integration - queries are able to drill different processes fact tables separately for each individual fact table and then join the results on common dimension attributes,

Reduced development time to market - the common dimensions are available without recreating the wheel over again.

Over time, the attributes of a given row in a dimension table may change. For example, the shipping address for a company may change. Kimball refers to this phenomenon as Slowly Changing Dimensions. Strategies for dealing with this kind of change are divided into three categories:

Type One - Simply overwrite the old value(s).

Type Two - Add a new row containing the new value(s), and distinguish between the rows using Tuple-versioning techniques.

Type Three - Add a new attribute to the existing row.

See also

Data Warehouse

Dimension

Fact table

References

Kimball, Ralph. The Data Warehouse Lifecycle Toolkit Scond Edition. Winely Publishing Inc., 2008, p.241-246.

Kimball, Ralph et al. (1998); The Data Warehouse Lifecycle Toolkit, p17. Pub. Wiley. ISBN 0-471-25547-5.

Kimball, Ralph (1996); The Data Warehouse Toolkit, p100. Pub. Wiley. ISBN 0-471-15337-0.

v · d · eData warehouse

Creating the data warehouse

Concepts

Database

Dimension

Dimensional modeling

Fact

OLAP

Star schema

Aggregate

Variants

Anchor Modeling

Column-oriented DBMS

Data Vault Modeling

HOLAP

MOLAP

ROLAP

Operational data store

Elements

Data dictionary/Metadata

Data mart

Sixth normal form

Surrogate key

Fact

Fact table

Early-arriving fact

Measure

Dimension

Dimension table

Degenerate

Slowly changing

Filling

Extract-Transform-Load (ETL)

Extract

Transform

Load

Using the data warehouse

Concepts

Business intelligence

Dashboard

Data mining

Decision support system (DSS)

OLAP cube

Languages

Data Mining Extensions (DMX)

MultiDimensional eXpressions (MDX)

XML for Analysis (XMLA)

Tools

Business intelligence tools

Reporting software

Spreadsheet

Related

People

Bill Inmon

Ralph Kimball

Products

Comparison of OLAP Servers

Data warehousing products and their producers

Categories:
Data warehousing

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

Dimension (data warehouse) — This article is about a dimension in a data warehouse. For other uses, see dimension (disambiguation). In a data warehouse, a dimension is a data element that categorizes each item in a data set into non overlapping regions. A data warehouse… … Wikipedia
table — [ tabl ] n. f. • 1050; var. taule XIIIe (→ tôle); lat. tabula « planche, tablette » I ♦ Objet formé essentiellement d une surface plane horizontale, généralement supportée par un pied, des pieds, sur lequel on peut poser des objets. 1 ♦ Surface… … Encyclopédie Universelle
Slowly changing dimension — Dimension is a term in data management and data warehousing that refers to logical groupings of data such as geographical location, customer information, or product information. Slowly Changing Dimensions (SCD) are dimensions that have data that… … Wikipedia
Table (information) — Tabular redirects here. For the typewriter key, see tab key. For sortable tables in Wikipedia, see Help:Sorting An example table rendered in a web browser using HTML. A table is a means of arranging data in rows an … Wikipedia
Table of Lie groups — This article gives a table of some common Lie groups and their associated Lie algebras.The following are noted: the topological properties of the group (dimension, connectedness, compactness, the nature of the fundamental group, and whether or… … Wikipedia
Degenerate dimension — Contents 1 The Kimball definition 2 Other uses of the term 3 See also 4 External reference 5 Notes … Wikipedia
Fact table — In data warehousing, a fact table consists of the measurements, metrics or facts of a business process. It is often located at the centre of a star schema, surrounded by dimension tables.Fact tables provide the (usually) additive values which act … Wikipedia
Produit vectoriel en dimension 7 — En mathématiques, et plus précisément en algèbre linéaire, le produit vectoriel en dimension 7 est une loi de composition interne d un espace euclidien à 7 dimensions, ayant certaines propriétés du produit vectoriel usuel (en dimension 3) ;… … Wikipédia en Français
State transition table — In automata theory and sequential logic, a state transition table is a table showing what state (or states in the case of a nondeterministic finite automaton) a finite semiautomaton or finite state machine will move to, based on the current state … Wikipedia
Saison 1 de La Cinquième Dimension — Cet article présente le guide de la première saison de la série télévisée La Cinquième Dimension. Sommaire 1 Épisodes 1.1 Épisode 1a : Le Jour de la déchirure 1.2 Épisode 1b : Une petite paix bien tranquille … Wikipédia en Français

Academic Dictionaries and Encyclopedias

Dimension table

See also

References

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Dimension table

See also

References

Look at other dictionaries:

Share the article and excerpts

Direct link