Dimension table

Dimension table

In data warehousing, a dimension table is one of the set of companion tables to a fact table.

The fact table contains business facts or measures and foreign keys which refer to candidate keys (normally primary keys) in the dimension tables.

Contrary to fact tables, the dimension tables contain descriptive attributes (or fields) which are typically textual fields or discrete numbers that behave like text. These attributes are designed to serve two critical purposes: query constraining/filtering and query result set labeling.

Dimension attributes are supposed to be:

  • Verbose - labels consisting of full words,
  • Descriptive,
  • Complete - no missing values,
  • Discretely valued - only one value per row in dimensional table,
  • Quality assured - no misspelling, no impossible values.

Dimension table rows are uniquely identified by a single key field. It is recommended that the key field is a simple integer for the reason that key value is meaningless and is only used to be join fields between the fact and dimension tables.

The usage of surrogate dimension keys brings several advantages among:

  • Performance - join processing is much more efficient if a single field surrogate key is used,
  • Buffer from operational key management practices - prevents form situations when removed data rows might reappear when their natural keys might be reused or reassigned after a long period of dormancy,
  • Mapping to integrate disparate sources,
  • Handle unknown or not applicable connections,
  • Track changes in dimension attribute values.

Usage of surrogate keys also brings an additional costs due the burden put on the ETL system. Still, pipeline processing can be improved and ETL tools have built-in improved surrogate key processing.

The goal of dimension table is to create standardized conformed dimensions which can be shared across the enterprise's data warehouse environment and joining to multiple fact table representing various business processes.

Conformed dimensions are highly important to enterprise nature of DW/BI system for following reasons:

  • Consistency - every fact table is filtered consistently, result query answer are labeled consistently,
  • Integration - queries are able to drill different processes fact tables separately for each individual fact table and then join the results on common dimension attributes,
  • Reduced development time to market - the common dimensions are available without recreating the wheel over again.

Over time, the attributes of a given row in a dimension table may change. For example, the shipping address for a company may change. Kimball refers to this phenomenon as Slowly Changing Dimensions. Strategies for dealing with this kind of change are divided into three categories:

  • Type One - Simply overwrite the old value(s).
  • Type Two - Add a new row containing the new value(s), and distinguish between the rows using Tuple-versioning techniques.
  • Type Three - Add a new attribute to the existing row.

See also

References

  • Kimball, Ralph. The Data Warehouse Lifecycle Toolkit Scond Edition. Winely Publishing Inc., 2008, p.241-246.

Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Dimension (data warehouse) — This article is about a dimension in a data warehouse. For other uses, see dimension (disambiguation). In a data warehouse, a dimension is a data element that categorizes each item in a data set into non overlapping regions. A data warehouse… …   Wikipedia

  • table — [ tabl ] n. f. • 1050; var. taule XIIIe (→ tôle); lat. tabula « planche, tablette » I ♦ Objet formé essentiellement d une surface plane horizontale, généralement supportée par un pied, des pieds, sur lequel on peut poser des objets. 1 ♦ Surface… …   Encyclopédie Universelle

  • Slowly changing dimension — Dimension is a term in data management and data warehousing that refers to logical groupings of data such as geographical location, customer information, or product information. Slowly Changing Dimensions (SCD) are dimensions that have data that… …   Wikipedia

  • Table (information) — Tabular redirects here. For the typewriter key, see tab key. For sortable tables in Wikipedia, see Help:Sorting An example table rendered in a web browser using HTML. A table is a means of arranging data in rows an …   Wikipedia

  • Table of Lie groups — This article gives a table of some common Lie groups and their associated Lie algebras.The following are noted: the topological properties of the group (dimension, connectedness, compactness, the nature of the fundamental group, and whether or… …   Wikipedia

  • Degenerate dimension — Contents 1 The Kimball definition 2 Other uses of the term 3 See also 4 External reference 5 Notes …   Wikipedia

  • Fact table — In data warehousing, a fact table consists of the measurements, metrics or facts of a business process. It is often located at the centre of a star schema, surrounded by dimension tables.Fact tables provide the (usually) additive values which act …   Wikipedia

  • Produit vectoriel en dimension 7 — En mathématiques, et plus précisément en algèbre linéaire, le produit vectoriel en dimension 7 est une loi de composition interne d un espace euclidien à 7 dimensions, ayant certaines propriétés du produit vectoriel usuel (en dimension 3) ;… …   Wikipédia en Français

  • State transition table — In automata theory and sequential logic, a state transition table is a table showing what state (or states in the case of a nondeterministic finite automaton) a finite semiautomaton or finite state machine will move to, based on the current state …   Wikipedia

  • Saison 1 de La Cinquième Dimension — Cet article présente le guide de la première saison de la série télévisée La Cinquième Dimension. Sommaire 1 Épisodes 1.1 Épisode 1a : Le Jour de la déchirure 1.2 Épisode 1b : Une petite paix bien tranquille …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”