Column (data store)

Column (data store)
A column consists of a (unique) name, a value, and a timestamp.

A column of a distributed data store is a NoSQL object of the lowest level in a keyspace. It is a tuple (a key-value pair) consisting of three elements:[1]

  • Unique name: Used to reference the column
  • Value: The content of the column. It can have different types, like AsciiType, LongType, TimeUUIDType, UTF8Type among others.
  • Timestamp: The system timestamp used to determine the valid content.

Contents

Usage

The column is used as a store for the value and has a timestamp that is used to differentiate the valid content from stale ones. According to the CAP theorem, distributed data stores cannot guarantee consistency, as availability is a more important issue. Therefore, the data store or the application programmer will use the timestamp to find out which of the stored values in the backup nodes are up-to-date.

Some data stores, like Apache Cassandra 0.7, may use the more sophisticated vector clock instead of the timestamp to resolve stale information.[2]

Differences to a relational database

In relational databases, a column is a part of a relational table that can be seen in each row of the table. This is not the case in distributed data stores, where the concept of a table only vaguely exists. A column can be part of a ColumnFamily that resembles at most a relational row, but it may appear in one row and not in the others. Also, the number of columns may change from row to row, and new updates to the data store model may also modify the column number. So, all the work of keeping up with changes relies on the application programmer.

Examples

In JSON-like notation, three column definitions are given:

    street: {name: "street", VALUE: "1234 x street", TIMESTAMP: 123456789},
    city: {name: "city", VALUE: "san francisco", TIMESTAMP: 123456789},
    zip: {name: "zip", VALUE: "94107", TIMESTAMP: 123456789},
 }

See also

  • Super column

References

  1. ^ Chaker Nakhli. "Cassandra’s data model cheat sheet: Data model elements: Column". Chaker Nakhli's Blog - Yet another technical blog.. http://www.javageneration.com/?p=70. Retrieved 2011-03-29. "A name value pair [consisting of a name and a value] (contains also a time-stamp, not represented here for the sake of clarity)." 
  2. ^ "Apache Cassandra Glossary: Terms: Vector Clock". http://io.typepad.com/: Eben Hewitt's Cassandra Guide. http://io.typepad.com/glossary.html. Retrieved 2011-03-29. "A Vector Clock event synchronization strategy will likely be introduced in Cassandra 0.7." 

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Data store — A data store is a data repository of a set of integrated objects. These objects are modeled using classes defined in database schemas.[1] Data store includes not only data repositories like databases, it is a more general concept that includes… …   Wikipedia

  • Operational data store — An operational data store (or ODS ) is a database designed to integrate data from multiple sources for additional operations on the data. The data is then passed back to operational systems for further operations and to the data warehouse for… …   Wikipedia

  • Column (database) — In the context of a relational database table, a column is a set of data values of a particular simple type, one for each row of the table.[1] The columns provide the structure according to which the rows are composed. The term field is often… …   Wikipedia

  • Data warehouse — Overview In computing, a data warehouse (DW) is a database used for reporting and analysis. The data stored in the warehouse is uploaded from the operational systems. The data may pass through an operational data store for additional operations… …   Wikipedia

  • Data Vault Modeling — is a database modeling method that is designed to provide historical storage of data coming in from multiple operational systems. It is also a method of looking at historical data that, apart from the modeling aspect, deals with issues such as… …   Wikipedia

  • Data transformation — Data transformation/Source transformation Concepts metadata · data mapping data transformation · model transf …   Wikipedia

  • Data Mining Extensions — (DMX) is a query language for Data Mining Models supported by Microsoft s SQL Server Analysis Services product. Like SQL, it supports a data definition language, data manipulation language and a data query language, all three with SQL like syntax …   Wikipedia

  • Data extraction — is the act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration). The import into the intermediate extracting system is thus usually followed by… …   Wikipedia

  • Column-oriented DBMS — A column oriented DBMS is a database management system (DBMS) that stores its content by column rather than by row. This has advantages for data warehouses and library catalogues where aggregates are computed over large numbers of similar data… …   Wikipedia

  • Data model — Overview of data modeling context: A data model provides the details of information to be stored, and is of primary use when the final product is the generation of computer software code for an application or the preparation of a functional… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”