- HBase
HBase is an open-source, column-oriented, distributed database modeled after Google's BigTable and written in Java. It is developed as part of the Apache Software Foundation's Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing BigTable-like capabilities for Hadoop. HBase features compression, in-memory operation, and Bloom filters on a per-column basis, as outlined in the original BigTable paper [ [http://209.85.163.132/papers/bigtable-osdi06.pdf Chang, et al. (2006). Bigtable: A Distributed Storage System for Structured Data] ]. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the [http://hadoop.apache.org/hbase/docs/current/ Java API].
History
HBase began as a project at the company [http://www.powerset.com Powerset], out of a need to process massive amounts of data for natural language search. It is now a top-level Apache project and has generated considerable interest [ [http://blog.powerset.com/ Powerset Blog] ].
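The per-column Bloom filters mentioned above can be illustrated with a small, self-contained Java sketch. A Bloom filter answers "might this key be present?" with no false negatives and an occasional false positive, which lets a store skip files that definitely do not hold a requested key. The class name `SimpleBloomFilter`, its parameters, the hash function, and the `row/family:qualifier` key format are all assumptions made for illustration; this is not HBase's own implementation or API.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

/** Illustrative Bloom filter (hypothetical class, not part of HBase). */
class SimpleBloomFilter {
    private final BitSet bits;
    private final int numBits;    // size of the bit array
    private final int numHashes;  // number of bit positions set per key

    SimpleBloomFilter(int numBits, int numHashes) {
        if (numBits <= 0 || numHashes <= 0) {
            throw new IllegalArgumentException("numBits and numHashes must be positive");
        }
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    /** Records a key by setting numHashes bit positions derived from it. */
    void add(String key) {
        for (int i = 0; i < numHashes; i++) {
            bits.set(index(key, i));
        }
    }

    /** Returns false only if the key was definitely never added. */
    boolean mightContain(String key) {
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(index(key, i))) {
                return false;
            }
        }
        return true;
    }

    /** Double hashing: the i-th position is (h1 + i * h2) mod numBits. */
    private int index(String key, int i) {
        byte[] data = key.getBytes(StandardCharsets.UTF_8);
        int h1 = hash(data, 0x811C9DC5);
        int h2 = hash(data, 0x01000193);
        return Math.floorMod(h1 + i * h2, numBits);
    }

    /** FNV-1a-style mixing with a caller-supplied seed (an illustrative choice). */
    private static int hash(byte[] data, int seed) {
        int h = seed;
        for (byte b : data) {
            h ^= (b & 0xff);
            h *= 0x01000193;
        }
        return h;
    }

    public static void main(String[] args) {
        SimpleBloomFilter filter = new SimpleBloomFilter(1024, 3);
        filter.add("row-0001/cf:title"); // hypothetical key format
        // prints "true": a Bloom filter never reports a false negative
        System.out.println(filter.mightContain("row-0001/cf:title"));
    }
}
```

A larger bit array or a better-tuned number of hash functions lowers the false-positive rate at the cost of memory or CPU time; the values used here are arbitrary.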
External links
* [http://hadoop.apache.org/hbase Official HBase homepage]
* [http://hadoop.apache.org Official Hadoop homepage]
See also
* BigTable
* [http://www.hypertable.org/ Hypertable - C++ implementation]
Wikimedia Foundation. 2010.