Online aggregation

Almost all database operations are performed in batch mode: the user issues a query and waits until the database has finished processing the entire query. With online aggregation, by contrast, the user receives estimates of an aggregate query as soon as the query is issued, and those estimates are refined while processing continues. For example, if the final answer is 1000, then after k seconds the user might see an estimate in the form of a confidence interval such as [990, 1020] that contains the true answer with 95% probability. The interval keeps shrinking as the system processes more and more samples.
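The shrinking interval can be illustrated with a minimal sketch. It assumes a single in-memory table scanned in random order, an AVG aggregate, and a normal-approximation (central limit theorem) interval with no finite-population correction. The function name online_mean_estimates, its parameters, and the synthetic data are hypothetical and are not taken from any of the systems cited in this article.

```python
import math
import random


def online_mean_estimates(values, report_every, z=1.96, seed=0):
    """Yield (rows_seen, estimate, low, high) while scanning rows in random order.

    The interval is a normal-approximation confidence interval for the mean;
    z=1.96 corresponds to roughly 95% coverage (an assumption of this sketch).
    """
    rng = random.Random(seed)
    order = list(values)
    rng.shuffle(order)  # random scan order stands in for random sampling

    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations (Welford's method)
    for x in order:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
        if n > 1 and n % report_every == 0:
            variance = m2 / (n - 1)
            half_width = z * math.sqrt(variance / n)
            yield n, mean, mean - half_width, mean + half_width


# Synthetic table whose true average is close to 1000 (illustrative data only).
data_rng = random.Random(42)
table = [data_rng.gauss(1000.0, 50.0) for _ in range(10_000)]

reports = list(online_mean_estimates(table, report_every=1000))
for rows_seen, estimate, low, high in reports:
    print(f"{rows_seen:>6} rows: {estimate:8.2f}  [{low:8.2f}, {high:8.2f}]")
```

Each report is available long before the scan finishes, and the half-width falls roughly in proportion to one over the square root of the number of rows seen. A real system would also have to handle group-by, joins, and data that is not stored in random order.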

Online aggregation (OLA) was proposed in 1997 by Hellerstein, Haas and Wang[1] for group-by aggregate queries over a single table. Haas and Hellerstein later showed how to evaluate joins in an online fashion[2]. In 2007, Jermaine et al. designed and implemented a prototype database system called Database-Online (DBO) that computes group-by aggregate queries over multiple tables in an online and, more importantly, scalable fashion[3]. All of these approaches to OLA rely on random sampling, which is non-trivial in a distributed environment because of the inspection paradox of renewal-reward theory. In 2011, Pansare et al. proposed a Bayesian model to deal with the inspection paradox and implemented online aggregation for a MapReduce-like environment[4].
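A toy simulation can show why the inspection paradox makes sampling harder in a distributed setting. It is a sketch under strong assumptions and is not the model of Pansare et al.: each worker processes blocks one after another, a block's contribution to the aggregate equals its processing time, processing times are exponential with mean 1, and only blocks finished before a snapshot deadline are reported. The names completed_blocks, num_workers, and deadline are hypothetical.

```python
import random


def completed_blocks(num_workers, deadline, rng):
    """Return the values of all blocks that finished before `deadline`.

    Assumption of this sketch: a block's value equals its processing time,
    so slow blocks are exactly the blocks with large values.
    """
    finished = []
    for _ in range(num_workers):
        clock = 0.0
        while True:
            cost = rng.expovariate(1.0)  # true mean block value is 1.0
            if clock + cost > deadline:
                break  # block still in progress at the snapshot: not reported
            clock += cost
            finished.append(cost)
    return finished


rng = random.Random(7)
finished = completed_blocks(num_workers=2000, deadline=3.0, rng=rng)
snapshot_mean = sum(finished) / len(finished)
true_mean = 1.0

print(f"blocks finished by the deadline: {len(finished)}")
print(f"mean value of finished blocks:   {snapshot_mean:.3f}")
print(f"true mean block value:           {true_mean:.3f}")
```

The block that is in progress at the snapshot tends to be longer than average, so leaving it out makes the finished blocks look smaller than the population they were drawn from. Treating the finished blocks as a uniform random sample therefore underestimates the aggregate in this setup, which is the kind of bias an estimator for such an environment has to correct.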

References

  1. Hellerstein, Joseph M.; Haas, Peter J.; Wang, Helen J. (June 1997). "Online aggregation". SIGMOD Rec. 26: 171–182. doi:10.1145/253262.253291. http://doi.acm.org/10.1145/253262.253291. Retrieved 30 September 2011.
  2. Haas, Peter; Hellerstein, Joseph (June 1999). "Ripple joins for online aggregation". SIGMOD Rec. 28 (2): 287–298. doi:10.1145/304181.304208. http://doi.acm.org/10.1145/304181.304208. Retrieved 30 September 2011.
  3. Jermaine, Chris; Arumugam, Subramanian; Pol, Abhijit; Dobra, Alin (2007). "Scalable approximate query processing with the DBO engine". SIGMOD: 725–736. doi:10.1145/1247480.1247560. http://doi.acm.org/10.1145/1247480.1247560. Retrieved 30 September 2011.
  4. Pansare, Niketan; Borkar, Vinayak; Jermaine, Chris; Condie, Tyson (August 2011). "Online Aggregation for Large MapReduce Jobs". VLDB. http://www.cs.rice.edu/~np6/Papers/OnlineAggregationForLargeMapReduceJobs.pdf. Retrieved 30 September 2011.

Wikimedia Foundation. 2010.
