Count-Min sketch

Count-Min sketch

The Count-Min sketch (or CM sketch) is a probabilistic sub-linear space streaming algorithm which can be used to summarize a data stream in many different ways. The algorithm was invented in 2003 by Graham Cormode and S. Muthu Muthukrishnan[1].

Contents

Algorithm

Setup

The data structure is parameterized by the constants w and d which determine the time and space needs and the probability of error of the queries. The algorithm needs a two dimensional array with width w and depth d called here count. A series of d hash functions must be randomly drawn from a pairwise independent hash function family.

For later convenience we assign w = \lceil e/\epsilon \rceil and  d = \lceil \ln{1/\delta} \rceil.

Update

When a new value a arrives we update as follows: \forall j : 1 \leq j \leq d,  count[j,h_j(a)] \leftarrow count[j,h_j(a)] + a . That is, for each row we take the corresponding hash function, apply it to the newly received value and add one to the column corresponding to the hash value.

Query

The array can then be used to estimate any of several different statistics at any point. If we want to estimate, for instance, the number of times a specific value i appeared so far in the stream we would compute min jcount[j,hj(i)] (this assume all added values are positive). This estimate has the guarantee that \hat a_i \leq a_i + \epsilon |a| with probability 1 − δ.

Small modifications to the data structure can be used to sketch other different stream statistics.

External links

References

  1. ^ Cormode, Graham; S. Muthukrishnan (2004). "An Improved Data Stream Summary: The Count-Min Sketch and its Applications". J. Algorithms 55: 29–38. 

Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • literature — /lit euhr euh cheuhr, choor , li treuh /, n. 1. writings in which expression and form, in connection with ideas of permanent and universal interest, are characteristic or essential features, as poetry, novels, history, biography, and essays. 2.… …   Universalium

  • Timothy McVeigh — For the United States Navy sailor, see Timothy R. McVeigh. Timothy McVeigh FBI mugshot of McVeigh Born April 23, 1968(1968 04 23) Buffalo, New York, U.S. Died June 11, 2001( …   Wikipedia

  • Chicago Bears — Current season Established 1919[1] Play in Soldier Field Chicago, Illinois Headquartered in Lake Forest, Illinois …   Wikipedia

  • HEBREW GRAMMAR — The following entry is divided into two sections: an Introduction for the non specialist and (II) a detailed survey. [i] HEBREW GRAMMAR: AN INTRODUCTION There are four main phases in the history of the Hebrew language: the biblical or classical,… …   Encyclopedia of Judaism

  • Annie Girardot — Pour les articles homonymes, voir Girardot. Annie Girardot …   Wikipédia en Français

  • Frankenstein in popular culture — lists many ways the novel Frankenstein , and Frankenstein s monster, have influenced film, TV, games and popular culture in general and the many derivative works it has inspired.Film derivativesilent EraThe first film adaptation of the tale,… …   Wikipedia

  • performing arts — arts or skills that require public performance, as acting, singing, or dancing. [1945 50] * * * ▪ 2009 Introduction Music Classical.       The last vestiges of the Cold War seemed to thaw for a moment on Feb. 26, 2008, when the unfamiliar strains …   Universalium

  • Spain — /spayn/, n. a kingdom in SW Europe. Including the Balearic and Canary islands, 39,244,195; 194,988 sq. mi. (505,019 sq. km). Cap.: Madrid. Spanish, España. * * * Spain Introduction Spain Background: Spain s powerful world empire of the 16th and… …   Universalium

  • List of Russian explorers — The Russian Empire at its peak in 1866, including the spheres of influence; this territorial expansion largely corresponds to the extent of contiguous exploration by Russians. This is a list of explorers from the Russian Federation, Soviet Union …   Wikipedia

  • Ed Grimley — Edward Mayhoff Ed Grimley is a fictional character introduced on the television series SCTV and later used on Saturday Night Live. He was created and played by Martin Short. He is a hyperactive, neurotic nerd with a large frontal cowlick who is… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”