Discounted cumulative gain

Discounted cumulative gain

Discounted cumulative gain (DCG) is a measure of effectiveness of a Web search engine algorithm or related applications, often used in information retrieval. Using a graded relevance scale of documents in a search engine result set, DCG measures the usefulness, or gain, of a document based on its position in the result list. The gain is accumulated from the top of the result list to the bottom with the gain of each result discounted at lower ranks.[1]

Contents

Mathematical Details

Two assumptions are made in using DCG and its related measures.

  1. Highly relevant documents are more useful when appearing earlier in a search engine result list (have higher ranks)
  2. Highly relevant documents are more useful than marginally relevant documents, which are in turn more useful than irrelevant documents.

DCG originates from an earlier, more primitive, measure called Cumulative Gain.

Cumulative Gain

Cumulative Gain (CG) is the predecessor of DCG and does not include the position of a result in the consideration of the usefulness of a result set. In this way, it is the sum of the graded relevance values of all results in a search result list. The CG at a particular rank position p is defined as:

 \mathrm{CG_{p}} = \sum_{i=1}^{p} rel_{i}

Where reli is the graded relevance of the result at position i.

The value computed with the CG function is unaffected by changes in the ordering of search results. That is, moving a highly relevant document di above a higher ranked, less relevant, document dj does not change the computed value for CG. Based on the two assumptions made above about the usefulness of search results, DCG is used in place of CG for a more accurate measure.

Discounted Cumulative Gain

The premise of DCG is that highly relevant documents appearing lower in a search result list should be penalized as the graded relevance value is reduced logarithmically proportional to the position of the result. The discounted CG accumulated at a particular rank position p is defined as:

 \mathrm{DCG_{p}} = rel_{1} + \sum_{i=2}^{p} \frac{rel_{i}}{\log_{2}i}

There has not been shown any theoretically sound justification for using a logarithmic reduction factor[2] other than the fact that it produces a smooth reduction. An alternative formulation of DCG[3] places stronger emphasis on retrieving relevant documents:

 \mathrm{DCG_{p}} = \sum_{i=1}^{p} \frac{ 2^{rel_{i}} - 1 }{ \log_{2}(1+i)}

The function is equivalent to the previous DCG function when the relevance values of documents are binary; rel_{i} \in \{0,1\}.

Normalized DCG

Search result lists vary in length depending on the query. Comparing a search engine's performance from one query to the next cannot be consistently achieved using DCG alone, so the cumulative gain at each position for a chosen value of p should be normalized across queries. This is done by sorting documents of a result list by relevance, producing an ideal[disambiguation needed ] DCG at position p. For a query, the normalized discounted cumulative gain, or nDCG, is computed as:

 \mathrm{nDCG_{p}} = \frac{DCG_{p}}{IDCG{p}}

The nDCG values for all queries can be averaged to obtain a measure of the average performance of a search engine's ranking algorithm. Note that in a perfect ranking algorithm, the DCGp will be the same as the IDCGp producing an nDCG of 1.0. All nDCG calculations are then relative values on the interval 0.0 to 1.0 and so are cross-query comparable.

The main difficulty encountered in using nDCG is the unavailability of an ideal ordering of results when only partial relevance feedback is available.

Example

Presented with a list of documents in response to a search query, an experiment participant is asked to judge the relevance of each document to the query. Each document is to be judged on a scale of 0-3 with 0 meaning irrelevant, 3 meaning completely relevant, and 1 and 2 meaning "somewhere in between". For the documents ordered by the ranking algorithm as

D1,D2,D3,D4,D5,D6

the user provides the following relevance scores:

3,2,3,0,1,2

That is: document 1 has a relevance of 3, document 2 has a relevance of 2, etc. The Cumulative Gain of this search result listing is:

 \mathrm{CG_{p}} = \sum_{i=1}^{p} rel_{i} = 3+2+3+0+1+2 = 11

Changing the order of any two documents does not affect the CG measure. If D3 and D4 are switched, the CG remains the same, 11. DCG is used to emphasize highly relevant documents appearing early in the result list. Using the logarithmic scale for reduction, the DCG for each result in order is:

i reli log 2i  \frac{rel_{i}}{\log_{2}i}
1 3 0 N/A
2 2 1 2
3 3 1.59 1.887
4 0 2.0 0
5 1 2.32 0.431
6 2 2.59 0.772

So the DCG6 of this ranking is:

 \mathrm{DCG_{6}} = rel_{1} + \sum_{i=2}^{p} \frac{rel_{i}}{\log_{2}i} = 3 + (2+1.887+0+0.431+0.772) = 8.09

Now a switch of D3 and D4 results in a reduced DCG because a less relevant document is placed higher in the ranking; that is, a more relevant document is discounted more by being placed in a lower rank.

The performance of this query to another is incomparable in this form since the other query may have more results, resulting in a larger overall DCG which may not necessarily be better. In order to compare, the DCG values must be normalized.

To normalize DCG values, an ideal ordering for the given query is needed. For this example, that ordering would be the monotonically decreasing sort of the relevance judgments provided by the experiment participant, which is:

3,3,2,2,1,0

The DCG of this ideal ordering, or IDCG, is then:

IDCG6 = 8.693

And so the nDCG for this query is given as:

 \mathrm{nDCG_{6}} = \frac{DCG_{6}}{IDCG{6}} = \frac{8.09}{8.693} = 0.9306

References

  1. ^ Kalervo Jarvelin, Jaana Kekalainen: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)
  2. ^ B. Croft, D. Metzler, and T. Strohman (2009). Search Engines: Information Retrieval in Practice. Addison Wesley". 
  3. ^ http://learningtorankchallenge.yahoo.com/instructions.php

Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Cumulative gain — may refer to: discounted cumulative gain (information retrieval). cumulative elevation gain (running, cycling, and mountaineering) This disambiguation page lists articles associated with the same title. If an …   Wikipedia

  • Information retrieval — This article is about information retrieval in general. For the fictional government department, see Brazil (film). Information retrieval (IR) is the area of study concerned with searching for documents, for information within documents, and for… …   Wikipedia

  • Обучение ранжированию — (англ. learning to rank или machine learned ranking, MLR)[1]  это класс задач машинного обучения с учителем, заключающихся в автоматическом подборе ранжирующей модели по обучающей выборке, состоящей из множества списков и заданных… …   Википедия

  • DCG — may refer to: Definite clause grammar, a means of expressing grammatical relationships Discounted Cumulative Gain, a performance measure for search engine ranking algorithms Discontinuous gas exchange, a physiological pattern of respiratory gas… …   Wikipedia

  • Business and Industry Review — ▪ 1999 Introduction Overview        Annual Average Rates of Growth of Manufacturing Output, 1980 97, Table Pattern of Output, 1994 97, Table Index Numbers of Production, Employment, and Productivity in Manufacturing Industries, Table (For Annual… …   Universalium

  • Economic Affairs — ▪ 2006 Introduction In 2005 rising U.S. deficits, tight monetary policies, and higher oil prices triggered by hurricane damage in the Gulf of Mexico were moderating influences on the world economy and on U.S. stock markets, but some other… …   Universalium

  • Media and Publishing — ▪ 2007 Introduction The Frankfurt Book Fair enjoyed a record number of exhibitors, and the distribution of free newspapers surged. TV broadcasters experimented with ways of engaging their audience via the Internet; mobile TV grew; magazine… …   Universalium

  • Computers and Information Systems — ▪ 2009 Introduction Smartphone: The New Computer.       The market for the smartphone in reality a handheld computer for Web browsing, e mail, music, and video that was integrated with a cellular telephone continued to grow in 2008. According to… …   Universalium

  • international relations — a branch of political science dealing with the relations between nations. [1970 75] * * * Study of the relations of states with each other and with international organizations and certain subnational entities (e.g., bureaucracies and political… …   Universalium

  • Life insurance — The foundation of life insurance is the recognition of the value of a human life and the possibility of indemnification for the loss of that value. F. C. Oviatt, Economic place of insurance and its relation to society[1] Life insurance is a… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”