- Relevance feedback
Relevance
feedback is a feature of someinformation retrieval systems. The idea behind relevance feedback is to take the results that are initially returned from a given query and to use information about whether or not those results are relevant to perform a new query. We can usefully distinguish between three types of feedback: explicit feedback, implicit feedback, and blind or "pseudo" feedback.Explicit feedback
Explicit feedback is obtained from assessors of relevance indicating the relevance of a document retrieved for a query. This type of feedback is defined as explicit only when the assessors (or other users of a system) know that the feedback provided is interpreted as relevance judgments.
Users may indicate relevance explicitly using a "binary" or "graded" relevance system. Binary relevance feedback indicates that a document is either relevant or irrelevant for a given query. Graded relevance feedback indicates the relevance of a document to a query on a scale using numbers, letters, or descriptions (such as "not relevant", somewhat relevant", "relevant", or "very relevant"). Graded relevance may also take the form of a cardinal ordering of documents created by an assessor; that is, the assessor places documents of a result set in order of (usually descending) relevance.
A performance
metric which became popular around 2005 to measure the usefulness of a rankingalgorithm based on the explicit relevance feedback isNDCG . Other measures includeprecision at "k" and mean average precision.Implicit feedback
Implicit feedback is inferred from user behavior, such as noting which documents they do and do not select for viewingFact|date=May 2008, the duration of time spent viewing a documentFact|date=May 2008, or page browsing or scrolling actionsFact|date=May 2008.
The key differences of implicit relevance feedback from that of explicit includeFact|date=May 2008:
# the user is not assessing relevance for the benefit of the IR system, but only satisfying their own needs and
# the user is not necessarily informed that their behavior (selected documents) will be used as relevance feedbackBlind feedback
Blind or "pseudo" relevance feedback is obtained by assuming that the top "k" documents in the result set containing "n" results (usually where "k" << "n") are relevant.
Blind feedback automates the manual part of relevance feedback and has the advantage that assessors are not required.
Using relevance information
Relevance information is utilized by using the contents of the relevant documents to either adjust the weights of terms in the original query, or by using those contents to add words to the query. Relevance feedback is often implemented using the
Rocchio algorithm .Further reading
[http://www.umiacs.umd.edu/~jimmylin/LBSC796-INFM718R-2006-Spring/lecture7.ppt Relevance feedback lecture notes] - Jimmy Lin's lecture notes, adapted from Doug Oard's
[http://www.ischool.berkeley.edu/~hearst/irbook/chapters/chap10.html] - chapter from modern information retrieval
Wikimedia Foundation. 2010.