HITS algorithm

HITS algorithm

Hyperlink-Induced Topic Search (HITS) is a link analysis algorithm that rates Web pages, developed by Jon Kleinberg. It determines two values for a page: its authority, which estimates the value of the content of the page, and its hub value, which estimates the value of its links to other pages.

Algorithm details

Authority and hub values are defined in terms of one another in a mutual recursion. An authority value is computed as the sum of the scaled hub values that point to that page. A hub value is the sum of the scaled authority values of the pages it points to. Some implementations also consider the relevance of the linked pages.

HITS, like Page and Brin's PageRank, is an iterative algorithm based on the linkage of the documents on the web. However it does have some major differences:
* It is executed at query time, not at indexing time, with the associated hit on performance that accompanies query-time processing.
* It is not commonly used by search engines. (Though a similar algorithm was said to be used by Teoma [cite web |url=http://en.wikipedia.org/w/index.php?title=Jon_Kleinberg&action=edit&section=1|title=Teoma vs. Google, Round 2|publisher=Search Engine Watch] , which was acquired by Ask.com.)
* It computes two scores per document, hub and authority, as opposed to a single score.
* It is processed on a small subset of ‘relevant’ documents, not all documents as was the case with PageRank.

ee also

*PageRank
*SALSA
*TrustRank

References

*cite journal |last=Kleinberg |first=Jon |authorlink= |coauthors= |year=1999 |month= |title=Authoritative sources in a hyperlinked environment |journal=Journal of the ACM |volume=46 |issue=5 |pages=604–632 |id= |url=http://www.cs.cornell.edu/home/kleinber/auth.pdf |accessdate= |quote=
*cite book |chapter=Improvement of HITS-based Algorithms on Web Documents |title=Proceedings of the 11th International World Wide Web Conference (WWW 2002) |last=Li |first=L. |authorlink= |coauthors=Shang, Y.; Zhang, W. |year=2002 |publisher= |location=Honolulu, HI |isbn=1880672200 |pages= |chapterurl=http://www2002.org/CDROM/refereed/643/

External links

*US patent|6112202


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • HITS — may refer to:* Headend in the Sky, a Comcast Media Center service which provides digital cable television services * High Way In The Sky, Synthetic vision system (SVS) * HITS algorithm, Hypertext Induced Topic Selection * Homicide Investigation… …   Wikipedia

  • Lanczos algorithm — The Lanczos algorithm is an iterative algorithm invented by Cornelius Lanczos that is an adaptation of power methods to find eigenvalues and eigenvectors of a square matrix or the singular value decomposition of a rectangular matrix. It is… …   Wikipedia

  • Hilltop algorithm — The Hilltop algorithm is an algorithm created by Krishna Bharat while he was at Compaq Systems Research Center and George A. Mihăilă, then at the University of Toronto. The algorithm is used to find topic relevant documents to the particular… …   Wikipedia

  • Steinhaus-Johnson-Trotter algorithm — The Steinhaus Johnson Trotter algorithm or Johnson Trotter algorithm is an algorithm which generates permutations by transposing elements.AlgorithmThe algorithm is setup with the idea that only one set of neighbors needs to swap positions and… …   Wikipedia

  • Boyer–Moore–Horspool algorithm — In computer science, the Boyer–Moore–Horspool algorithm or Horspool s algorithm is an algorithm for finding substrings in strings. It is a simplification of the Boyer Moore algorithm which is related to the Knuth Morris Pratt algorithm. The… …   Wikipedia

  • Jon Kleinberg — Infobox Scientist image width = 150px name = Jon Kleinberg box width = caption = birth date = October 1971 birth place = Boston, Massachusetts death date = death place = residence = citizenship = nationality = flagicon|USA American ethnicity =… …   Wikipedia

  • PageRank — is a link analysis algorithm that assigns a numerical weighting to each element of a hyperlinked set of documents, such as the World Wide Web, with the purpose of measuring its relative importance within the set. The algorithm may be applied to… …   Wikipedia

  • CheiRank — Nodes with links in the plane of PageRank and CheiRank The CheiRank is an eigenvector with a maximal real eigenvalue of the Google matrix G * constructed for a directed network with the inverted directions of links. It is similar to the PageRank… …   Wikipedia

  • Search engine technology — Modern web search engines are complex software systems using the technology that has evolved over the years. The largest search engines such as Google and Yahoo! utilize tens or hundreds of thousands of computers to process billions of web pages… …   Wikipedia

  • Hit — and the acronym HIT may refer to:;Science/Engineering * Homogeneous Isotropic Turbulence, Fluid Dynamics ;Sport * Hit (baseball) * High intensity training, a form of strength training;Music * Hit (album), by Peter Gabriel * Hits (disambiguation) …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”