# H-index

H-index

The $h$-index is an index that quantifies both the actual scientific productivity and the apparent scientific impact of a scientist. The index is based on the set of the scientist's most cited papers and the number of citations that they have received in other people's publications. The index can also be applied to the productivity and impact of a group of scientists, such as a department or university or country. The index was suggested by Jorge E. Hirsch, a physicist at UCSD, as a tool for determining theoretical physicists' relative quality [cite journal |last=Hirsch |first=J. E. |authorlink= |coauthors= |year=2005 |month= |title=An index to quantify an individual's scientific research output |journal=PNAS |volume=102 |issue=46 |pages=16569&ndash;16572 |doi=10.1073/pnas.0507655102 |url= |accessdate= |quote= ] and is sometimes called the "Hirsch index" or "Hirsch number".

Hirsch suggests that, for physicists, a value for h of about 10-12 might be a useful guideline for tenure decisions at major research universities. A value of about 18 could mean a full professorship, 15–20 could mean a fellowship in the American Physical Society, and 45 or higher could mean membership in the United States National Academy of Sciences. [cite news |first=Ivars |last=Peterson |authorlink= |coauthors= |title=Rating Researchers |url=http://www.sciencenews.org/articles/20051203/mathtrek.asp |work=Science News Online |publisher= |date=December 2, 2005 |accessdate= ]

Definition and purpose

The index is based on the distribution of citations received by a given researcher's publications. Hirsch writes::"A scientist has index h if h of his Np papers have at least h citations each, and the other (Np - h) papers have at most h citations each."

In other words, a scholar with an index of "h" has published "h" papers each of which has been cited by others at least "h" times. [cite web |url=http://pda.physorg.com/lofi-news-hindex-says-papers_7971.html |title=Physicist Proposes New Way to Rank Scientific Output |accessdate=2008-07-03 |work= |publisher= |date= ] Thus, the "h"-index reflects both the number of publications and the number of citations per publication. The index is designed to improve upon simpler measures such as the total number of citations or publications. The index works properly only for comparing scientists working in the same field; citation conventions differ widely among different fields.

The "h"-index serves as an alternative to more traditional journal impact factor metrics in the evaluation of the impact of the work of a particular researcher. Because only the most highly cited articles contribute to the "h"-index, its determination is a relatively simpler process. Hirsch has demonstrated that $h$ has high predictive value for whether a scientist has won honors like National Academy membership or the Nobel Prize. In physics, a moderately productive scientist should have an $h$ equal to the number of years of service while biomedical scientists tend to have higher values.

Calculating $h$

The $h$-index can be manually determined using free Internet databases, such as Google Scholar. Subscription-based databases such as Scopus and the Web of Knowledge provide automated calculators. Each database is likely to produce a different $h$ for the same scholar, because of different coverage in each DB: Google Scholar has more citations than Scopus and Web of Science but each of their smaller citation collections tends to be more accurate.

The topic has been studied in some detail by Lokman I. Meho and Kiduk Yang.cite journal |last=Meho |first=L. I. |authorlink= |coauthors=Yang, K. |year=2007 |month= |title=Impact of Data Sources on Citation Counts and Rankings of LIS Faculty: Web of Science vs. Scopus and Google Scholar |journal=Journal of the American Society for Information Science and Technology |volume=58 |issue=13 |pages=2105&ndash;2125 |doi=10.1002/asi.20677 |url= |accessdate= |quote= ] Web of Knowledge was found to have strong coverage of journal publications, but poor coverage of high impact conferences (a particular problem for Computer Science based scholars); Scopus has better coverage of conferences, but poor coverage of publications prior to 1992; Google Scholar has the best coverage of conferences and most journals (though not all), but like Scopus has limited coverage of pre-1990 publications. Google Scholar has also been criticized for including gray literature in its citation counts. [cite journal |last=Jacsó |first=Péter |authorlink= |coauthors= |year=2006 |month= |title=Dubious hit counts and cuckoo's eggs |journal=Online Information Review |volume=30 |issue=2 |pages=188&ndash;193 |doi=10.1108/14684520610659201 |url= |accessdate= |quote= ] However, the Meho and Yang study showed that the majority of the additional citation sources Google Scholar uses are legitimate refereed forums. It has been suggested that in order to deal with the sometimes wide variation in $h$ for a single academic measured across the possible citation databases, that one could assume false negatives in the databases are more problematic than false positives and take the maximum $h$ measured for an academic. [cite journal |last=Sanderson |first=Mark |authorlink= |coauthors= |year=2008 |month= |title=Revisiting "h" measured on UK LIS and IR academics |journal=Journal of the American Society for Information Science and Technology |volume=59 |issue=7 |pages=1184&ndash;1190 |doi=10.1002/asi.20771 |url= |accessdate= |quote= ]

It should be remembered that the content of all of the databases, particularly Google Scholar, continually changes, so any research on the content of the databases risks going out of date.

The "h"-index was intended to address the main disadvantages of other bibliometric indicators, such as total number of papers or total number of citations. Total number of papers does not account for the quality of scientific publications, while total number of citations can be disproportionately affected by participation in a single publication of major influence. The "h"-index is intended to measure simultaneously the quality and sustainability of scientific output, as well as, to some extent, the diversity of scientific research. The "h"-index is much less affected by methodological papers proposing successful new techniques, methods or approximations, which can be extremely highly cited. For example, one of the most cited condensed matter theorists, John P. Perdew, has been very successful in devising new approximations within the widely used density functional theory. He has published 3 papers cited more than 5000 times and 2 cited more than 4000 times. Several thousand papers utilizing the density functional theory are published every year, most of them citing at least one paper of J.P. Perdew. His total citation count is close to 39 000, while his "h"-index is large, 51, but not unique. In contrast, the condensed-matter theorist with the highest "h"-index (94), Marvin L. Cohen, has a lower citation count of 35 000. One can argue that in this case the "h"-index reflects the broader impact of Cohen's papers in solid-state physics due to his larger number of highly-cited papers.

Criticism

There are a number of situations in which $h$ may provide misleading information about a scientist's output: [cite journal |last=Wendl |first=Michael |authorlink= |coauthors= |year=2007 |month= |title=H-index: however ranked, citations need context |journal=Nature |volume=449 |issue=7161 |pages=403 |doi=10.1038/449403b |url= |accessdate= |quote= ]

*The "h"-index is bounded by the total number of publications. This means that scientists with a short career are at an inherent disadvantage, regardless of the importance of their discoveries. For example, Évariste Galois' "h"-index is 2, and will remain so forever. Had Albert Einstein died in early 1906, his "h"-index would be stuck at 4 or 5, despite his being widely acknowledged as one of the most important physicists, even considering only his publications to that date.

*The "h"-index does not consider the "context" of citations. For example, citations in a paper are often made simply to flesh-out an introduction, otherwise having no other significance to the work. "h" also does not resolve other contextual instances: citations made in a negative context and citations made to fraudulent or retracted work. (This is true for other metrics using citations, not just for the h-index.)

*The "h"-index does not account for confounding factors. These include the practice of "gratuitous authorship", which is still common in some research cultures, the so-called Matthew effect, and the favorable citation bias associated with review articles.

*The "h"-index has been found to have slightly less predictive accuracy and precision than the simpler measure of mean citations per paper.cite journal | author = Sune Lehmann, Andrew D. Jackson, Benny E. Lautrup | title=Measures for measures
journal = Nature | volume=444 |issue=7122 |pages=1003–4 |year=2006 | pmid=17183295| doi=10.1038/4441003a
] However, this finding was contradicted by another study. [cite journal |author=Hirsch J. E. |title=Does the h-index have predictive power? |journal=PNAS |volume=104 |pages=19193-19198 |year=2007 |doi=10.1073/pnas.0707962104 ]

*The "h"-index is a natural number and thus lacks discriminatory power. Ruane and Tol therefore propose a rational "h"-index that interpolates between "h" and "h"+1. [Cite journal |author=Frances Ruane & Richard S. J. Tol |title=Rational (successive) h -indices: An application to economics in the Republic of Ireland |journal=Scientometrics |volume=75 |issue=2 |year=2008 |pages=395&ndash;405|doi=10.1007/s11192-007-1869-7 ]

*While the "h"-index de-emphasizes singular successful publications in favor of sustained productivity, it may do so too strongly. Two scientists may have the same "h"-index, say, $h=30$, but one has 20 papers that have been cited more than 1000 times and the other has none. Clearly scientific output of the former is more valuable. Several recipes to correct for that have been proposed, such as the g-index, but none has gained universal support.

*The "h"-index is affected by limitations in citation data bases. Some automated searching processes find citations to papers going back many years, while others find only recent papers or citations. This issue is less important for those whose publication record started after automated indexing began around 1990. Citation data bases contain some citations that are not quite correct and therefore will not properly match to the correct paper or author.

*The "h"-index does not account for the number of authors of a paper. If the impact of a paper is the number of citations it receives, it might be logical to divide that impact by the number of authors involved. (Some authors will have contributed more than others, but in the absence of information on contributions, the simplest assumption is to divide credit equally.) Not taking into account the number of authors could allow gaming the "h"-index and other similar indices: for example, two equally capable researchers could agree to share authorship on all their papers, thus increasing each of their "h"-indices. Even in the absence of such explicit gaming, the "h"-index and similar indices tend to favor fields with larger groups, e.g. experimental over theoretical.

Comparison with other metrics

The $h$-index grows as citations accumulate and thus it depends on the 'academic age' of a researcher. Using papers published within a particular time period, e.g. within the last 10 years, would allow to measure the current productivity as opposed to the lifetime achievement.

Various proposals to modify the "h"-index in order to emphasize different features have been made. [cite journal |last=Sidiropoulos |first=Antonis |authorlink= |coauthors=Katsaros, Dimitrios; Manolopoulos, Yannis |year=2007 |month= |title= Generalized Hirsch h-index for disclosing latent facts in citation networks |journal=Scientometrics |volume=72 |issue=2 |pages=253&ndash;280 |doi=10.1007/s11192-007-1722-z |url= |accessdate= |quote= ] [g-index] [Cite journal | url = http://bmj.com/cgi/eletters/331/7528/1339-c#123188 | title = V-index: A fairer index to quantify an individual's research output capacity | author = Jayant S Vaidya | journal = BMJ | year = 2005 | month = December | volume = 331 | pages = 339-c-1340-c ] [Katsaros D., Sidiropoulos A., Manolopous Y., (2007), [http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS//Vol-245/paper3.pdf Age Decaying H-Index for Social Network of Citations] in Proceedings of [http://ceur-ws.org/Vol-245/ Workshop on Social Aspects of the Web Poznan, Poland, April 27, 2007] ]

ee also

*Bibliometrics
*impact factor
*Erdős number
*h-b index
*g-index
*Eddington number (cycling) An earlier metric of the same form.

References

Publications related to h-index

*cite journal |last=Ball |first=Philip | title=Index aims for fair ranking of scientists | year = 2005 | journal = Nature | volume = 436 | issue = 7053 | url = | pages = 900 | doi=10.1038/436900a
*cite journal |author=Kelly, C. D.; Jennions, M. D. |title=The h index and career assessment by numbers |journal=Trends Ecol. Evol. (Amst.) |volume=21 |issue=4 |pages=167–70 |year=2006 |pmid=16701079 |doi=10.1016/j.tree.2006.01.005
*cite journal |author=Lehmann, S.; Jackson, A. D.; Lautrup, B. E. |title=Measures for measures |journal=Nature |volume=444 |issue=7122 |pages=1003–4 |year=2006 |pmid=17183295 |doi=10.1038/4441003a
*cite journal |last=Sidiropoulos |first=Antonis |authorlink= |coauthors=Katsaros, Dimitrios; Manolopoulos, Yannis |year=2007 |month= |title= Generalized Hirsch h-index for disclosing latent facts in citation networks |journal=Scientometrics |volume=72 |issue=2 |pages=253&ndash;280 |doi=10.1007/s11192-007-1722-z |url= |accessdate= |quote=
*cite journal |last=Soler |first=José M. |authorlink= |coauthors= |year=2007 |month= |title=A rational indicator of scientific creativity |journal=Journal of Informetrics |volume=1 |issue=2 |pages=123&ndash;130 |doi=10.1016/j.joi.2006.10.004 |url= |accessdate= |quote=
*cite journal |last=Symonds |first=M. R. |coauthors="et al." |title=Gender differences in publication output: towards an unbiased metric of research performance |journal=PLoS ONE |volume=1 |issue= |pages=e127 |year=2006 |pmid=17205131 |doi=10.1371/journal.pone.0000127
*cite journal |last=Taber |first=Douglass F. |authorlink= |coauthors= |year=2005 |month= |title=Quantifying Publication Impact |journal=Science |volume=309 |issue=5744 |pages=2166a |doi=10.1126/science.309.5744.2166a |url= |accessdate= |quote=

Computing the h-index

* Quadsearch calculates an [http://quadsearch.csd.auth.gr/index.php?lan=1&s=2 estimate of h] on Google Scholar. The h-factor is provided after one carries out the requested search.
* A [http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=9710&objectType=file MATLAB script] to compute the h-index.
* [http://www.harzing.com/pop.htm Publish or Perish] calculates various statistics, including the h-index and the g-index using Google Scholar data. This service requires downloading a program, which is available in PC and Linux formats (no Mac format)
* The [http://hview.limsi.fr HView visualizer] showing a sorted histogram of citations showing the h-number as the biggest square included in the histogram
* Yet another [http://insitu.lri.fr/~roussel/projects/scholarindex/ web script] highlighting the article(s) to cite to raise the h-number

Lists of h-indices

* A [http://www.rsc.org/chemistryworld/News/2007/April/23040701.asp long list of chemists] with high h-index values
* The [http://www.cs.ucla.edu/~palsberg/h-number.html H-index for computer science]
* [http://blog.everydayscientist.com/?p=10 H values for Stanford p-chem professors] from [http://blog.everydayscientist.com/ "The Everyday Scientist"]
* [http://www.scimagojr.com/ H index for Journals and Countries]

Wikimedia Foundation. 2010.

Нужна курсовая?

### Look at other dictionaries:

• index — [ ɛ̃dɛks ] n. m. • 1503; mot lat. « indicateur » 1 ♦ Doigt de la main le plus proche du pouce (ainsi nommé parce que ce doigt sert à indiquer, à montrer). Les deux index. Prendre un objet entre le pouce et l index. « Levant l index à sa bouche,… …   Encyclopédie Universelle

• Index (base de donnees) — Index (base de données) En informatique, et en particulier dans le contexte des bases de données, un index est un élément de redondance que l on va spécifier pour permettre au Système de Gestion de Base de Données d optimiser certaines requêtes.… …   Wikipédia en Français

• Index Librorum Prohibitorum — Index Librorum Prohibitorum, 1564. L Index librorum prohibitorum (index des livres interdits) aussi appelé Index expurgatorius, Index librorum prohibitorum juxta exemplar romanum jussu sanctissimi domini nostri est une liste d ouvrages que les… …   Wikipédia en Français

• Index — In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of how… …   The Collaborative International Dictionary of English

• Index error — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• Index expurgatorius — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• Index finger — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• index finger — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• Index glass — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• Index hand — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English

• Index Librorum Prohibitorum — Index In dex, n.; pl. E. {Indexes}, L. {Indices}(?). [L.: cf. F. index. See {Indicate}, {Diction}.] [1913 Webster] 1. That which points out; that which shows, indicates, manifests, or discloses; as, the increasing unemployment rate is an index of …   The Collaborative International Dictionary of English