GeneRIF

GeneRIF

A GeneRIF or Gene Reference Into Function is a short (255 characters or less) statement about the function of a gene. GeneRIFs provide a simple mechanism for allowing scientists to add to the functional annotation of genes described in the [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene Entrez Gene] database. In practice, "function" is construed quite broadly. For example, there are GeneRIFs that discuss the role of a gene in a disease, GeneRIFs that point the viewer towards a review article about the gene, and GeneRIFs that discuss the structure of a gene. However, the stated intent is for GeneRIFs to be about gene function.

GeneRIFs are always associated with specific entries in the Entrez Gene database. Each GeneRIF has a pointer to the PubMed ID (a type of document identifier) of a scientific publication that provides evidence for the statement made by the GeneRIF. GeneRIFs are often extracted directly from the document that is identified by the PubMed ID, very frequently from its title or from its final sentence.

GeneRIFs are usually produced by NCBI indexers, but anyone may submit a GeneRIF.To be processed, a valid Gene ID must exist for the specific gene, or the Gene staff must have assigned an overall Gene ID to the species. The latter case is implemented via records in Gene with the symbol NEWENTRY. Once the Gene ID is identified, only three types of information are required to complete a submission:
# a concise phrase describing a function or functions (less than 255 characters in length, preferably more than a restatement of the title of the paper);
# a published paper describing that function, implemented by supplying the PubMed ID of a citation in PubMed;
# a valid e-mail address (which will remain confidential).

Here are some GeneRIFs taken from Entrez Gene for GeneID 7157, the human gene TP53.The PubMed document identifiers have been omitted from the examples. Note the wide variability with respect to the presence or absence of punctuation and of sentence-initial capital letters.
* p53 and c-erbB-2 may have independent role in carcinogenesis of gall bladder cancer
* Degradation of endogenous HIPK2 depends on the presence of a functional p53 protein.
* p53 codon 72 alleles influence the response to anticancer drugs in cells from aged people by regulating the cell cycle inhibitor p21WAF1
* Logistic regression analysis showed p53 and COX-2 as dependent predictors in pancreatic carcinogenesis, and a reciprocal relationship to neoplastic progression between p53 and COX-2.

GeneRIFs are an unusual type of textual genre, and they have recently been the subject of a number of articles from the natural language processing community.

References

* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&dopt=Abstract&list_uids=14728215&query_hl=5&itool=pubmed_docsum Abstract of Mitchell et al.'s early publication on GeneRIFs]
* Cite conference
author = William Hersh, Ravi Teja Bhupatiraju
url = http://medir.ohsu.edu/~hersh/trec-03-genomics.pdf
year = 2003
title = TREC Genomics Track Overview
Paper describing a Text Retrieval Conference "shared task" involving automatic prediction of GeneRIFs.
* Cite conference
url = http://helix-web.stanford.edu/psb06/lu.pdf
author = Lu, Zhiyong; K. Bretonnel Cohen; and Lawrence Hunter
title = Finding GeneRIFs via Gene Ontology annotations
year = 2006
conference = Proc. Pacific Symposium on Biocomputing 2006
pages = 52-63
Lu et al.'s paper describing a system that automatically suggests GeneRIFs.

Useful links

  • [http://www.ncbi.nlm.nih.gov/projects/GeneRIF/GeneRIFhelp.html NCBI's web page describing GeneRIFs]

Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Genome project — Genome projects are scientific endeavours that ultimately aim to determine the complete genome sequence of an organism (be it an animal, a plant, a fungus, a bacterium, an archaean, a protist or a virus) and to annotate protein coding genes and… …   Wikipedia

  • TREC Genomics — The TREC Genomics track is a workshop held under the auspices of NIST for the purpose of evaluating systems for information retrieval and related technologies in the genomics domain. The TREC Genomics track has taken place annually since 2003,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”