

HomoloGene, a tool of the National Center for Biotechnology Information (NCBI), is a system for the automated detection of homologs (genes whose similarity is attributable to descent from a common ancestor) among the annotated genes of several completely sequenced eukaryotic genomes.

HomoloGene processing begins with an analysis of the proteins from the input organisms. Sequences are compared using blastp [http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi?CMD=Web&LAYOUT=TwoWindows&AUTO_FORMAT=Semiauto&ALIGNMENTS=250&ALIGNMENT_VIEW=Pairwise&CDD_SEARCH=on&CLIENT=web&DATABASE=nr&DESCRIPTIONS=500&ENTREZ_QUERY=%28none%29&EXPECT=10&FILTER=L&FORMAT_OBJECT=Alignment&FORMAT_TYPE=HTML&I_THRESH=0.005&MATRIX_NAME=BLOSUM62&NCBI_GI=on&PAGE=Proteins&PROGRAM=blastp&SERVICE=plain&SET_DEFAULTS.x=41&SET_DEFAULTS.y=5&SHOW_OVERVIEW=on&END_OF_HTTPGET=Yes&SHOW_LINKOUT=yes&GET_SEQUENCE=yes|blastp], then matched and placed into groups, guided by a taxonomic tree built from sequence similarity: closely related organisms are matched first, and more distant organisms are then added to the tree. The protein alignments are mapped back to their corresponding DNA sequences, from which distance metrics such as the Jukes and Cantor (1969) molecular distance and the Ka/Ks ratio can be calculated.
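As an illustration of the distance step, the Jukes and Cantor (1969) correction converts the observed fraction of differing sites p between two aligned DNA sequences into an estimated number of substitutions per site, d = -(3/4) ln(1 - (4/3) p). The Python sketch below is not HomoloGene's own code; the function names and the toy sequences are assumptions made for the example. The Ka/Ks ratio needs codon-aware counting of synonymous and nonsynonymous sites and is not shown.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Fraction of differing sites between two aligned DNA sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = 0
    diffs = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared


def jukes_cantor_distance(p: float) -> float:
    """Jukes-Cantor corrected distance for an observed difference fraction p.

    The correction is undefined for p >= 0.75 (saturation), so infinity
    is returned in that case.
    """
    if p >= 0.75:
        return math.inf
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


# Toy aligned sequences (hypothetical): 1 difference in 10 sites.
p = p_distance("ACGTACGTAC", "ACGTACGTAA")
d = jukes_cantor_distance(p)
```

The corrected distance is always at least as large as p, because it accounts for multiple substitutions at the same site that a simple count of differences cannot see.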

The sequences are matched using a heuristic algorithm that maximizes the score globally, rather than locally, in a bipartite matching (see complete bipartite graph). The statistical significance of each match is then calculated. Cutoffs on bits per position and on Ks values are set to prevent false "orthologs" from being grouped together. "Paralogs" are identified by finding sequences that are closer to each other within a species than to sequences in other species.
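The difference between a locally and a globally maximized matching can be seen on a small example. The sketch below is illustrative only: the score matrix is invented, and the global optimum is found by exhaustive search over a tiny square matrix, which stands in for (and is not) the heuristic HomoloGene uses. Rows are taken to be proteins from one species and columns proteins from another.

```python
from itertools import permutations


def greedy_matching(scores):
    """Locally optimal: repeatedly take the best-scoring pair still available."""
    pairs = sorted(
        ((s, i, j) for i, row in enumerate(scores) for j, s in enumerate(row)),
        reverse=True,
    )
    used_rows, used_cols, matching = set(), set(), []
    for _score, i, j in pairs:
        if i not in used_rows and j not in used_cols:
            matching.append((i, j))
            used_rows.add(i)
            used_cols.add(j)
    return sorted(matching)


def global_matching(scores):
    """Globally optimal one-to-one matching by exhaustive search.

    Assumes a small square matrix; the cost grows factorially with size.
    """
    n = len(scores)
    best = max(
        permutations(range(n)),
        key=lambda perm: sum(scores[i][perm[i]] for i in range(n)),
    )
    return [(i, best[i]) for i in range(n)]


def total_score(scores, matching):
    """Sum of the scores of all matched pairs."""
    return sum(scores[i][j] for i, j in matching)


# Hypothetical similarity scores between two proteins in each species.
scores = [
    [90, 85],
    [88, 10],
]

greedy = greedy_matching(scores)
optimal = global_matching(scores)
```

Here the greedy approach locks in the single best pair (score 90) and is left with a poor second pair, while the global matching accepts two slightly weaker pairs for a much higher total.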

Input organisms

Homo sapiens, Pan troglodytes, Canis lupus familiaris, Bos taurus, Mus musculus, Danio rerio, Rattus norvegicus, Arabidopsis thaliana, Gallus gallus, Oryza sativa, Anopheles gambiae, Drosophila melanogaster, Magnaporthe grisea, Neurospora crassa, Caenorhabditis elegans, Saccharomyces cerevisiae, Kluyveromyces lactis, Eremothecium gossypii, Schizosaccharomyces pombe and Plasmodium falciparum.


HomoloGene is linked to all Entrez databases and draws on homology and phenotype information from the following sources:
* Mouse Genome Informatics (MGI),
* Zebrafish Information Network (ZFIN),
* Saccharomyces Genome Database (SGD),
* Clusters of Orthologous Groups (COG),
* FlyBase,
* Online Mendelian Inheritance in Man (OMIM)

As a result, HomoloGene displays information about genes, proteins, phenotypes, and conserved domains.

External links

* [http://www.ncbi.nlm.nih.gov/sites/entrez?db=homologene HomoloGene] at the National Center for Biotechnology Information
* [http://harvester.embl.de/ Bioinformatic Harvester] - a meta search engine that uses HomoloGene
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM OMIM]
* [http://zfin.org/cgi-bin/webdriver?MIval=aa-ZDB_home.apg ZFIN]
* [http://www.yeastgenome.org/ SGD]
* [http://www.ncbi.nlm.nih.gov/COG/ COG]
* [http://flybase.bio.indiana.edu/ FlyBase]
* [http://www.informatics.jax.org/ MGI]
* [http://rgd.mcw.edu/ Rat Genome Database]

Wikimedia Foundation. 2010.
