- OrthoDB
OrthoDB
Description: Catalog of Eukaryotic Orthologs
Research center: Swiss Institute of Bioinformatics
Laboratory: Computational Evolutionary Genomics Group
Authors: Evgenia V. Kriventseva
Primary citation: Kriventseva et al. (2008)[1]
Release date: 2007
Website: orthodb.org

OrthoDB[1] presents a catalog of eukaryotic orthologous protein-coding genes across vertebrates, arthropods, and fungi. Orthology is defined relative to the last common ancestor of the species under consideration, and thus OrthoDB explicitly delineates orthologs at each radiation along the species phylogeny. The database presents available protein descriptors, together with Gene Ontology and InterPro attributes. These provide general descriptive annotations of the orthologous groups and make it possible to query the orthology database comprehensively.
Methodology
Orthology is defined relative to the last common ancestor of the species being considered, which makes orthologous classifications inherently hierarchical. OrthoDB addresses this explicitly by applying the orthology delineation procedure at each radiation point of the considered phylogeny. The phylogeny itself is computed empirically from the super-alignment of single-copy orthologs using a maximum-likelihood approach. The OrthoDB implementation employs a Best-Reciprocal-Hit (BRH) clustering algorithm based on all-against-all Smith–Waterman protein sequence comparisons. Gene set pre-processing selects the longest protein-coding transcript of alternatively spliced genes and of very similar gene copies. The procedure triangulates BRHs to progressively build the clusters, and requires an overall minimum sequence alignment overlap to avoid domain walking. These core clusters are then expanded to include all more closely related within-species in-paralogs and the previously identified very similar gene copies.
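The BRH and triangulation steps can be illustrated with a minimal Python sketch. It is not OrthoDB's actual code. It assumes that pairwise alignment scores have already been computed and filtered for alignment overlap, and it leaves out the in-paralog expansion. The function names, the tuple layout of a hit, and the example data are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def best_hits(hits):
    """Map (query gene, target species) to the best-scoring gene in that species.

    Each hit is a hypothetical tuple:
    (query, query_species, target, target_species, score).
    """
    best = {}
    for query, q_sp, target, t_sp, score in hits:
        if q_sp == t_sp:
            continue  # only between-species hits are considered here
        key = (query, t_sp)
        if key not in best or score > best[key][1]:
            best[key] = (target, score)
    return {key: target for key, (target, _) in best.items()}


def reciprocal_pairs(hits):
    """Return gene pairs that are each other's best hit (BRHs)."""
    species_of = {}
    for query, q_sp, target, t_sp, _ in hits:
        species_of[query] = q_sp
        species_of[target] = t_sp
    best = best_hits(hits)
    pairs = set()
    for (query, _), target in best.items():
        if best.get((target, species_of[query])) == query:
            pairs.add(frozenset((query, target)))
    return pairs


def triangulated_clusters(pairs):
    """Merge genes linked by BRH triangles into clusters (union-find)."""
    neighbours = defaultdict(set)
    for pair in pairs:
        a, b = tuple(pair)
        neighbours[a].add(b)
        neighbours[b].add(a)

    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a in neighbours:
        for b, c in combinations(sorted(neighbours[a]), 2):
            if c in neighbours[b]:  # a, b and c are pairwise BRHs
                parent[find(b)] = find(a)
                parent[find(c)] = find(a)

    clusters = defaultdict(set)
    for gene in parent:
        clusters[find(gene)].add(gene)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical scores for three species A, B and C.
example_hits = [
    ("a1", "A", "b1", "B", 100), ("b1", "B", "a1", "A", 100),
    ("a1", "A", "c1", "C", 90), ("c1", "C", "a1", "A", 90),
    ("b1", "B", "c1", "C", 95), ("c1", "C", "b1", "B", 95),
    ("a2", "A", "b2", "B", 80), ("b2", "B", "a2", "A", 80),
    ("a1", "A", "b2", "B", 40),  # weaker, non-reciprocal hit
]
```

With this example data, `triangulated_clusters(reciprocal_pairs(example_hits))` groups `a1`, `b1` and `c1` into one cluster. The pair `a2`/`b2` is a BRH but stays unclustered, because no third species closes a triangle. In this sketch, requiring triangles is what makes a cluster depend on mutually consistent evidence from at least three species.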
Data content
The database now contains over 100 species[2] with 44 vertebrate genomes sourced from Ensembl, 46 fungal genomes from UniProt and 25 arthropod genomes from several databases. The ever-increasing sampling of sequenced eukaryotic genomes brings a clearer account of the majority of gene genealogies that will facilitate informed hypotheses of gene function in newly sequenced genomes.
Examples of studies that have employed data from OrthoDB include comparative analyses of gene repertoire evolution,[3][4] comparisons of fruit fly and mosquito developmental genes,[5] analyses of bloodmeal- or infection-induced changes in gene expression in mosquitoes,[6][7][8] and analysis of the evolution of mammalian milk production.[9] Other studies citing OrthoDB can be found at PubMed.
See also
- Homology (biology)
- Phylogeny
References
- [1] Kriventseva EV, Rahman N, Espinosa O, Zdobnov EM (January 2008). "OrthoDB: the hierarchical catalog of eukaryotic orthologs". Nucleic Acids Res. 36 (Database issue): D271–5. doi:10.1093/nar/gkm845. PMC 2238902. PMID 17947323. http://nar.oxfordjournals.org/content/36/suppl_1/D271.long.
- [2] Waterhouse RM, Zdobnov EM, Tegenfeldt F, Li J, Kriventseva EV (January 2011). "OrthoDB: the hierarchical catalog of eukaryotic orthologs in 2011". Nucleic Acids Res. 39 (Database issue): D283–8. doi:10.1093/nar/gkq930. PMC 3013786. PMID 20972218. http://nar.oxfordjournals.org/content/39/suppl_1/D283.long.
- [3] Waterhouse RM, Zdobnov EM, Kriventseva EV (January 2011). "Correlating traits of gene retention, sequence divergence, duplicability and essentiality in vertebrates, arthropods, and fungi". Genome Biol Evol. 3: 75–86. doi:10.1093/gbe/evq083. PMC 3030422. PMID 21148284. http://gbe.oxfordjournals.org/content/3/75.long.
- [4] Hase T, Niimura Y, Tanaka H (2010). "Difference in gene duplicability may explain the difference in overall structure of protein-protein interaction networks among eukaryotes". BMC Evol Biol. 10. doi:10.1186/1471-2148-10-358. PMC 2994879. PMID 21087510. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2994879.
- [5] Behura SK, Haugen M, Flannery E, Sarro J, Tessier CR, Severson DW, Duman-Scheel M (2011). "Comparative Genomic Analysis of Drosophila melanogaster and Vector Mosquito Developmental Genes". PLoS One 6. doi:10.1371/journal.pone.0021504. PMC 3130749. PMID 21754989. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3130749.
- [6] Bonizzoni M, Dunn WA, Campbell CL, Olson KE, Dimon MT, Marinotti O, James AA (2011). "RNA-seq analyses of blood-induced changes in gene expression in the mosquito vector species, Aedes aegypti". BMC Genomics 12. doi:10.1186/1471-2164-12-82. PMC 3042412. PMID 21276245. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3042412.
- [7] Pinto SB, Lombardo F, Koutsos AC, Waterhouse RM, McKay K, An C, Ramakrishnan C, Kafatos FC, Michel K (2009). "Discovery of Plasmodium modulators by genome-wide analysis of circulating hemocytes in Anopheles gambiae". Proc Natl Acad Sci U S A. 106. doi:10.1073/pnas.0909463106. PMC 2783009. PMID 19940242. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2783009.
- [8] Bartholomay LC, Waterhouse RM, Mayhew GF, Campbell CL, Michel K, Zou Z, Ramirez JL, Das S, Alvarez K, Arensburger P, Bryant B, Chapman SB, Dong Y, Erickson SM, Karunaratne SH, Kokoza V, Kodira CD, Pignatelli P, Shin SW, Vanlandingham DL, Atkinson PW, Birren B, Christophides GK, Clem RJ, Hemingway J, Higgs S, Megy K, Ranson H, Zdobnov EM, Raikhel AS, Christensen BM, Dimopoulos G, Muskavitch MA (2010). "Pathogenomics of Culex quinquefasciatus and meta-analysis of infection responses to diverse pathogens". Science 330. doi:10.1126/science.1193162. PMC 3104938. PMID 20929811. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3104938.
- [9] Lemay DG, Lynn DJ, Martin WF, Neville MC, Casey TM, Rincon G, Kriventseva EV, Barris WC, Hinrichs AS, Molenaar AJ, Pollard KS, Maqbool NJ, Singh K, Murney R, Zdobnov EM, Tellam RL, Medrano JF, German JB, Rijnkels M (2009). "The bovine lactation genome: insights into the evolution of mammalian milk". Genome Biol. 10. doi:10.1186/gb-2009-10-4-r43. PMC 2688934. PMID 19393040. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2688934.
Wikimedia Foundation. 2010.