- Biomedical text mining
Biomedical text mining (also known as BioNLP) refers to text mining applied to the texts and literature of the biomedical and molecular biology domain. It is a relatively recent research field at the intersection of natural language processing, bioinformatics, medical informatics and computational linguistics. Interest in text mining and information extraction strategies applied to the biomedical and molecular biology literature is growing because of the increasing number of electronically available publications stored in databases such as PubMed.
Main applications
The main developments in this area have been related to the identification of biological entities (named entity recognition), such as protein and gene names in free text; the association of gene clusters obtained by microarray experiments with the biological context provided by the corresponding literature; and the automatic extraction of protein interactions and of associations between proteins and functional concepts (e.g. Gene Ontology terms). Information extraction and text mining systems have also addressed the extraction of kinetic parameters from text and the subcellular location of proteins.
Examples
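As a toy illustration of two of the tasks named above, dictionary-based recognition of gene names and sentence-level co-occurrence counting, the following Python sketch may help. It is only a minimal sketch under stated assumptions: the lexicon `GENE_LEXICON`, the helper names `find_genes` and `cooccurrence_counts`, and the sample sentences are all invented for illustration and do not come from any of the tools listed in this article, which rely on far larger dictionaries and more robust methods.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical toy lexicon (an assumption for this sketch):
# lower-cased surface form -> canonical gene symbol.
GENE_LEXICON = {
    "tp53": "TP53",
    "p53": "TP53",
    "mdm2": "MDM2",
    "brca1": "BRCA1",
}

TOKEN_RE = re.compile(r"[A-Za-z0-9]+")
# Naive sentence splitter: break on whitespace that follows ., ! or ?
SENTENCE_RE = re.compile(r"(?<=[.!?])\s+")


def find_genes(sentence):
    """Return the set of canonical gene symbols mentioned in one sentence."""
    tokens = TOKEN_RE.findall(sentence.lower())
    return {GENE_LEXICON[t] for t in tokens if t in GENE_LEXICON}


def cooccurrence_counts(abstracts):
    """Count how often two distinct genes appear in the same sentence."""
    counts = Counter()
    for text in abstracts:
        for sentence in SENTENCE_RE.split(text):
            genes = sorted(find_genes(sentence))
            # Each unordered pair is stored once, in alphabetical order.
            counts.update(combinations(genes, 2))
    return counts


if __name__ == "__main__":
    # Invented example sentences, not real PubMed abstracts.
    abstracts = [
        "MDM2 binds p53 and promotes its degradation. BRCA1 was not examined.",
        "Loss of TP53 alters MDM2 expression. BRCA1 interacts with TP53.",
    ]
    for pair, n in cooccurrence_counts(abstracts).most_common():
        print(pair, n)
```

Exact dictionary lookup of this kind misses spelling variants and ambiguous symbols, which is one reason gene and protein name recognition remains an active research topic.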
* [http://www.xtractor.in/ XTractor] - Discovering Newer Scientific Relations Across PubMed Abstracts. A tool to obtain manually annotated relationships for proteins, diseases, drugs and biological processes as they are published in PubMed.
* [http://www.chilibot.net/ Chilibot] : A tool for finding relationships between genes or gene products.
* [http://www.ihop-net.org/UniPub/iHOP/ Information Hyperlinked Over Proteins (iHOP)] (ref.: [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16204114 Bioinformatics, 2005 Sep 1;21 Suppl 2:ii252-ii258.] ): "A network of concurring genes and proteins extends through the scientific literature touching on phenotypes, pathologies and gene function. iHOP provides this network as a natural way of accessing millions of PubMed abstracts. By using genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed can be converted into one navigable resource, bringing all advantages of the internet to scientific literature research."
* [http://fable.chop.edu FABLE] : A gene-centric text-mining search engine for MEDLINE
* [http://www.gopubmed.org GoPubMed] : Retrieves PubMed abstracts for a search query, then detects ontology terms from the Gene Ontology and Medical Subject Headings in the abstracts. The user can browse the search results by exploring the ontologies and displaying only papers that mention selected terms, their synonyms or descendants.
* [http://www.litinspector.org LitInspector] - Gene and signal transduction pathway data mining in PubMed abstracts.
* [http://www.pubgene.org PubGene] - Co-occurrence network display of gene and protein symbols as well as MeSH, GO, PubChem and interaction terms (such as "binds" or "induces") as these appear in MEDLINE records (that is, PubMed titles and abstracts).
References
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15998455 Text-mining and information-retrieval services for molecular biology]
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15886388 Text mining for metabolic pathways, signaling cascades, and protein networks]
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15808823 Text-mining approaches in molecular biology and biomedicine]
* [http://blimp.cs.queensu.ca/ Biomedical LIterature Mining Publications (BLIMP)] : A comprehensive and regularly updated index of publications on (bio)medical text mining
Conferences at which BioNLP research is presented
BioNLP is presented at a variety of meetings:
* Pacific Symposium on Biocomputing : in plenary session
* Intelligent Systems for Molecular Biology : in plenary session and also in the BioLINK and Bio-ontologies workshops
* Association for Computational Linguistics and North American Chapter of the Association for Computational Linguistics annual meetings and associated workshops: in plenary session and as part of the BioNLP workshop (see below)
* [http://compbio.uchsc.edu/BioNLP2008 BioNLP 2008]
* American Medical Informatics Association annual meeting: in plenary session
See also
* BioCreative
* TREC Genomics
External links
* [https://lists.ccs.neu.edu/pipermail/bionlp/ The BioNLP mailing list archives]
* [http://compbio.uchsc.edu/ccp/corpora/index.shtml Corpora for biomedical text mining]
* [http://biocreative.sourceforge.net The BioCreative evaluations of biomedical text mining technologies]
* [http://compbio.uchsc.edu/Hunter_lab/Cohen/bioNlpPeople.html Directory of people involved in BioNLP]
Wikimedia Foundation. 2010.