ETBLAST

eTBLAST is a text similarity search engine currently offering access to the MEDLINE database, the National Institutes of Health (NIH) CRISP database, the Institute of Physics (IOP) database, and the NASA technical reports database. Additional text-based databases are added over time. The eTBLAST server compares a user's natural-text query to target databases using a hybrid search algorithm consisting of a low-sensitivity, weighted keyword-based first pass followed by a novel sentence-alignment-based second pass. eTBLAST is a free web-based service of [http://innovation.swmed.edu The Innovation Laboratory] at the University of Texas Southwestern Medical School.
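The two-pass idea described above can be illustrated with a minimal Python sketch. This is not the eTBLAST implementation: the IDF-style keyword weighting, the use of `difflib.SequenceMatcher` as a stand-in for sentence alignment, and all function names and the `top_n` parameter are assumptions made purely for illustration.

```python
import math
import re
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def keyword_pass(query, documents, top_n):
    """First pass (assumed form): score each document by the summed
    IDF-style weight of the words it shares with the query."""
    n_docs = len(documents)
    doc_tokens = [set(tokenize(d)) for d in documents]
    doc_freq = Counter(w for toks in doc_tokens for w in toks)
    query_words = set(tokenize(query))
    scored = []
    for idx, toks in enumerate(doc_tokens):
        score = sum(math.log(1 + n_docs / doc_freq[w]) for w in query_words & toks)
        scored.append((score, idx))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    # Keep only the best candidates that share at least one word with the query.
    return [idx for score, idx in scored[:top_n] if score > 0]


def alignment_score(query, document):
    """Second pass (assumed form): for each query sentence, take its best
    word-level match ratio against any document sentence, then average."""
    q_sents = split_sentences(query)
    d_sents = split_sentences(document)
    if not q_sents or not d_sents:
        return 0.0
    best = [
        max(SequenceMatcher(None, tokenize(q), tokenize(d)).ratio() for d in d_sents)
        for q in q_sents
    ]
    return sum(best) / len(best)


def search(query, documents, top_n=50):
    """Run the cheap keyword pass, then re-rank the survivors by alignment."""
    candidates = keyword_pass(query, documents, top_n)
    ranked = sorted(
        ((alignment_score(query, documents[i]), i) for i in candidates),
        key=lambda pair: (-pair[0], pair[1]),
    )
    return [(i, score) for score, i in ranked]
```

The design point the sketch tries to capture is that the inexpensive keyword pass narrows a large database to a short candidate list, so the costlier sentence-level comparison runs on only a few documents.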

eTBLAST, as a text similarity engine, made possible a large study of duplicate publications and potential plagiarism in the biomedical literature. Thousands of randomly sampled MEDLINE abstracts were submitted to eTBLAST, and those with the highest similarity were studied and entered into an online database. This study is ongoing, with the database maturing as the entries are manually inspected and classified. This work revealed several trends, including an increasing rate of duplication in the biomedical literature, as reported in the journals [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/2/243 Bioinformatics] and [http://www.nature.com/doifinder/10.1038/451397a Nature].
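The screening workflow described above (sample abstracts, find the most similar record for each, queue high-scoring pairs for manual inspection) might look roughly like the following Python sketch. The similarity measure, the `threshold` value of 0.6, and all names are hypothetical; the study itself used eTBLAST for the comparison step.

```python
import random
from difflib import SequenceMatcher


def similarity(a, b):
    """Word-level match ratio between two texts (a stand-in for eTBLAST scoring)."""
    return SequenceMatcher(None, a.lower().split(), b.lower().split()).ratio()


def screen_for_duplicates(abstracts, sample_size, threshold=0.6, seed=0):
    """Sample abstracts at random, find each one's closest other record,
    and return the pairs whose similarity meets the (assumed) threshold.

    `abstracts` maps a record identifier to its abstract text.
    """
    rng = random.Random(seed)
    sample_ids = rng.sample(sorted(abstracts), min(sample_size, len(abstracts)))
    flagged = []
    for query_id in sample_ids:
        best_id, best_score = None, 0.0
        for other_id, text in abstracts.items():
            if other_id == query_id:
                continue
            score = similarity(abstracts[query_id], text)
            if score > best_score:
                best_id, best_score = other_id, score
        if best_id is not None and best_score >= threshold:
            flagged.append((query_id, best_id, best_score))
    # Highest-similarity pairs first, ready for manual inspection.
    return sorted(flagged, key=lambda item: -item[2])
```

Pairs returned by such a screen are only candidates; as the text notes, classification as a duplicate still depends on manual inspection.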

Interface

Because eTBLAST is a text-similarity engine rather than a simple keyword-based search tool, it is claimed that the user need not identify and manipulate query keywords and Boolean operators, as must be done for other search engines.

eTBLAST aims to help the user quickly find references, evaluate novelty, identify experts and journals in a given topical area, and track the popularity of the topic as defined by the user’s query.

A typical query of 100 words takes 1-2 minutes to return results after a comparison to MEDLINE, which as of 1 January 2007 contained over 16 million records.

References

* [http://m.errami.googlepages.com Mounir Errami] and [http://www.utsouthwestern.edu/findfac/professional/0,,12465,00.html Harold R. Garner], A tale of two citations. Nature, 2008 Jan 24;451(7177):397-9. [http://www.ncbi.nlm.nih.gov/pubmed/18216832?ordinalpos=3&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum View on PubMed.]
* [http://m.errami.googlepages.com Mounir Errami], Justin M. Hicks, Wayne Fisher, David Trusty, Tara C. Long, [http://faculty-staff.ou.edu/W/Jonathan.D.Wren-1/ Jonathan D. Wren] and [http://www.utsouthwestern.edu/findfac/professional/0,,12465,00.html Harold R. Garner], Déjà vu: a study of duplicate citations in Medline. Bioinformatics, 2008 Jan 15;24(2):243-9. [http://www.ncbi.nlm.nih.gov/pubmed/18056062?ordinalpos=3&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum View on PubMed.]
* [http://m.errami.googlepages.com Mounir Errami], [http://faculty-staff.ou.edu/W/Jonathan.D.Wren-1/ Jonathan D. Wren], Justin M. Hicks and [http://www.utsouthwestern.edu/findfac/professional/0,,12465,00.html Harold R. Garner], eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications. Nucleic Acids Research, 2007 Apr. [http://www.ncbi.nlm.nih.gov/sites/entrez?Db=pubmed&Cmd=ShowDetailView&TermToSearch=17452348&ordinalpos=2&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum View on PubMed.]
* James Lewis, Stephan Ossowski, Justin Hicks, [http://m.errami.googlepages.com Mounir Errami] and Harold R. Garner, Text similarity: an alternative way to search MEDLINE. Bioinformatics, 2006 Sep 15;22(18):2298-304. [http://www.ncbi.nlm.nih.gov/sites/entrez?Db=pubmed&Cmd=ShowDetailView&TermToSearch=16926219&ordinalpos=6&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum View on PubMed.]
* eTBLAST was highlighted in the NetWatch column in Science, May 14, 2004. http://www.sciencemag.org/content/vol304/issue5673/netwatch.shtml
* Alexander Pertsemlidis and Harold R. Garner, Text comparison based on dynamic programming. IEEE Engineering in Medicine and Biology Magazine, 2004 Nov-Dec;23(6):66-71.

External links

* [http://invention.swmed.edu/etblast/index.shtml eTBLAST]
* [http://spore.swmed.edu/dejavu Deja Vu: a database of duplicate publications]
* [http://pubmed.gov/ MEDLINE/PubMed]
* [http://crisp.cit.nih.gov/ NIH Grants and Contracts Database]
* [http://www.iop.org/ Institute of Physics]
* [http://ntrs.nasa.gov/search.jsp NASA Technical Reports]


Wikimedia Foundation. 2010.
