- Isearch
Isearch is open-source
text retrieval software first developed in1994 as part of theIsite Z39.50 information framework. The project started at the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR) of the North Carolina supercomputing center MCNC and funded by theNational Science Foundation to follow in the track of WAIS and develop prototype systems for distributed information networks encompassing Internet applications, library catalogs and other information resources.The main features of Isearch include full text and field searching, relevance ranking, Boolean queries, and support for many document types such as HTML, mail folders, list digests, MEDLINE, BibTeX, SGML/XML, FGDC Metadata, NASA DIF, ANZLIC metadata, ISO 19115 metadata and many other resource types and document formats.
It was the first search engine to be designed from the ground up to support
SGML andISO 23950 search and retrieval. It included many innovations including the "document type" model-- which is simply a (object oriented) method of associating each document with a class of functions providing a standard interface for accessing the document. It was one of the first engines (if not the first) to ever support XML.The Isearch search/indexing text algorithms were based on
Gaston Gonnet 's seminal work into PAT arrays and trees for text retrieval--- ideas that were developed for the New Oxford English Dictionary Project at the Univ. of Waterloo, and provided the seeds forTim Bray 's PAT SGML engine that formed the basis ofOpen Text . One of the limiting factors, however, of the Isearch design was that it was not well suited to handle the extremely large data sets that became popular in the mid to late 1990's. In many cases Isearch was adapted or modified to use different algorithms but usually retained the document type model and the architectural relationship with Isite.Isearch was widely adopted and used in hundreds of public search sites, including many high profile projects such as the [http://patft1.uspto.gov/ U.S. Patent and Trademark Office (USPTO) patent search] , [http://clearinghouse3.fgdc.gov/ the Federal Geographic Data Clearinghouse (FGDC)] , the NASA Global Change Master Directory, the NASA EOS Guide System, the NASA Catalog Interoperability Project, the Astronomical pre-print service based at the Space Telescope Science Institute, The PCT Electronic Gazette at the World Intellectual Property Organization (WIPO), Linsearch (a search engine for Open Source Software designed by Miles Efron), the SAGE Project of the Special Collections Department at Emory University, Eco Companion Australasia (an environmental geospatial resources catalog), Australian National Genomic Information Service (ANGIS), the Open Directory Project and numerous governmental portals in the context of the
Government Information Locator Service (GILS) GPO mandate (ended in 2005?).From 1994 to 1998 most of the development was centered around the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR) in North Carolina (Engine core) and BSn in Germany (Doctypes). By 1998 much of the open-source Isearch core developers re-focused development into several spin-offs. In 1998 it became part of the
Advanced Search Facility reference software platform funded by the U.S. Department of Commerce.A/WWW Enterprises now maintains the open source version for public usage, supported by paying government clients, such as the U.S. Patent and Trademark Office, NASA, and the FGDC who have provided support to enhance the functionality and reliability of the software. The software suite is considered a reference implementation of catalog service software.
External links
* [http://www.fgdc.gov/dataandservices/isite U.S. Federal Geographic Data Committee Isite]
* [http://isite.awcubed.com/ Isite/Isearch2 Documentation Site]
* [ftp://ftp.awcubed.com/pub/Software Current Isearch download site]
* [http://www.etymon.com/tr.html Etymon: Isearch]
* [http://www.ibu.de/node/52 BSn/NONMONOTONIC Lab: IB Search Engine] , embeddable search engine. A commercial spin-off from the Isearch project.References
* [http://www.springerlink.com/content/g5e2wfd0lekygvut/ Application of Metadata Concepts to Discovery of Internet Resources]
* [http://www.springerlink.com/content/b5chmkgx8akg4m2h/ An Operational Metadata Framework for Searching, Indexing, and Retrieving Distributed Geographic Information Services on the Internet]
* The UNIX Web Server Book, Second Edition, by R. Douglas Matthews et al (Ventana Press, 1997).
* [http://www.webtechniques.com/archives/1997/05/nassar/ "Searching With Isearch". May 1997, Web Techniques]
* [http://www.itl.nist.gov/fipspubs/fip192.htm FIPS-192: APPLICATION PROFILE FOR THE GOVERNMENT INFORMATION LOCATOR SERVICE (GILS)]
* [http://www.uneca.org/awich/AWICH%20Workshop/YaoundeWorkshop/Clearinghouse%20Yaounde.pdf Clearinghouse and Metadata Concepts, Danel Behanu, U.N. Economic Commission for Africa, 2004]
* [http://www.whitehouse.gov/OMB/memoranda/m9805.html M-98-05 Guidance on the Government Information Locator Service] published by the OMB
* [http://www.hpcwire.com/archives/3149.html 01/1995 Press Release: Patent Office Launch Internet AIDS Patent Library]Comparisons
* [http://www.ukoln.ac.uk/metadata/roads/product-comparison/ Product Comparison: Information Gateway Software]
* [http://wrg.upf.edu/WRG/dctos/Middleton-Baeza.pdf A Comparison of Open Source Search Engines, Christian Middleton, Ricardo Baeza-Yates]
* [http://www.infomotions.com/musings/opensource-indexers/ Comparing Open Source Indexers]
Wikimedia Foundation. 2010.