Isearch

Isearch

Isearch is open-source text retrieval software first developed in 1994 as part of the Isite Z39.50 information framework. The project started at the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR) of the North Carolina supercomputing center MCNC and funded by the National Science Foundation to follow in the track of WAIS and develop prototype systems for distributed information networks encompassing Internet applications, library catalogs and other information resources.

The main features of Isearch include full text and field searching, relevance ranking, Boolean queries, and support for many document types such as HTML, mail folders, list digests, MEDLINE, BibTeX, SGML/XML, FGDC Metadata, NASA DIF, ANZLIC metadata, ISO 19115 metadata and many other resource types and document formats.

It was the first search engine to be designed from the ground up to support SGML and ISO 23950 search and retrieval. It included many innovations including the "document type" model-- which is simply a (object oriented) method of associating each document with a class of functions providing a standard interface for accessing the document. It was one of the first engines (if not the first) to ever support XML.

The Isearch search/indexing text algorithms were based on Gaston Gonnet's seminal work into PAT arrays and trees for text retrieval--- ideas that were developed for the New Oxford English Dictionary Project at the Univ. of Waterloo, and provided the seeds for Tim Bray's PAT SGML engine that formed the basis of Open Text. One of the limiting factors, however, of the Isearch design was that it was not well suited to handle the extremely large data sets that became popular in the mid to late 1990's. In many cases Isearch was adapted or modified to use different algorithms but usually retained the document type model and the architectural relationship with Isite.

Isearch was widely adopted and used in hundreds of public search sites, including many high profile projects such as the [http://patft1.uspto.gov/ U.S. Patent and Trademark Office (USPTO) patent search] , [http://clearinghouse3.fgdc.gov/ the Federal Geographic Data Clearinghouse (FGDC)] , the NASA Global Change Master Directory, the NASA EOS Guide System, the NASA Catalog Interoperability Project, the Astronomical pre-print service based at the Space Telescope Science Institute, The PCT Electronic Gazette at the World Intellectual Property Organization (WIPO), Linsearch (a search engine for Open Source Software designed by Miles Efron), the SAGE Project of the Special Collections Department at Emory University, Eco Companion Australasia (an environmental geospatial resources catalog), Australian National Genomic Information Service (ANGIS), the Open Directory Project and numerous governmental portals in the context of the Government Information Locator Service (GILS) GPO mandate (ended in 2005?).

From 1994 to 1998 most of the development was centered around the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR) in North Carolina (Engine core) and BSn in Germany (Doctypes). By 1998 much of the open-source Isearch core developers re-focused development into several spin-offs. In 1998 it became part of the Advanced Search Facility reference software platform funded by the U.S. Department of Commerce.

A/WWW Enterprises now maintains the open source version for public usage, supported by paying government clients, such as the U.S. Patent and Trademark Office, NASA, and the FGDC who have provided support to enhance the functionality and reliability of the software. The software suite is considered a reference implementation of catalog service software.

External links

* [http://www.fgdc.gov/dataandservices/isite U.S. Federal Geographic Data Committee Isite]
* [http://isite.awcubed.com/ Isite/Isearch2 Documentation Site]
* [ftp://ftp.awcubed.com/pub/Software Current Isearch download site]
* [http://www.etymon.com/tr.html Etymon: Isearch]
* [http://www.ibu.de/node/52 BSn/NONMONOTONIC Lab: IB Search Engine] , embeddable search engine. A commercial spin-off from the Isearch project.

References

* [http://www.springerlink.com/content/g5e2wfd0lekygvut/ Application of Metadata Concepts to Discovery of Internet Resources]
* [http://www.springerlink.com/content/b5chmkgx8akg4m2h/ An Operational Metadata Framework for Searching, Indexing, and Retrieving Distributed Geographic Information Services on the Internet]
* The UNIX Web Server Book, Second Edition, by R. Douglas Matthews et al (Ventana Press, 1997).
* [http://www.webtechniques.com/archives/1997/05/nassar/ "Searching With Isearch". May 1997, Web Techniques]
* [http://www.itl.nist.gov/fipspubs/fip192.htm FIPS-192: APPLICATION PROFILE FOR THE GOVERNMENT INFORMATION LOCATOR SERVICE (GILS)]
* [http://www.uneca.org/awich/AWICH%20Workshop/YaoundeWorkshop/Clearinghouse%20Yaounde.pdf Clearinghouse and Metadata Concepts, Danel Behanu, U.N. Economic Commission for Africa, 2004]
* [http://www.whitehouse.gov/OMB/memoranda/m9805.html M-98-05 Guidance on the Government Information Locator Service] published by the OMB
* [http://www.hpcwire.com/archives/3149.html 01/1995 Press Release: Patent Office Launch Internet AIDS Patent Library]

Comparisons

* [http://www.ukoln.ac.uk/metadata/roads/product-comparison/ Product Comparison: Information Gateway Software]
* [http://wrg.upf.edu/WRG/dctos/Middleton-Baeza.pdf A Comparison of Open Source Search Engines, Christian Middleton, Ricardo Baeza-Yates]
* [http://www.infomotions.com/musings/opensource-indexers/ Comparing Open Source Indexers]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Isearch — это свободное программное обеспечение для текстового поиска, разработанное в 1994 в Clearinghouse для Networked Information Discovery and Retrieval (CNIDR), на средства, выделенные национальным научным фондом (США). Одним из наиболее известных… …   Википедия

  • Isearch/isvrs — Isearch is a resilient and common adware program that is often installed on a user’s computer from pop ups or unprotected downloads. Even most firewalls and other computer protection programs are ineffective to stop this program from being… …   Wikipedia

  • Automotive lighting — Blinker redirects here. For other uses, see Blinker (disambiguation). Not to be confused with Magneti Marelli company AL Automotive Lighting. For lights in seafaring and aviation, see navigation light. The lighting system of a motor vehicle… …   Wikipedia

  • Список поисковых машин — …   Википедия

  • G protein-coupled receptor — G protein coupled receptors (GPCRs), also known as seven transmembrane domain receptors, 7TM receptors, heptahelical receptors, and G protein linked receptors (GPLR), comprise a large protein family of transmembrane receptors that sense molecules …   Wikipedia

  • Wide area information server — Wide Area Information Servers or WAIS is a client server text searching system that uses the ANSI Standard Z39.50 Information Retrieval Service Definition and Protocol Specifications for Library Applications (Z39.50:1988) to search index… …   Wikipedia

  • List of individuals executed in Arizona — A total of 23 individuals convicted of murder have been executed by the state of Arizona since 1976. All were by lethal injection, except those indicated with a * which were by gas chamber. See also * Capital punishment in the United States… …   Wikipedia

  • List of search engines — This is a list of Wikipedia articles about search engines, including web search engines, metasearch engines, desktop search tools, and web portals and vertical market websites that have a search facility for online databases.By… …   Wikipedia

  • AUSTAR — Infobox Company company name = AUSTAR Communications company company type = Public (asx|AUN) foundation = 1995 location city = Sydney, New South Wales Gold Coast, Queensland location country = AUS key people = John Porter, CEO Mike Fries,… …   Wikipedia

  • Performics — Infobox Company company name = Performics company company type = Private company slogan = foundation = Chicago, Illinois (1998) as Dynamic Trade location = Chicago, Illinois key people = Stuart Frankel, President Chris Henger, Vice President,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”