Federated search

Federated search

Federated search is the simultaneous search of multiple online databases and is an emerging feature of automated, Web-based library and information retrieval systems. It is also often referred to as a portal, as opposed to simply a Web-based search engine.

Purpose

As described by [http://www2.hawaii.edu/~jacso/ Peter Jacso] (2004), federated searching consists of (1) transforming a query and broadcasting it to a group of disparate databases with the appropriate syntax, (2) merging the results collected from the databases, (3) presenting them in a succinct and unified format with minimal duplication, and (4) providing a means, performed either automatically or by the portal user, to sort the merged result set. In traditional search engines such as Google, only sources that have been indexed by the search engine’s crawler technology can be searched, retrieved and accessed. The large volume of documents housed in databases is not open to traditional Internet search engines because of limitations in crawler technology. Federated searching resolves this issue by the technique described above and makes these deep Web documents searchable without having to visit each database individually.

Process

Federated search computer programs allow users to search multiple information sources with a single query from a single user interface. The user enters a search query in the portal interface’s search box and the query is sent to every individual database in the portal or federated search list. Access details for the individual databases must be preset in the portal by its owner. Federated search systems either rely upon vendors to create commercial portal systems, or they rely upon government or other organizations to provide open access portals. How federated search is implemented depends upon which of the two types of organizations is providing the portal.

Federated search portals, either commercial or open access, generally search public access bibliographic databases, public access Web-based library catalogues (OPACs), Web-based search engines like Google and/or open-access, government-operated or corporate data collections. These individual information sources send back to the portal's interface a list of results from the search query. The user can review this hit list. Some portals will merely screen scrape the actual database results and not directly allow a user to enter the information source's application. More sophisticated ones will de-dupe the results list by merging and removing duplicates. There are additional features available in many portals, but the basic idea is the same: to improve the accuracy and relevance of individual searches as well as reduce the amount of time required to search for resources.

This process allows federated search some key advantages when compared with existing crawler-based search engines. Federated search need not place any requirements or burdens on owners of the individual information sources, other than handling increased traffic. Federated searches are inherently as current as the individual information sources, as they are searched in real time.

Implementation

One application of federated searching is the metasearch engine; however, this is not a complete solution as many documents are not currently indexed. This is known as the deep Web or invisible Web. Many more information sources are not yet stored in electronic form. Google Scholar is an example of a project trying to address this.

When the search vocabulary or data model of the search system is different from the data model of one or more of the foreign target systems the query must be translated into each of the foreign target systems. This can be done using simple data-element translation or may require semantic translation.

A challenge faced in the implementation of federated search engines is scalability, i.e. the performance of the site as the number of information sources comprising the federated search engine increase. One federated search engine that has begun to address this issue is WorldWideScience, hosted by the U.S. Department of Energy's Office of Scientific and Technical Information. WorldWideScience [ [http://www.worldwidescience.org WorldWideScience] ] is composed of more than 40 information sources, several of which are federated search portals themselves. One such portal is Science.gov [ [http://www.science.gov Science.gov] ] which itself federates more than 30 information sources representing most of the R&D output of the U.S. Federal government. Science.gov returns its highest ranked results to WorldWideScience, which then merges and ranks these results with the search returned by the other information sources that comprise WorldWideScience. [ [http://www.science.gov Science.gov] ] This approach of cascaded federated search enables large number of information sources to be searched via a single query.

Another application Sesam running in both Norway and Sweden has been built on top of an open sourced platform specialised for federated search solutions. Sesat, [ [http://sesat.no Sesat] ] , an acronym for Sesam Search Application Toolkit, is a platform that provides much of the framework and functionality required for handling parallel and pipelined searches and displaying them elegantly in a user interface, allowing engineers to focus on the index/database configuration tuning.

See also

* Metasearch engine
* Funnelback
* Aggregator

References


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • federated search — /fɛdəreɪtəd ˈsɜtʃ/ (say feduhraytuhd serch) noun an online search of multiple databases simultaneously resulting from a single query …  

  • Search algorithm — In computer science, a search algorithm, broadly speaking, is an algorithm that takes a problem as input and returns a solution to the problem, usually after evaluating a number of possible solutions. Most of the algorithms studied by computer… …   Wikipedia

  • Search engine (computing) — A search engine is an information retrieval system designed to help find information stored on a computer system. Search engines help to minimize the time required to find information and the amount of information which must be consulted, akin to …   Wikipedia

  • Search aggregator — A search aggregator is a type of metasearch engine which gathers results from multiple search engines simultaneously through RSS search results. It combines user specified search feeds (parameterized RSS feeds which return search results) to give …   Wikipedia

  • Federated Indians of Graton Rancheria — The Federated Indians of Graton Rancheria [ [http://www.gratonrancheria.com/index.htm Federated Indians of Graton Rancheria ] ] , formerly the Federated Coast Miwok, was officially recognized by the U.S. government on December 27, 2000, pursuant… …   Wikipedia

  • Federated States of Micronesia — This article is about the sovereign state in Oceania. For the region named Micronesia, see Micronesia. Federated States of Micronesia …   Wikipedia

  • Web search engine — Search engine redirects here. For other uses, see Search engine (disambiguation). The three most widely used web search engines and their approximate share as of late 2010.[1] A web search engine is designed to search for information on the Wo …   Wikipedia

  • Index (search engine) — Search engine indexing collects, parses, and stores data to facilitate fast and accurate information retrieval. Index design incorporates interdisciplinary concepts from linguistics, cognitive psychology, mathematics, informatics, physics, and… …   Wikipedia

  • Federated Ship Painters and Dockers Union — Infobox Union name= Painters and Dockers country= Australia affiliation= members= full name= Federated Ship Painters and Dockers Union native name= founded= 1900 current= head= dissolved date= 1 December 1993 dissolved state= merged into= Members …   Wikipedia

  • Microsoft Search Server — (MSS) is an enterprise search platform from Microsoft, based on the search capabilities of Microsoft Office SharePoint Server.[1] MSS shares its architectural underpinnings with the Windows Search platform for both the querying engine as well as… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”