Human Computer Information Retrieval

Human Computer Information Retrieval

The fields of human computer interaction (HCI) and information retrieval (IR) have both developed innovative techniques to address the challenge of navigating the complex information spaces, but their insights have to date often failed to cross disciplinary borders. Human-computer information retrieval (HCIR) has emerged in academic research and industry practice as the study of information retrieval techniques that bring human intelligence into the search process. This field brings together research in the fields of IR and HCI in order to create new kinds of search systems that depend on continuous human control of the search process.

History

This term was coined by [http://ils.unc.edu/~march Gary Marchionini] in a series of lectures delivered between 2004 and 2006. [4] [http://ils.unc.edu/~march Marchionini] ’s main thesis is that “HCIR aims to empower people to explore large-scale information bases but demands that people also take responsibility for this control by expending cognitive and physical energy.”

In 1996 and 1998, a pair of [http://www.dcs.gla.ac.uk/irhci/ workshops] at the [http://www.gla.ac.uk University of Glasgow ] on Information Retrieval and Human Computer Interaction sought to address the overlap between these two fields. However, [http://ils.unc.edu/~march Marchionini] notes the impact of the world wide web and the sudden increase in information literacy – changes that were only embryonic in the late 1990’s.

There are a few recent workshops that focus on the intersection of IR and HCI. Initiated by the [http://www.cs.umd.edu/hcil/ Human-Computer Interaction Lab] at the [http://www.umd.edu/ University of Maryland] in 2005, the Workshop on Exploratory Search alternates between the Association for Computing Machinery Special Interest Group on Information Retrieval (SIGIR) and Special Interest Group on Computer-Human Interaction (CHI) conferences. Also in 2005, the [http://www.esf.org/ European Science Foundation] held an [http://www.dcs.gla.ac.uk/IRiX/ Exploratory Workshop on Information Retrieval in Context] . Finally, the first [http://projects.csail.mit.edu/hcir/ Workshop on Human Computer Information Retrieval] was held in 2007 at the Massachusetts Institute of Technology.

What is HCIR?

HCIR includes various aspects of IR and HCI. These include exploratory search, in which users generally combine querying and browsing strategies to foster learning and investigation; information retrieval in context (i.e., taking into account aspects of the user or environment that are typically not reflected in a query); and interactive information retrieval, which [http://www.db.dk/pi/ Peter Ingwersen] defines as “the interactive communication processes that occur during the retrieval of information by involving all the major participants in information retrieval (IR), i.e. the user, the intermediary, and the IR system.” [2]

A key concern of HCIR is that IR systems intended for human users be implemented and evaluated in a way that reflects the needs of those users. [5]

Most modern IR systems employ a ranked retrieval model, in which the documents are scored based on the probability of the document’s relevance to the query. [6] In this model, the system only presents the top-ranked documents to the user. This systems are typically evaluated based on their mean average precision over a set of benchmark queries from organizations like the Text Retrieval Conference (TREC).

Because of its emphasis in using human intelligence in the information retrieval process, HCIR requires different evaluation models – one that combines evaluation of the IR and HCI components of the system. A key area of research in HCIR involves evaluation of these systems. Early work on interactive information retrieval, such as Juergen Koenemann and Nicholas J. Belkin’s 1996 study of different levels of interaction for automatic query reformulation, leverage the standard IR measures of precision and recall but apply them to the results of multiple iterations of user interaction, rather than to a single query response. [3] Other HCIR research, such as [http://www.db.dk/ombiblioteksskolen/medarbejdere/default.asp?cid=677&tid=4 Pia Borlund] ’s IIR evaluation model, applies a methodology more reminiscent of HCI, focusing on the characteristics of users, the details of experimental design, etc. [1]

Goals

[http://ils.unc.edu/~march Marchionini] put forth the following goals towards a system where the user has more control in determining relevant results [4] :

*Systems should aim to get people closer to the information they need, especially to the meaning; that is, systems can no longer only deliver the relevant documents, but must also provide facilities for making meaning with those documents.

*Systems should increase user responsibility as well as control; that is, information systems require human intellectual effort, and good effort is rewarded.

*Systems should have flexible architectures so they may evolve and adapt to increasingly more demanding and knowledgeable installed bases of users over time.

*Systems should aim to be part of information ecology of personal and shared memories and tools rather than discrete standalone services.

*Systems should support the entire information life cycle (from creation to preservation) rather than only the dissemination or use phase.

*Systems should support tuning by end users and especially by information professionals who add value to information resources.

*Systems should be engaging and fun to use.

Techniques

The techniques associated with HCIR emphasize representations of information that use human intelligence to lead the user to relevant results. These techniques also strive to allow users to explore and digest the dataset without penalty, i.e., without expending unnecessary costs of time, mouse clicks, or context shift.

Many search engines have features that incorporate HCIR techniques. Spelling suggestions and automatic query reformulation provide mechanisms for suggesting potential search paths that can lead the user to relevant results. These suggestions are presented to the user, putting control of selection and interpretation in the user’s hands.

Faceted search enables users to navigate information hierarchically, going from a category to its sub-categories, but choosing the order in which the categories are presented. This contrasts with traditional taxonomies in which the hierarchy of categories is fixed and unchanging. Faceted navigation, like taxonomic navigation, guides users by showing them available categories (or facets), but does not require them to browse through a hierarchy that may not precisely suit their needs or way of thinking. [7]

Lookahead provides a general approach to penalty-free exploration. For example, various web applications employ AJAX to automatically complete query terms and suggest popular searches. Another common example of lookahead is the way in which search engines annotate results with summary information about those results, including both static information (e.g., metadata about the objects) and “snippets” of document text that are most pertinent to the words in the search query.

Relevance feedback allows users to guide an IR system by indicating whether particular results are more or less relevant. [8]

Summarization and analytics help users digest the results that come back from the query. Summarization here is intended to encompass any means of aggregating or compressing the query results into a more human-consumable form. Faceted search, described above, is one such form of summarization. Another is clustering, which analyzes a set of documents by grouping similar or co-occurring documents or terms. Clustering allows the results to be partitioned into groups of related documents. For example, a search for “java” might return clusters for Java (programming language), Java (island), or Java (coffee).

Visual representation of data is also considered a key aspect of HCIR. The representation of summarization or analytics may be displayed as tables, charts, or summaries of aggregated data. Other kinds of information visualization that allow users access to summary views of search results include tag clouds and treemapping.

Major Figures

* [http://ciir.cs.umass.edu/~allan/ James Allan]
* Nicholas J. Belkin
* [http://www.db.dk/ombiblioteksskolen/medarbejdere/default.asp?cid=677&tid=4 Pia Borlund]
* [http://ciir.cs.umass.edu/personnel/croft.html Bruce Croft]
* [http://people.ischool.berkeley.edu/~hearst/ Marti Hearst]
* [http://www.db.dk/pi/ Peter Ingwersen]
* [http://people.csail.mit.edu/karger/ David Karger]
* Juergen Koenemann
* [http://ils.unc.edu/~march/ Gary Marchionini]
* [http://users.ecs.soton.ac.uk/mc/ m. c. schraefel]
* Ben Shneiderman
* [http://www.cs.cmu.edu/~quixote/ Daniel Tunkelang]
* [http://research.microsoft.com/~ryenw/ Ryen White]

References


#Borlund, P. (2003). The IIR evaluation model: a framework for evaluation of interactive information retrieval systems. Information Research, 8(3), Paper 152. Available online at http://informationr.net/ir/8-3/paper152.html.
#Ingwersen, P. (1992). Information Retrieval Interaction. London: Taylor Graham. Available online at http://vip.db.dk/pi/iri/index.htm.
#Koenemann, J. and Belkin, N. J. (1996). A case for interaction: a study of interactive information retrieval behavior and effectiveness. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: Common Ground (Vancouver, British Columbia, Canada, April 13 - 18, 1996). M. J. Tauber, Ed. CHI ‘96. ACM Press, New York, NY, 205-212. Available online at http://sigchi.org/chi96/proceedings/papers/Koenemann/jk1_txt.htm.
#Marchionini, G. (2006). Toward Human-Computer Information Retrieval Bulletin, in June/July 2006 Bulletin of the American Society for Information Science. Available online at http://www.asis.org/Bulletin/Jun-06/marchionini.html.
#Mira working group (1996). Evaluation Frameworks for Interactive Multimedia Information Retrieval Applications. Available online at http://www.dcs.gla.ac.uk/mira/.
#Grossman, D. and Frieder, O. (2004). Information Retrieval Algorithms and Heuristics.
#Hearst, M. (1999). User Interfaces and Visualization, Chapter 10 of Baeza-Yates, R. and Ribeiro-Neto, B., Modern Information Retrieval.
#Rocchio, J. (1971). Relevance feedback in information retrieval. In: Salton, G (ed), The SMART Retrieval System.

External links

* [http://www.ils.unc.edu/ISSS_workshop/ 2008 NSF Workshop on Information Seeking Support Systems]
* [http://research.microsoft.com/~ryenw/hcir2008/ 2008 Workshop on Human Computer Information Retrieval]
* [http://projects.csail.mit.edu/hcir/ 2007 Workshop on Human Computer Information Retrieval]
* [http://research.microsoft.com/~ryenw/esi/ 2007 Workshop on Exploratory Search and HCI at ACM SIGCHI]
* [http://facetedsearch.googlepages.com/ 2006 SIGIR Workshop on Faceted Search]
* [http://www.dcs.gla.ac.uk/IRiX/ 2005 ESF Exploratory Workshop on Information Retrieval in Context]
* [http://www.dcs.gla.ac.uk/irhci/ 1998 Workshop on Information Retrieval and Human Computer Interaction]
* [http://www.youtube.com/watch?v=noMQjrACHxQ Video of Gary Marchionini's Human-Computer Information Retrieval Lecture on YouTube]
* [http://www.sciencedirect.com/science/journal/03064573 Information Processing & Management Vol 44] (special issues on evaluation of interactive IR and exploratory search systems)


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Information retrieval — This article is about information retrieval in general. For the fictional government department, see Brazil (film). Information retrieval (IR) is the area of study concerned with searching for documents, for information within documents, and for… …   Wikipedia

  • Relevance (information retrieval) — In the context of information science and information retrieval, relevance denotes how well a retrieved set of documents (or a single document) meets the information need of the user. Topical relevance and other kinds of relevance Relevance most… …   Wikipedia

  • List of human-computer interaction topics — This is a list of topics in human computer interaction. General * accessibility and Computer accessibility * adaptive autonomy * affordance * banner blindness * contextual design and contextual inquiry * gender HCI * gulf of execution *… …   Wikipedia

  • Human-based computation — In computer science, human based computation is a technique when a computational process performs its function via outsourcing certain steps to humans (Kosorukoff, 2001). This approach leverages differences in abilities and alternative costs… …   Wikipedia

  • Cognitive models of information retrieval — rest on the mix of areas such as cognitive science, human computer interaction, information retrieval, and library science. They describe the relationship between a person s cognitive model of the information sought and the organization of this… …   Wikipedia

  • Music information retrieval — (MIR) is the interdisciplinary science of retrieving information from music. MIR is a small but growing field of research with many real world applications. Those involved in MIR may have a background in musicology, psychology, academic music… …   Wikipedia

  • Collaborative information seeking — (CIS) is a field of research that involves studying situations, motivations, and methods for people working in collaborative groups for information seeking projects, as well as building systems for supporting such activities. Such projects often… …   Wikipedia

  • Computer science — or computing science (abbreviated CS) is the study of the theoretical foundations of information and computation and of practical techniques for their implementation and application in computer systems. Computer scientists invent algorithmic… …   Wikipedia

  • computer science — computer scientist. the science that deals with the theory and methods of processing information in digital computers, the design of computer hardware and software, and the applications of computers. [1970 75] * * * Study of computers, their… …   Universalium

  • Computer vision — is the field concerned with automated imaging and automated computer based processing of images to extract and interpret information. It is the science and technology of machines that see. Here see means the machine is able to extract information …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”