Never-Ending Language Learning

The Never-Ending Language Learning system (NELL) is a semantic machine learning system developed by a research team at Carnegie Mellon University and supported by grants from DARPA, Google, and the NSF, with portions of the system running on a supercomputing cluster provided by Yahoo!.[1]

Process and goals

NELL was programmed by its developers to identify a basic set of fundamental semantic relationships between a few hundred predefined categories of data, such as cities, companies, emotions and sports teams. Since the beginning of 2010, the Carnegie Mellon research team has been running NELL around the clock, sifting through hundreds of millions of web pages. It looks for connections between the information it already knows and what it finds through its search process, and uses them to make new connections in a manner intended to mimic the way humans learn new information.[2] For example, on encountering the word pair "Pikes Peak", NELL would notice that both words are capitalized, deduce from the second word that the pair is the name of a mountain, and then build on the relationships among the surrounding words to deduce other connections.[1]
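The "Pikes Peak" example can be illustrated with a toy heuristic. The following Python sketch is not NELL's actual code: the cue-word table and the function name guess_category are hypothetical, and NELL learns its cues from data rather than reading them from a hand-written table.

```python
# Hypothetical cue words mapping a final word to a category.
# This table is an assumption made for illustration only.
CATEGORY_CUES = {
    "Peak": "mountain",
    "Mountain": "mountain",
    "River": "river",
}


def guess_category(phrase):
    """Guess a category for a two-word phrase in which both words are capitalized.

    Returns None when the phrase does not match the heuristic.
    """
    words = phrase.split()
    if len(words) != 2:
        return None
    # Both words must start with an uppercase letter.
    if not all(word[:1].isupper() for word in words):
        return None
    # The second word acts as the cue for the category.
    return CATEGORY_CUES.get(words[1])


if __name__ == "__main__":
    print(guess_category("Pikes Peak"))
    print(guess_category("pikes peak"))
```

In this sketch a lowercase phrase or an unknown second word simply yields no guess, which reflects only the capitalization and second-word cues mentioned in the example.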

The goal of NELL and other semantic learning systems, such as IBM's Watson system, is to be able to develop means of answering questions posed by users in natural language with no human intervention in the process.[3] Oren Etzioni of the University of Washington lauded the system's "continuous learning, as if NELL is exercising curiosity on its own, with little human help".[1]

By October 2010, NELL had doubled the number of relationships available in its knowledge base and had learned 440,000 new facts, with an accuracy of 87%.[4][1] Team leader Tom M. Mitchell, chairman of the machine learning department at Carnegie Mellon, described how NELL "self-corrects when it has more information, as it learns more", though it does sometimes arrive at incorrect conclusions. Accumulated errors, such as the deduction that Internet cookies were a kind of baked good, led NELL to deduce from the phrases "I deleted my Internet cookies" and "I deleted my files" that "computer files" also belonged in the baked goods category.[5] Clear errors like these are corrected every few weeks by members of the research team, and the system is then allowed to continue its learning process.[1]
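The way one wrong fact can spread may be sketched with a toy pattern-bootstrapping loop. This Python example is only an illustration under simplifying assumptions: the functions learn_patterns and apply_patterns are hypothetical, patterns are plain sentence prefixes, and NELL's real learning components are considerably more elaborate.

```python
def learn_patterns(sentences, known_members):
    """Collect the sentence prefixes that precede known category members."""
    patterns = set()
    for sentence in sentences:
        for member in known_members:
            if sentence.endswith(member):
                patterns.add(sentence[: -len(member)])
    return patterns


def apply_patterns(sentences, patterns):
    """Propose as new members whatever follows a learned prefix."""
    proposed = set()
    for sentence in sentences:
        for pattern in patterns:
            if sentence.startswith(pattern) and len(sentence) > len(pattern):
                proposed.add(sentence[len(pattern):])
    return proposed


if __name__ == "__main__":
    sentences = ["I deleted my Internet cookies", "I deleted my files"]
    # Assumed starting point: the mistaken belief that
    # "Internet cookies" is a baked good.
    baked_goods = {"Internet cookies"}
    patterns = learn_patterns(sentences, baked_goods)
    baked_goods |= apply_patterns(sentences, patterns)
    print(sorted(baked_goods))
```

Because the prefix "I deleted my " was learned from a wrong member, applying it pulls "files" into the same category, which is why periodic human correction of clear errors matters.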

References

  1. "Aiming to Learn as We Do, a Machine Teaches Itself", New York Times, October 4, 2010. http://www.nytimes.com/2010/10/05/science/05compute.html?hpw=&pagewanted=all. Accessed October 5, 2010. "Since the start of the year, a team of researchers at Carnegie Mellon University — supported by grants from the Defense Advanced Research Projects Agency and Google, and tapping into a research supercomputing cluster provided by Yahoo — has been fine-tuning a computer system that is trying to master semantics by learning more like a human."
  2. Project Overview, Carnegie Mellon University. Accessed October 5, 2010.
  3. Trader, Tiffany. "Machine Learns Language Starting with the Facts", HPCwire, October 5, 2010. Accessed October 5, 2010.
  4. "NELL: Never-Ending Language Learning", Carnegie Mellon University. Accessed October 5, 2010.
  5. VanHemert, Kyle. "Right Now A Computer Is Reading Online, Teaching Itself Language", Gizmodo, October 6, 2010. Accessed October 5, 2010.

Wikimedia Foundation. 2010.