ReCAPTCHA

ReCAPTCHA

reCAPTCHA is a system developed at Carnegie Mellon University which utilizes CAPTCHA to assist in the process of digitizing the text of books, while protecting websites from bots attempting to access restricted areas.

reCAPTCHA supplies subscribing websites with images of words that optical character recognition (OCR) software has been unable to read. The subscribing websites (whose purposes are generally unrelated to the book digitization project) present these images for humans to decipher as CAPTCHA words, as part of their normal validation procedures. They then return the results to the reCAPTCHA service, thereby contributing to the digitization project. The result is that the university receives approximately 3,000 man hours per day of free labor to help in the preservation of books.

reCAPTCHA has the same goal as Distributed Proofreaders, although DP uses conventional proofreaders. The system is reported to deliver 30 million images every day (as of December 2007), [ [http://www.theregister.co.uk/2007/12/13/facebook_captcha_goes_wrong/ Facebook takes the Captcha rap] , "The Register", 13th December 2007] and counts such popular sites as Facebook, TicketMaster, Twitter and StumbleUpon amongst subscribers. [http://news.bbc.co.uk/2/hi/technology/7023627.stm Spam weapon helps preserve books] — BBC news report by Paul Rubens, 2007-10-02.] Craigslist began using reCAPTCHA in June 2008. [http://blog.craigslist.org/2008/06/fight-spam-digitize-books/ "Fight Spam, Digitize Books"] — Craigslist Blog, 2008-06.] The U.S. National Telecommunications and Information Administration also uses reCAPTCHA for its digital TV converter box coupon program website as part of the US DTV transition. [ [https://www.dtv2009.gov/ TV Converter Box Program] ]

Operation

Most implementations use reCAPTCHA in order to validate a website registration. For example, before allowing a visitor to post on a website forum, the website can require the visitor to complete a registration. Usually this registration will require the visitor to have a valid email and, in the case of reCAPTCHA, solve the CAPTCHA image.

In order to verify that humans can decipher these previously unrecognisable words correctly, two words are displayed; one is a word which OCR software has been unable to read, and the other is a word which several other human users have already been able to identify. If the user recognises the identified word, this gives confidence that they were also correct about the new word. The same unknown words are sent to two different people; if they agree then the decipherment is assumed correct, otherwise it is sent to more people until agreement is reached. [cite web |url=http://www.cmu.edu/news/archive/2007/May/may24_recaptcha.shtml |title=May 24: Carnegie Mellon Project Boosts Book Digitization Efforts - Carnegie Mellon University |accessdate=2007-06-23 |format= |work=]

Implementation

reCAPTCHA tests are taken from the central site of the reCAPTCHA project [ [http://recaptcha.net The reCAPTCHA project] - part of the Carnegie Mellon School of Computer Science at Carnegie Mellon University.] as they are supplying the undecipherable words. This is done through a Javascript API with the server making a callback to reCAPTCHA after the request has been submitted. The reCAPTCHA project provides libraries for various programming languages and applications to make this process easier. reCAPTCHA is a free service (that is, the CAPTCHA images are provided to websites free of charge, in return for assistance with the decipherment). [ [http://recaptcha.net/faq.html Recaptcha Faq ] ]

Mailhide

reCAPTCHA has also created project Mailhide [ [http://mailhide.recaptcha.net/ reCAPTCHA Mailhide: Free Spam Protection ] ] which protects email addresses from being harvested by spambot. The email address is converted into a format that does not allow a crawler to see the full email address. For example, the email "noreply@example.com" would be converted to "nor...@example.com" The visitor would then click on the "..." and solve the CAPTCHA in order to obtain the full email address.

Notes

External links

* [http://recaptcha.net/ The reCAPTCHA project]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Recaptcha — Saltar a navegación, búsqueda El logo de reCAPTCHA reCAPTCHA es una extensión de la prueba CAPTCHA que se utiliza para reconocer texto presente en imágenes. reCAPTCHA se basa en el hecho de que para un ser humano puede ser simple determinar el… …   Wikipedia Español

  • ReCAPTCHA — Logo du reCAPTCHA …   Wikipédia en Français

  • ReCaptcha — Logo du reCAPTCHA …   Wikipédia en Français

  • ReCAPTCHA — Логотип reCAPTCHA reCAPTCHA  это система для защиты веб сайтов от интернет ботов (спам ботов), основанная на тесте Тьюринга и призванная оградить веб ресурсы от автоматических алгоритмов и программ путём генерации случайного текста и вывода… …   Википедия

  • reCAPTCHA — The reCAPTCHA logo reCAPTCHA is a system originally developed at Carnegie Mellon University s main Pittsburgh campus. It uses CAPTCHA to help digitize the text of books while protecting websites from bots attempting to access restricted areas.[ …   Wikipedia

  • reCAPTCHA — Логотип reCAPTCHA. reCAPTCHA  система, разработанная в университете Карнеги  Меллон для защиты веб сайтов от интернет ботов, и одновременной помощи в оцифровке текстов книг. Является продолжением проекта …   Википедия

  • reCAPTCHA — Logo du reCAPTCHA. Un exemple de reCAPTCHA : les mots à reconnaître sont «  …   Wikipédia en Français

  • reCAPTCHA — Beispiel einer reCAPTCHA Eingabebox reCAPTCHA ist ein CAPTCHA Dienst, also ein Verfahren, um sicherzustellen, dass eine bestimmte Handlung im Internet von einem Menschen und nicht von einem Bot vorgenommen wird. Das Besondere ist die Tatsache,… …   Deutsch Wikipedia

  • ReCAPTCHA — CAPTCHA [ kæptʃə] ist ein Akronym für Completely Automated Public Turing test to tell Computers and Humans Apart. Wörtlich übersetzt bedeutet das „Vollautomatischer öffentlicher Turing Test, um Computer und Menschen zu unterscheiden“. CAPTCHAs… …   Deutsch Wikipedia

  • Captcha — [ kæptʃə] ist ein Akronym für Completely Automated Public Turing test to tell Computers and Humans Apart. Wörtlich übersetzt bedeutet das „Vollautomatischer öffentlicher Turing Test, um Computer und Menschen zu unterscheiden“. CAPTCHAs werden… …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”