Outwit (software)

Outwit (software)

Infobox Software
name = OutWit
developer = OutWit Technologies
first_release = May 2008
latest_release_version = 0.8.1.91
latest_release_date =September 2008
operating_system = Mac OS X, Windows, Linux
platform = All
genre = Web harvester
website = [http://www.outwit.com/ www.outwit.com]

The OutWit platform is a Web harvester and download management environment developed by OutWit Technologies and originally released as a first public beta in May 2008.

The central module of the platform, the OutWit Kernel, includes a library of recognition and extraction functions, packaged as a free extension for Mozilla Firefox. Around the kernel can be created specific applications using the application programming interface API. The platform's license allows advanced users to build and distribute their own original tools —called outfits— taking advantage of the Kernel's features for specific applications. Each outfit is a small XUL extension, with its own user interface, features, scripts, scrapers, directory of Web sources...

The technology is presented as a step towards a semantic browser which will recognize data and media elements using metadata when present and inferring semantic information when possible. The software automatically browses through Web sources to harvest information objects and organize them into reusable and sharable collections or mashups.

OutWit Hub is the first tool based on the OutWit platform. The beta version gathers a series of features to ease Web searches and organize collections. By breaking-down the elements of a Web page into different types of data, i.e. images, links, email addresses, text, tables etc., the program allows users to manipulate only the desired data and use it in a variety of applications. the application automatically browses through Web sources in full screen, analyzing each page’s navigation links and guessing the most pertinent next page URL. This way, with or without programming skills or technical knowledge, users can create automatic agents and scrapers to gather and format the information they seek.

While some of the data extraction functions are traditional web/screen scraping features, requiring the creation of a specific extraction masks for a page, others act more as intelligent filters eliminating all data not specifically requested.

Features

The OutWit kernel's basic feature library includes:

* Data structure recognition
* Automatic multi-page browsing
* Full-screen browsing
* Automatic slide show on image searches
* Page & image link extraction
* e-mail extraction (automatic extraction is limited)
* Table and list extraction
* Syntax colored page source
* Scraper editor for custom data extraction

External links

* [http://www.outwit.com/ OutWit Technologies' official website]
* [http://addons.mozilla.org/en-US/firefox/addon/7271 Firefox Add-ons]
* [http://www.ghacks.net/2008/08/09/outwit-hub-firefox-web-collection-tool/ gHacks review]
* [http://www.reuters.com/article/pressRelease/idUS141026+31-Jul-2008+BW20080731 Latest Press Release]
* [http://www.atelier.fr/applications/10/09072008/outwit-optimise-extraction-et-reconnaissance-des-donnees-36853-.html article on Atelier.fr]


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Screen scraping — is a technique in which a computer program extracts data from the display output of another program. The program doing the scraping is called a screen scraper. The key element that distinguishes screen scraping from regular parsing is that the… …   Wikipedia

  • Diomidis Spinellis — Diomidis D. Spinellis (Greek: Διομήδης Δ. Σπινέλλης; February 2, 1967, Athens) is a Greek computer science academic and author of the books Code Reading and Code Quality. Spinellis holds an MEng degree in Software Engineering and a Ph.D. in… …   Wikipedia

  • Спинеллис, Диомидис — В Википедии есть статьи о других людях с такой фамилией, см. Спинеллис. Диомидис Спинеллис греч. Διομήδης Δ. Σπινέλλης Дата рождения: 2 февраля …   Википедия

  • Computers and Information Systems — ▪ 2009 Introduction Smartphone: The New Computer.       The market for the smartphone in reality a handheld computer for Web browsing, e mail, music, and video that was integrated with a cellular telephone continued to grow in 2008. According to… …   Universalium

  • environment — environmental, adj. environmentally, adv. /en vuy reuhn meuhnt, vuy euhrn /, n. 1. the aggregate of surrounding things, conditions, or influences; surroundings; milieu. 2. Ecol. the air, water, minerals, organisms, and all other external factors… …   Universalium

  • Pirates of the Burning Sea — Infobox VG title = Pirates of the Burning Sea developer = Flying Lab Software publisher = Sony Online Entertainment (Europe, USA)cite web |url= http://www.soepress.com/release.asp?i=123 |title= Pirates of the Burning finds a home port with… …   Wikipedia

  • Segmented downloading — (also known as multisource downloading OR swarming download) can be a more efficient way of downloading files from many peers at once. The one single file is downloaded, in parallel, from several distinct sources or uploaders of the file. This… …   Wikipedia

  • Strategic management — is a field that deals with the major intended and emergent initiatives taken by general managers on behalf of owners, involving utilization of resources, to enhance the performance of firms in their external environments.[1] It entails specifying… …   Wikipedia

  • Alphacet — Infobox company company name = Alphacet, Inc. company type = Private Company foundation = 2007 location city = Stamford, CT industry = Financal Technology homepage = [http://www.alphacet.com?w2 www.alphacet.com] [http://www.alphacet.com?w2… …   Wikipedia

  • Advance-fee fraud — African sting An advance fee fraud is a confidence trick in which the target is persuaded to advance sums of money in the hope of realizing a significantly larger gain.[1] Among the variations on this type of scam are the Nigerian Letter (also… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”