- Outwit (software)
Infobox Software
name = OutWit
developer = OutWit Technologies
first_release = May 2008
latest_release_version = 0.8.1.91
latest_release_date = September 2008
operating_system = Mac OS X, Windows, Linux
platform = All
genre = Web harvester
website = [http://www.outwit.com/ www.outwit.com]
The OutWit platform is a Web harvester and download management environment developed by OutWit Technologies and first released as a public beta in May 2008.
The central module of the platform, the OutWit Kernel, includes a library of recognition and extraction functions, packaged as a free extension for Mozilla Firefox. Specific applications can be built around the kernel using its application programming interface (API). The platform's license allows advanced users to build and distribute their own original tools, called outfits, which take advantage of the Kernel's features for specific applications. Each outfit is a small XUL extension with its own user interface, features, scripts, scrapers, directory of Web sources and more.
The technology is presented as a step towards a semantic browser that will recognize data and media elements, using metadata when present and inferring semantic information when possible. The software automatically browses through Web sources to harvest information objects and organize them into reusable and sharable collections or mashups.
OutWit Hub is the first tool based on the OutWit platform. The beta version gathers a series of features to ease Web searches and organize collections. By breaking down the elements of a Web page into different types of data (images, links, email addresses, text, tables, etc.), the program allows users to manipulate only the desired data and use it in a variety of applications. The application automatically browses through Web sources in full screen, analyzing each page's navigation links and guessing the most pertinent next page URL. This way, with or without programming skills or technical knowledge, users can create automatic agents and scrapers to gather and format the information they seek.
While some of the data extraction functions are traditional web/screen scraping features, requiring the creation of a specific extraction mask for a page, others act more as intelligent filters, eliminating all data not specifically requested.
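The breakdown of a page into typed data and the guess at the next page URL, as described above, can be illustrated with a short sketch. This is not OutWit's code: OutWit is a Firefox extension, and Python with its standard library is used here only for illustration. The names `PageBreakdown`, `guess_next_page` and `EMAIL_RE`, the sample HTML and the "next" heuristic are all assumptions made for the example.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Deliberately simple e-mail pattern (an assumption, not a full RFC 5322 matcher).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class PageBreakdown(HTMLParser):
    """Hypothetical helper: sorts the elements of one HTML page into typed buckets."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []      # (absolute URL, anchor text) pairs
        self.images = []     # absolute image URLs
        self.emails = set()  # addresses found in mailto: links or in page text
        self._href = None    # URL of the <a> element currently open, if any
        self._text = []      # text collected inside that <a> element

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            href = attrs["href"]
            if href.startswith("mailto:"):
                # Keep only the address, dropping any "?subject=..." part.
                self.emails.add(href[len("mailto:"):].split("?")[0])
            else:
                self._href = urljoin(self.base_url, href)
                self._text = []
        elif tag == "img" and attrs.get("src"):
            self.images.append(urljoin(self.base_url, attrs["src"]))

    def handle_data(self, data):
        self.emails.update(EMAIL_RE.findall(data))
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def guess_next_page(links):
    """Assumed heuristic: first link whose anchor text looks like a 'next' control."""
    for url, text in links:
        if text.lower() in {"next", "next page", "next »", ">", "»"}:
            return url
    return None


if __name__ == "__main__":
    sample = (
        '<p>Contact <a href="mailto:info@example.org">info@example.org</a> '
        "or sales@example.org.</p>"
        '<img src="/img/a.png">'
        '<a href="page1.html">1</a> <a href="page2.html">Next</a>'
    )
    page = PageBreakdown("http://example.com/list/")
    page.feed(sample)
    page.close()
    print(sorted(page.emails))
    print(page.images)
    print(guess_next_page(page.links))
```

A real harvester would need a far richer heuristic for the next page (for example `rel="next"` attributes or numbered pagination patterns). The point of the sketch is only that once links, images and addresses are held in separate collections, each type can be filtered or exported on its own.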
Features
The OutWit kernel's basic feature library includes:
* Data structure recognition
* Automatic multi-page browsing
* Full-screen browsing
* Automatic slide show on image searches
* Page & image link extraction
* E-mail extraction (automatic extraction is limited)
* Table and list extraction
* Syntax-colored page source
* Scraper editor for custom data extraction
External links
* [http://www.outwit.com/ OutWit Technologies' official website]
* [http://addons.mozilla.org/en-US/firefox/addon/7271 Firefox Add-ons]
* [http://www.ghacks.net/2008/08/09/outwit-hub-firefox-web-collection-tool/ gHacks review]
* [http://www.reuters.com/article/pressRelease/idUS141026+31-Jul-2008+BW20080731 Latest Press Release]
* [http://www.atelier.fr/applications/10/09072008/outwit-optimise-extraction-et-reconnaissance-des-donnees-36853-.html article on Atelier.fr]
Wikimedia Foundation. 2010.