- GOCR
Infobox Software
name=GOCR
license=GNU General Public License
developer=Jörg Schulenburg
latest release version = 0.45
latest release date =November 2007
logo=
genre=Optical character recognition
website= [http://jocr.sourceforge.net jocr.sourceforge.net]GOCR (or JOCR) is a free
optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap orPCX ) intotext file s. cite web|url = http://jocr.sourceforge.net/|title = GOCR|accessdate = 2008-06-25|last = Schulenburg|first = Joerg|authorlink = |year = 2007|month = March]Development
According to the program's documentation, as of version 0.44 it is still in the early stages of development. cite web|url = http://www.sfr-fresh.com/unix/privat/gocr-0.45.tar.gz:a/gocr-0.45/README|title = Member "gocr-0.45/README" of archive gocr-0.45.tar.gz|accessdate = 2008-06-25|last = SfR Fresh|authorlink = |year = undated]
It claims to handle single-column sans-serif fonts of 20-60 pixels in height, and reports trouble with serif fonts, overlapping characters, handwritten text, heterogeneous fonts, noisy images, large angles of skew, and text in anything other than a
Latin alphabet .Nomenclature
The application was originally named GOCR which stands for GNU Optical Character Recognition. When it came time to register the project on
SourceForge the name GOCR was already taken so the project was registered as JOCR (Jörg's Optical Character Recognition).As a result of this situation the project and application are known as both GOCR and JOCR. Schulenburg admits that this is problematic.
Formats
Acceptable image formats are:
* pnm
* pbm
* pgm
* ppm
* pcx (some)
* tgaOther formats are automatically converted using netpbm-progs, gzip and bzip2 via the use of a unix pipe. These images types include:
* pnm.gz
* pnm.bz2
* png
* jpg
* tiff
* gif
* bmpBarcodes
GOCR can also translate
barcode s.See also
*
GNU Ocrad References
External links
* [http://jocr.sourceforge.net GOCR Homepage]
Wikimedia Foundation. 2010.