Pdftotext

pdftotext is an open-source command-line utility for converting PDF files to plain text files, i.e. extracting the text data from PDF-encoded files. It is freely available and included with many Linux distributions. On Mac OS X (fink install Xpdf) or Windows, it must be installed as part of the Xpdf package.

$ pdftotext file.pdf

This usage produces a text file with the same base name as the input file (here, file.txt). Wildcards (*), for example $ pdftotext *.pdf, cannot be used to convert multiple files, because pdftotext expects only one input file name; a second file name on the command line is taken as the name of the output text file. A shell loop is needed for batch conversions, as in

$ for f in *.pdf
> do
>   pdftotext "$f"
> done

for the bash shell.
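
As a sketch of the same batch idea, the loop can be wrapped in a small shell function that quotes file names (so names with spaces survive), passes the output file name explicitly as the second argument, and skips the unexpanded pattern when a directory contains no PDF files. The function name convert_all_pdfs and the PDFTOTEXT override variable are made up for this illustration and are not part of Xpdf.

```shell
# Hypothetical helper: convert every *.pdf in a directory to a .txt file.
# PDFTOTEXT is an illustrative override; by default the pdftotext
# binary found on PATH is used.
convert_all_pdfs() {
    local dir="${1:-.}"
    local converter="${PDFTOTEXT:-pdftotext}"
    local f
    local status=0
    for f in "$dir"/*.pdf; do
        # If nothing matches, the pattern is left unexpanded; skip it.
        [ -e "$f" ] || continue
        # Name the output explicitly: report.pdf -> report.txt
        if ! "$converter" "$f" "${f%.pdf}.txt"; then
            echo "conversion failed: $f" >&2
            status=1
        fi
    done
    return "$status"
}
```

It could then be called as convert_all_pdfs /path/to/pdfs, or with no argument to process the current directory. The function returns a non-zero status if any single conversion failed, but keeps going through the remaining files.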

The pdftotext program is part of a larger PDF-related package called Xpdf, which can be downloaded from foolabs.com (http://www.foolabs.com/xpdf/download.html).


Wikimedia Foundation. 2010.
