|
non searchable PDF convert to TIFF ocr back to searchable PDF
Message-ID:<90ce57c7-8120-4a17-965c-6a789dac467f@b18g2000vbl.googlegroups.com>
Subject:non-searchable PDF, convert to TIFF, ocr back to searchable PDF
Date:Wed, 3 Feb 2010 19:33:47 +0100
I have a non-searchable (because there's no encoding provided) PDF
file from which I'd like to extract its text. Can I do the following
convert the non-searchable PDF file to a TIFF file
use OCR to convert the TIFF file to a searchable PDF
?
If so, how effective is this method and what program(s) would you
recommend for each step? There seem to be many that will convert
from PDF to TIFF. I would think that Adobe Acrobat would be able to
do this easily.
Thanks,
Ted
Message-ID:<pan.2010.02.03.19.52.10@lutrina>
Subject:Re: non-searchable PDF, convert to TIFF, ocr back to searchable PDF
Date:Wed, 3 Feb 2010 20:46:38 +0100
On Wed, 03 Feb 2010 10:33:47 -0800, teds@intex.com ci disse:
> convert the non-searchable PDF file to a TIFF file use OCR to convert
> the TIFF file to a searchable PDF
[...]
truely speaking, you can do directly an ocrzation with
*Abbyy Finereader 8*
- http://www.abbyy.com/
if you want use only freeware software, you can use
*ghostscript*
- http://mirror.cs.wisc.edu/pub/mirrors/ghost/GPL/current/
with its graphical frontend
*Gsview*
- http://pages.cs.wisc.edu/~ghost/gsview/index.htm
--
Puppy Linux wiki: http://puppylover.netsons.org/dokupuppy
Puppy Linux Forum: http://puppylinux.ilbello.com
Windows me genuit, Ubuntu rapuere / tenet nunc Puppy Linux...
Message-ID:<hke5oo$29b$00$1@news.t-online.com>
Subject:Re: non-searchable PDF, convert to TIFF, ocr back to searchable PDF
Date:Thu, 4 Feb 2010 10:58:41 +0100
teds@intex.com wrote:
> I would think that Adobe Acrobat would be able to do this easily.
Yes.
|