On Jan 23, 5:41=A0pm, "alwysbrke2005" <alwysb...@[EMAIL PROTECTED]
> wrote:
> Greetings everyone,
>
> I am an owner of alot of the books. Especially every Eberron book out
> there. I want to OCR them, but am not quite sure the best way to do it.
>
> I noticed alot of PDF's out there that are either in OCR format, or
> just as images. I have a program called ABBY that can read the images
> and convert them into OCR, but then there is a TON of cleanup to do,
> making it cmopletely not worth while.
>
> My question is, how do I go about scanning my own books, and ensuring
> that they are searchable (in OCR format) right frmo the get go?
>
> Do I need special fonts?
>
> Anything else special?
>
> Thanks for the help!
>
> Alwysbrke
While its illegal to UPLOAD, nothing in the copywrite law (afaik) says
-you- have to make the legal copy for personal use -yourself- so the
easiest way on both yourself and your originals would be check
limewire/bit torrent for existing copies.
Otherwise be sure to keep the originals aligned as straight as
possible, using around 600dpi (higher is better, but slower scanning
and results in much larger files).
Im unfamiliar with ABBY, but generally any scanning program will save
the pages to images for reprinting. The ocr program converts the
images into editable text -- the big snag is finding one capable of
dealing with the multiple columns most books are formatted in without
interlacing/merging the paragraphs into a worse mess than simply
retyping.


|