Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations Chris Miller on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Pdf Conversion question

Status
Not open for further replies.

trbelmore

Technical User
Jan 18, 2005
1
US
There is an old book that someone photocopied each page, then combined each chapter into a .pdf file. I want to get it into a word or text format. I tried a couple of pdf-to-text or pdf-to-doc programs, but they can't convert it as they see the pages as a series of images within the .pdf file. Does anyone know of a program that will accomplish this, or if not a single program, then a series of steps. I don't really want to print it all out and scan and ocr all 1200+ pages, so any simpler solution would be appreciated.
 
No matter how you look at it you are using OCR to convert the file.

OCR on programs are not completely accruate. This is true even if the original piece was generated from an electronic text file (MS-Word etc.) I have several OCR programs and find that they each require follow up after the conversion in order to ensure that characters were converted correctly. With so many pages you are going to take a very long time to get it right. The fact that the pages were scanned from a book is going to make it even worse.

Sorry no help from me just the caveat below

NOTE:
Before you start copying and distributing books or other material, you should think about copyright laws. If your friend does not own copyrights to the peice, maybe you should purchase a second copy instead of trying to pirate one.



Aloha,
cg
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top