I think that what I can do is to put the OCR'd text into a folder on my web page, where it can be available at will. Oh, and I *do* have Acrobat Distiller, and can produce a .pdf when it's all complete. That's why I chose to save as a Word document initially...there's a direct link from Word, which will also write a very convoluted HTML.

This will still not be fast. I have a learning curve to follow with ABBYY, which is new to me. (I see that it has a "recognise background" function, but I've no idea how that could operate). Once I develop a technique and a couple of macros I might be able to get an assembly line running!

John W.

----- Original Message -----
From: "Paul N. Lee" <Paul.N.Lee@Worldnet.att.net>
To: <fractint@mailman.xmission.com>
Sent: Tuesday, June 29, 2004 7:42 AM
Subject: Re: [Fractint] FractInt Collection -- Volunteer Project ??
John Wilson wrote:
Hmm, I have a bit of a problem. I purchased a copy of the Second Edition, and started to OCR the thing into Word documents. However, my OCR software, ABBYY, is coughing and choking over some of the "artsy" pages. Somebody decided to print the text *over* faint fractal backgrounds, and the software has trouble resolving these "unknown characters"! The problem is not *too* severe, but more than a normal amount of manual editing is required.
John, if you want to send the "raw" text from after doing the Scan and OCR routine, then I do not mind going through that and doing the editing. We can get a lot accomplished together that way.
After I complete the text editing, I can then upload that to a central location for others to download from, and eventually have a PDF version available as well.