1997-01-05 - Re: OCR and Machine Readable Text

Header Data

From: Dale Thorn <dthorn@gte.net>
To: Steve Stewart <steve@resudox.net>
Message Hash: f8099e3eaffa2c173506ead5cc4e9eee895d921cf03b07944cbb4f106586b52e
Message ID: <32CFF545.C27@gte.net>
Reply To: <3.0.1.32.19970102225436.01072284@mail.teleport.com>
UTC Datetime: 1997-01-05 18:40:37 UTC
Raw Date: Sun, 5 Jan 1997 10:40:37 -0800 (PST)

Raw message

From: Dale Thorn <dthorn@gte.net>
Date: Sun, 5 Jan 1997 10:40:37 -0800 (PST)
To: Steve Stewart <steve@resudox.net>
Subject: Re: OCR and Machine Readable Text
In-Reply-To: <3.0.1.32.19970102225436.01072284@mail.teleport.com>
Message-ID: <32CFF545.C27@gte.net>
MIME-Version: 1.0
Content-Type: text/plain


Steve Stewart wrote:
> I have used OCR a fair bit, and I agree with you,  I think you're being
> generous by saying  even a 65% accuracy rate. I think our OCR technology
> today is pathetic, and it would be quicker just to type the damn
> documents ourselves. I've used a bunch of different packages from guys
> like HP, and others. I certainly don't know what Alan Olsen was using.

[snip]

I needed OCR to create indexed text databases of federal documents,
particularly legislation.  The amount of hand editing required is
enormous.  That alone would justify (in a sense) the use of off-
shore labor.






Thread