[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Killietalk] Scanning Job - JAKA on CD/DVD
Hi Barry,
The AKA does own an old set of KN/JAKA's. Several years ago (2000?) charter member Dick Haas sent his extensive collection of AKA publications to Dick Martino for the AKA archives. I know as he sent me all his non-AKA killie stuff and rare hobbyist magazines. This would be a perfect use of that kind gift to the Association. That said one could scan 1st generation high quality xerox copies of the old magazines.
Bobby
Barry Cooper <bjc3 at centurytel_net> wrote:
I don't want to prolong this discussion, but I would like to comment on
a couple of things Kent says. First, OCR processing to produce the PDF
document does not greatly increase the file size. In fact, for one of
the issues that I have processed, the image files for that issue added
up to 13.2 mb. The finished OCR PDF was 5.2 mb, including a full color
version of the cover, which was 1.6 mb. The scanned page images are
admittedly tiff files, which are larger than jpeg, but at worst doing
OCR to produce the final PDF will be a wash as far as file size goes
(e.g. if the scanned page images were saved as jpegs). Furthermore, an
OCR document has other advantages. The quality of the scan, particularly
with the oldest JAKAs and Killie Notes, can be quite poor, due to poor
quality paper and type. OCR allows these to be corrected, during the
proofing process. In addition, spelling errors in the originals can be
corrected. I'm in two minds about whether we should do so, as the
scanned JAKAs and KNs will form a digital historical archive. In the
issues I have done so far I decided to correct obvious mis-spellings of
common words, but have not corrected or updated nomenclature, etc. In an
OCRed document it would also be possible, perhaps in a "deluxe" edition,
to append comments regarding changed nomenclature, etc. One last problem
is that I don't know who would want their old and precious JAKAs and KNs
to be disassembled for scanning using a drum scanner or document feeder.
The AKA itself does not own a complete set of these documents and I, for
one, would not consent to my old copies being pulled apart. Finally, an
OCR processed PDF can be searched, across issues and volumes, using
Adobe Reader. I would not argue that a separate index would not be
useful, but indexes are not perfect. The ability to search for
particular words in the set of documents on a disk would be useful.
Software is available that allows you to generate an index from a PDF
document.
As everyone says, just my 2 cents.
Regards,
Barry
Barry J. Cooper
Sweet Home, OR 97386
Kent Hunsuckle wrote:
>Getting this done commercialy will run anywhere from $.05-.15 per image from
>most honest individuals if there is no prep work to be done. In the industry
>I have heard of it going for $.25 per image with prep work. (prep is making
>sure the documents can be fed into the scanner ie remove staples and such)
>This kind of work can be found by contacting your friendly micrographics
>(microfilm and fiche) folks as digital imaging is the way of the modern
>world. My suggestion would be to scan it into a PDF format but don't have it
>OCRed, the file size will be incredible and the cost will grow as well then
>you'll never be able to afford it. Also tell them to leave the document
>where you can go in and index it as you see fit. Another suggestion would be
>to scan one year at a time. This will make it easier to index and easier to
>afford to have it done.
>
>OK, I'm done, this was just a few thoughts off the top of my head.
>----- Original Message -----
>From: "Larry B."
>To:
>Sent: Saturday, October 15, 2005 4:30 PM
>Subject: Re: [Killietalk] Scanning Job - JAKA on CD/DVD
>
>
>
>
>>Okay guys. Lets do it this way. As a new member of the BOT (effective
>>1-1-2006) let me get started into this to sorta scope out the project.
>>First I need the BOT to agree that its a good idea for me to dig into this
>>a little. Then lets see what it would cost commercially without giving
>>the
>>baby away and assigning rights to OUR pubs to a commercial outfit. Brian
>>made a good point on that. I've already said I would start looking for
>>quotes from commercial outfits to get some idea of what we're talking
>>about
>>in terms of $$$$. I have an estimate of how many pages of scanning we're
>>talking about from Barry. (Approx. 7500 !) Given the state of the art in
>>OCR scanning today, thats not only a lot of scanning, its a ton of proof
>>reading and correcting. OCR is a long way from being perfect although
>>printed pages are lot less trouble than hand written pages but still
>>everything has to be proof read and corrected.. I have a licensed copy of
>>one of the premier OCR programs and its a long way from perfect.
>>Additionally all of the images should be marked as our property. More
>>work.
>>
>>And I haven't scratched the surface. Remember I just started thinking
>>about
>>this thing.
>>
>>As soon as I get the BOT sign off on me doing this. I'll get back to
>>Killitalk with whatever requests for help I think necessary. Until then
>>give me and the list a break and lets get back to fish on here. I would
>>like to know if anyone out there has any contacts into the businesses that
>>does this kind of work.
>>
>>But if anybody has any really good ideas get to me off-list. All who help
>>will be recognized for that help..
>>
>>Larry Botkin
>>
>>AKA
>>CKA
>>WAKO
>>TAKO
>>
>>
>>
>>
>>
>>At 01:33 PM 10/15/2005, you wrote:
>>
>>
>>>Date: Sat, 15 Oct 2005 11:39:19 -0500
>>>From:
>>>Subject: Re: [Killietalk] scanning job
>>>To: "killifish discussion list"
>>>
>>>Ok Where is the leader of this project?
>>>
>>>----- Original Message -----
>>>From: "Kate Breimayer"
>>>To:
>>>Sent: Saturday, October 15, 2005 11:34 AM
>>>Subject: [Killietalk] scanning job
>>>
>>>
>>>
>>>
>>>>Ok, now you got me curious. Somebody email the requirements off list
>>>>and
>>>>I will see if I can do it and how long it takes. Got a scanner, good
>>>>computers, photoshop and other useful stuff, and lots of free time.
>>>>Also
>>>>a Linux box if someone wants to find an open source version of a
>>>>program
>>>>needed to do this. Note I am NOT the household member with the computer
>>>>skills! I am merely the better half who constantly badgers him to teach
>>>>me. The more I do while unemployed the less damage to the ol resume...
>>>>
>>>>Kate B
>>>>
>>>>
>>>>
>>>>>-----Original Message-----
>>>>>From: killietalk-bounces at aka_org [mailto:killietalk-bounces at aka_org]
>>>>>On
>>>>>Behalf Of Mike Jacobs
>>>>>Sent: Friday, October 14, 2005 10:17 PM
>>>>>To: killietalk at aka_org
>>>>>Subject: [Killietalk] JAKA copy
>>>>>
>>>>>"Many people don't realize what a time consuming job this is if it is
>>>>>to
>>>>>be
>>>>>done properly.
>>>>>
>>>>> Most scanners come with software for optical character
>>>>>recognition,
>>>>>but
>>>>>mistakes always creep in because of dirt or printing imperfections.
>>>>>To
>>>>>proof one page and render it error free takes a detail oriented
>>>>>person.
>>>>>When your done you get a text version of the article but it is not a
>>>>>facsimile. To get something identical to the original you need to
>>>>>match
>>>>>types, margins and all the other print settings.
>>>>>
>>>>> Then you have to scan the pictures and insert them. To get a
>>>>>nice
>>>>>job,
>>>>>you have to calibrate your scanner as well as you monitor. Then you
>>>>>insert
>>>>>the pictures between the text columns.
>>>>>
>>>>> I have done this with several of my bosses' old journal articles.
>>>>>
>>>>>The pages were a little larger than JAKA pages, but not 8.5" X 11".
>>>>>
>>>>>Once I became good at it, it took about a hour to produce one digital
>>>>>page
>>>>>that looked like the original with searchable text.
>>>>>
>>>>> The alternative is just to scan the page like a picture and then
>>>>>make a
>>>>>pdf from the image. This can be done quickly, but is of much poorer
>>>>>quality. Often times the print from the back page shows through and
>>>>>the
>>>>>file size necessary to get a nice monitor viewable image, is larger
>>>>>than
>>>>>a
>>>>>standard pdf. Also the finished product looks like the original, but
>>>>>can
>>>>>not be treated as text. It is really only a picture of the original.
>>>>>When
>>>>>things are going well, this will take about ten to fifteen minutes per
>>>>>page.
>>>>>The end result could be put on DVD's instead of CD's.
>>>>>
>>>>> I think this is a much needed project, but everyone, especially
>>>>>volunteers, should realize that it is a very time intensive project.
>>>>>
>>>>>Before tackling it, make sure you have good computer with plenty of
>>>>>memory
>>>>>and hard drive space, as well as a high resolution scanner.
>>>>>
>>>>>Software made specifically for this process and not the standard OEM
>>>>>software that comes with the scanner, will also be needed.
>>>>>
>>>>> A large club with a computer guru and several less technical but
>>>>>still
>>>>>computer literate volunteers might be more successful, but there would
>>>>>have
>>>>>to be someone overseeing the project to make sure each person is
>>>>>producing
>>>>>the same quality of work.
>>>>>
>>>>> I do hope this gets done sometime, and will congratulate the
>>>>>person
>>>>>who
>>>>>completes it.
>>>>>
>>>>>
>>>>>
>>>>>Bill Martin"
>>>>>
>>>>>
>>>>>
>>>>>
>>>>Join the AKA at http://aka.org/modules/tinycontent0/index.php?id=9
>>>>Archives are at http://fins.actwin.com/killietalk/
>>>>Modify your subscription at
>>>>http://www.actwin.com/mailman/listinfo.cgi/killietalk
>>>>
>>>>
>>>
>>>
>>Join the AKA at http://aka.org/modules/tinycontent0/index.php?id=9
>>Archives are at http://fins.actwin.com/killietalk/
>>Modify your subscription at
>>http://www.actwin.com/mailman/listinfo.cgi/killietalk
>>
>>
>>
>>
>
>Join the AKA at http://aka.org/modules/tinycontent0/index.php?id=9
>Archives are at http://fins.actwin.com/killietalk/
>Modify your subscription at http://www.actwin.com/mailman/listinfo.cgi/killietalk
>
>
>
>
Join the AKA at http://aka.org/modules/tinycontent0/index.php?id=9
Archives are at http://fins.actwin.com/killietalk/
Modify your subscription at http://www.actwin.com/mailman/listinfo.cgi/killietalk
Join the AKA at http://aka.org/modules/tinycontent0/index.php?id=9
Archives are at http://fins.actwin.com/killietalk/
Modify your subscription at http://www.actwin.com/mailman/listinfo.cgi/killietalk