User login

Scanning a book: hardware and software problems

Tue, 04/13/2010 - 14:32 - augustin

As I am opening this site, one of the most urgent things I have to do is scan a whole book, pass it through an OCR software in order to publish this rare book on the internet.

It is thus that the first few issues in this site are all related to scanning and ORC:
#11: OCR with tesseract: garbage output
#12: tesseract language setting
#13: mass processing TIFF images: GIMP scripts

I managed to make tesseract work.

But now, I need to buy a scanner so that I can scan the whole book. And it will be handy to scan miscellaneous things, now and then.
#17: Which scanner for Ubuntu?

So, not only are the first few issues related to scanning and ORC, but so are the first few wiki pages created on this site:
http://linux.overshoot.tv/wiki/ocr_optical_character_recognition
http://linux.overshoot.tv/wiki/scanners

The book is in French and will be published there:
http://3enjeux.overshoot.tv/

augustin's blog
Login or register to post comments

User login

Tickets per project

Scanning a book: hardware and software problems

Who's online