tesseract language setting

Project:Linux software
Component:Miscellaneous
Category:support request
Priority:normal
Assigned:Unassigned
Status:closed
Related pages:#10: OCR - optical character recognition
Description

I did a first test scan of a page of a French book.
The result was about 95% correct, the mistakes being with the same accented characters.

I read somewhere that tesseract could be trained, but I have not found out how, yet....

Comments

#1

wiki

#2

Maybe there are language settings or character sets to take into consideration...

#3

Title:How to train tesseract» tesseract language setting

I was simply missing the -l fra language setting.

#4

Status:active» fixed

#5

Status:fixed» closed
Related pages:-10: OCR - optical character recognition

Automatically closed -- issue fixed for 2 weeks with no activity.