» article_clean » tesseract-ocr-en

tesseract-ocr-en  

17. April 2010

These pages are dedicated to my tests of tesseract-ocr 3.00 and related software. All tests were done on Mandrivalinux 64bit primary.

In past I tried to test/train tesseract (2.04) for Slovak lan­guage. Version 3.00 brought support for a lot of lan­guages including Slovak. Its ocr result differs based on input scans. Unfortunately training data are not available, so I can not improve it. Training process is still not described in every details.

For this reason I started to find a way how to create lan­guage file with my data. I expect that reader is familiar with ReadMe, FAQ and Training process.

articles

download

gui for tesseract:



RSS