Add a language dictionary for OCR

Open Bee™ Scan OCS uses Tesseract as its OCR engine.

By default, Tesseract’s dictionaries for English, French, German, Dutch, Italian, Spanish, Portuguese, Chinese,  Korean and Japanese languages are included during installation. 

It is possible to add others as follows: 

  • Download additional dictionaries from https://code.google.com/p/tesseract-ocr/downloads/list  
  • Unzip the contents in the “tesseracttessdata” folder of the Open Bee™ Scan OCS installation folder
  • Verify that the language has been added with the command 
    cd "C:Program Files (x86)OpenBeeOpen Bee Scan O.C.Stesseract" 
    tesseract.exe --list-langs 
     

Then add the language in the Open Bee™ Scan OCS  configuration: 

“conf/ocs.conf” file in the installation folder: 

dts.service.languages=["en","fr","chi","tha"]

a restart of the Open Bee™ Scan OCS service is required.