{"id":20398,"date":"2024-02-16T15:59:58","date_gmt":"2024-02-16T14:59:58","guid":{"rendered":"https:\/\/help.openbee.com\/open-bee-portal\/knowledge-base\/how-to-guides\/add-a-language-dictionary-for-ocr\/"},"modified":"2024-02-16T15:59:58","modified_gmt":"2024-02-16T14:59:58","slug":"add-a-language-dictionary-for-ocr","status":"publish","type":"page","link":"https:\/\/help.openbee.com\/en\/open-bee-portal\/knowledge-base\/how-to-guides\/add-a-language-dictionary-for-ocr\/","title":{"rendered":"Add a language dictionary for OCR"},"content":{"rendered":"<div id=\"main-content\" class=\"wiki-content group\">\n<p>Open Bee\u2122 Scan OCS&nbsp;uses Tesseract as its OCR engine.<\/p>\n<p>By default, Tesseract&#8217;s dictionaries for English, French, German, Dutch, Italian, Spanish, Portuguese, <span>Chinese,&nbsp;<\/span> Korean and Japanese languages are included during installation.&nbsp;<\/p>\n<p>It is possible to add others as follows:&nbsp;<\/p>\n<ul style=\"list-style-type: square;\">\n<li>Download additional dictionaries from&nbsp;<a href=\"https:\/\/code.google.com\/p\/tesseract-ocr\/downloads\/list\" class=\"external-link\" rel=\"nofollow\">https:\/\/code.google.com\/p\/tesseract-ocr\/downloads\/list<\/a>&nbsp;&nbsp;<\/li>\n<li>Unzip the contents in the &#8220;tesseracttessdata&#8221; folder of the Open Bee\u2122 Scan OCS installation folder<\/li>\n<li>Verify that the language has been added with the command&nbsp;\n<div class=\"code panel pdl\" style=\"border-width: 1px;\">\n<div class=\"codeContent panelContent pdl\">\n<pre class=\"theme: Confluence; brush: java; gutter: false\" style=\"font-size:12px;\">cd \"C:Program Files (x86)OpenBeeOpen Bee Scan O.C.Stesseract\" \ntesseract.exe --list-langs \n&nbsp;<\/pre>\n<\/div>\n<\/div>\n<\/li>\n<\/ul>\n<p>Then add the language in the Open Bee\u2122 Scan OCS&nbsp; configuration:&nbsp;<\/p>\n<p>&#8220;conf\/ocs.conf&#8221; file in the installation folder:&nbsp;<\/p>\n<div class=\"code panel pdl\" style=\"border-width: 1px;\">\n<div class=\"codeContent panelContent pdl\">\n<pre class=\"theme: Confluence; brush: java; gutter: false\" style=\"font-size:12px;\">dts.service.languages=[\"en\",\"fr\",\"chi\",\"tha\"]<\/pre>\n<\/div>\n<\/div>\n<p>a restart of the Open Bee\u2122 Scan OCS&nbsp;service is required.&nbsp;<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Open Bee\u2122 Scan OCS&nbsp;uses Tesseract as its OCR engine. By default, Tesseract&#8217;s dictionaries for English, French, German, Dutch, Italian, Spanish, Portuguese, Chinese,&nbsp; Korean and Japanese languages are included during installation.&nbsp; It is possible to add others as follows:&nbsp; Download additional dictionaries from&nbsp;https:\/\/code.google.com\/p\/tesseract-ocr\/downloads\/list&nbsp;&nbsp; Unzip the contents in the &#8220;tesseracttessdata&#8221; folder of the Open Bee\u2122 Scan OCS [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"parent":20163,"menu_order":4,"comment_status":"closed","ping_status":"closed","template":"templates\/ob-help-products.php","meta":{"footnotes":""},"class_list":["post-20398","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/20398","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/comments?post=20398"}],"version-history":[{"count":0,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/20398\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/20163"}],"wp:attachment":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/media?parent=20398"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}