{"id":24168,"date":"2024-09-26T16:22:53","date_gmt":"2024-09-26T14:22:53","guid":{"rendered":"https:\/\/help.openbee.com\/open-bee-scan-capture\/knowledge-base\/how-to-guides\/why-is-there-no-longer-auto-regex-generation-for-decimal-numeric-or-date-indexes\/"},"modified":"2026-05-13T09:39:39","modified_gmt":"2026-05-13T07:39:39","slug":"why-is-there-no-longer-auto-regex-generation-for-decimal-numeric-or-date-indexes","status":"publish","type":"page","link":"https:\/\/help.openbee.com\/en\/open-bee-scan-capture\/knowledge-base\/how-to-guides\/why-is-there-no-longer-auto-regex-generation-for-decimal-numeric-or-date-indexes\/","title":{"rendered":"Why is there no longer auto-regex generation for decimal, numeric, or date indexes?"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">General<\/h2>\n\n<p>To optimize this feature, we have restricted regex auto-generation for Date, Decimal, and Numeric indexes.<\/p>\n\n<p>For these indexes, the built-in regexes shipped with the software detect most of the expected values. If a value is not detected, troubleshooting should focus on the customization area, discrepancies between the amounts, the rules, and so on.<\/p>\n\n<h2 class=\"wp-block-heading\">Decimal and numeric indexes<\/h2>\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>Decimal and numeric values vary in length all the time.<br\/>Take an invoice amount: it can take different forms, such as 1.42, 35.48, or 3652.45. Capture finds all of these values natively, so auto-generation is not needed.<br\/>However, when values were not found because of faulty customizations or other manipulations, end users tended to run auto-generation in the belief that it would fix the problem. 
Not only did this fail to fix the original problem, it made it worse: after auto-generation, Capture was left with a very strict regex set as the favorite. For example, auto-generation on the value <strong>1.45<\/strong> led Capture to search only for values made of <strong>one digit, followed by the decimal separator, followed by two digits<\/strong>.<br\/>The software could therefore no longer find an amount such as 125.56.<\/p>\n<\/blockquote>\n\n<h3 class=\"wp-block-heading\">Example scenario<\/h3>\n\n<p>Take this invoice: simple customization areas are enough to find the values.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"984\" height=\"217\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-2.png\" alt=\"\" class=\"wp-image-23192\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-2.png 984w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-2-300x66.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-2-768x169.png 768w\" sizes=\"auto, (max-width: 984px) 100vw, 984px\" \/><\/figure>\n\n<p><br\/><strong>Consider the following customer scenario:<\/strong> <\/p>\n\n<p>An invoice from the same supplier arrives with an OCR error.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"966\" height=\"166\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-3.png\" alt=\"\" class=\"wp-image-23195\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-3.png 966w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-3-300x52.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-3-768x132.png 768w\" sizes=\"auto, (max-width: 966px) 100vw, 966px\" \/><\/figure>\n\n<p>As usual, the customer runs regex auto-generation because the amounts are not found.<\/p>\n\n<figure 
class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"782\" height=\"200\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-5.png\" alt=\"\" class=\"wp-image-23204\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-5.png 782w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-5-300x77.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-5-768x196.png 768w\" sizes=\"auto, (max-width: 782px) 100vw, 782px\" \/><\/figure>\n\n<p><br\/>The customer therefore ends up with a new regex set as the favorite in their list of regexes.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"649\" height=\"62\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-4.png\" alt=\"\" class=\"wp-image-23201\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-4.png 649w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-4-300x29.png 300w\" sizes=\"auto, (max-width: 649px) 100vw, 649px\" \/><\/figure>\n\n<p>This new regex: <em>((^|\\s|(?&lt;=\\:)|(?&lt;=\\;)|(?&lt;=\\,)|(?&lt;=\\\u00b0)|(?&lt;=\\.)|(?&lt;=\\())[0-9][0-9],[0-9][a-zA-Z])(\\s|$|(? 
=\\.)|(?=\\,)|(?=\\;)|(?=\\))) <\/em><br\/>now accepts only values made of <strong>two digits, a comma, one digit, and one letter<\/strong>.<\/p>\n\n<p>Looking at the invoice after generation, the amount problem is not solved: the 26 is still not found.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"828\" height=\"213\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-8.png\" alt=\"\" class=\"wp-image-23215\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-8.png 828w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-8-300x77.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-8-768x198.png 768w\" sizes=\"auto, (max-width: 828px) 100vw, 828px\" \/><\/figure>\n\n<p>Worse, the situation has deteriorated: the invoice that was recognized correctly before this manipulation no longer is, because of this new regex.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"674\" height=\"159\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-7.png\" alt=\"\" class=\"wp-image-23211\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-7.png 674w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-7-300x71.png 300w\" sizes=\"auto, (max-width: 674px) 100vw, 674px\" \/><\/figure>\n\n<h3 class=\"wp-block-heading\">Case study (Customer)<\/h3>\n\n<p>Here is a real case from a customer site. It is typical of what we find when customers report that recognition no longer works.<\/p>\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"580\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-9-1024x580.png\" alt=\"\" class=\"wp-image-23218\" 
srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-9-1024x580.png 1024w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-9-300x170.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-9-768x435.png 768w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-9.png 1110w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"476\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-10-1024x476.png\" alt=\"\" class=\"wp-image-23221\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-10-1024x476.png 1024w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-10-300x139.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-10-768x357.png 768w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-10.png 1104w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"336\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-11-1024x336.png\" alt=\"\" class=\"wp-image-23224\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-11-1024x336.png 1024w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-11-300x98.png 300w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-11-768x252.png 768w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-11.png 1112w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n<p>The engine is constrained on all sides by auto-generated regexes, so it no longer has any chance of finding the amounts in this document.<\/p>\n\n<h2 class=\"wp-block-heading\">Date indexes<\/h2>\n\n<p>In the case of date-type indexes, especially for dates with 
months written in letters, we ran into the same danger. For example, auto-generation on the date <strong>March 1, 2020<\/strong> created a regex matching only <strong>one digit, a space, four letters, a space, and four digits<\/strong>.<br\/>The software could therefore no longer find a value such as November 25, 2020.<br\/><br\/>Another problem with date indexes is that, in addition to the regex, Capture must be told the <strong>format of the date, e.g. dd MMMM yyyy,<\/strong> so that it can convert the value found into a Date object. In most cases, users did not know to fill in the format, which left the Capture search blocked by an auto-generated but unusable favorite regex.<\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"764\" height=\"167\" src=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-12.png\" alt=\"\" class=\"wp-image-23227\" srcset=\"https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-12.png 764w, https:\/\/help.openbee.com\/wp-content\/uploads\/2024\/09\/image-12-300x66.png 300w\" sizes=\"auto, (max-width: 764px) 100vw, 764px\" \/><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>General In order to optimize this functionality, we have restricted the use of regex auto-generation for Date, Decimal, and Numeric indexes. For these indexes, the basic regexes present within the software allow you to detect most of the desired values. 
If the value is not detected, research efforts must be focused on: the area, the [&hellip;]<\/p>\n","protected":false},"author":97,"featured_media":0,"parent":21191,"menu_order":13,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-24168","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/24168","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/users\/97"}],"replies":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/comments?post=24168"}],"version-history":[{"count":1,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/24168\/revisions"}],"predecessor-version":[{"id":24169,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/24168\/revisions\/24169"}],"up":[{"embeddable":true,"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/pages\/21191"}],"wp:attachment":[{"href":"https:\/\/help.openbee.com\/en\/wp-json\/wp\/v2\/media?parent=24168"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}