Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

For the threshold-based automation, Confidence thresholds are key. These confidence thresholds are set for each type of field. It can be quite cumbersome to identify the right confidence threshold for each field individually, as you may care more about the trade-off that you need to make on the document level between the error rate and the degree of automation.

 

...

Trade-off error-rate with automation

We allow you to intuitively trade-off the error-rate and the degree of automation through a dedicated chart in the automation pane.

This chart is built based on historical data and shows you what kind of trade-offs are possible with the performance of the Machine Learning pipeline and models of the time of the initial prediction. If needed, you can update the predictions from the past with the latest ML pipeline and models through the Sampling pane.You can the sampling pane.

This graph is interactive, meaning that when you click on any dot in the graph to select the trade-off that best suits your business need. You will notice that the automation bar chart and the confidence thresholds further down the automation pane are updated accordingly.Just click publish if you want these settings to go into effectof the blue dots, the thresholds below are updated to match the dot clicked.

 

...

Set individual confidence thresholds

You can also set individual confidence thresholds. This may be useful if a certain field is "nice-to-have" and you don't want it to adversely impact the automation degree even if the confidence level is low. Conversely, you may want to set the bar very high for certain fields if you really can't afford to make a mistake on that particular field.

You can immediately see what the confidence threshold would have meant on historical data (in terms of true/false positives/negatives).

These thresholds can be tweaked for both the fields and also line items. So it is possible to tweak thresholds preferences per specific line in case some lines are more or less important to extract correct values than the rest of the table.