How Invofox Measures Document Extraction Accuracy

By clicking “Accept All Cookies”, you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.

Preferences Deny Accept

Privacy Preference Center

When you visit websites, they may store or retrieve data in your browser. This storage is often necessary for the basic functionality of the website. The storage may be used for marketing, analytics, and personalization of the site, such as storing your preferences. Privacy is important to us, so you have the option of disabling certain types of storage that may not be necessary for the basic functioning of the website. Blocking categories may impact your experience on the website.
When you visit or log in to our website, we and our partners may use cookies or similar tools to link your activity to other information they already have about you—like your email or home address. This information may then be used to send you marketing messages or other communications to those addresses. You may opt out of receiving this advertising by visiting https://app.retention.com/optout
You also have the option to object to the collection of your personal data in accordance with the General Data Protection Regulation. To exercise this right, please visit: https://www.rb2b.com/rb2b-gdpr-opt-out
You can find more information about how email-based retargeting and Retention.com work by visiting https://support.retention.com/en/articles/8826312-how-retention-com-attribution-works
-Residents of California: If you live in California, you have the right to tell companies not to sell your personal information. To do this, just send an email to support@retention.com. In your message, please say that you want to stop the sale of your personal information. You can also choose someone else to send this request for you. Make sure to include the email address of the person who wants to opt out. Any personal details you share in your email will only be used to handle your request. You can find the CCPA Opt-Out Form by visiting: https://app.retention.com/ccpa_details/
-Residents of Europe: Retention.com follows GDPR privacy rules carefully. To help with this, we use a tool in our scripts called geofencing. This tool works through your browser and helps in two important ways:
Location-Based Use: Our services are set up for users who have signed up on U.S.-based websites. We don’t use your real-time IP address to decide whether to collect or use your data. Instead, if you gave your permission on a U.S. website, we keep that data—even if you're later using the internet from another country.
GDPR Compliance: Because we limit our services to users from U.S. websites, we make sure our data practices follow GDPR rules. This is part of our promise to respect privacy laws around the world.

Reject all cookies Allow all cookies

Manage Consent Preferences by Category

Essential

Always Active

These items are required to enable basic website functionality.

Marketing

Essential

These items are used to deliver advertising that is more relevant to you and your interests. They may also be used to limit the number of times you see an advertisement and measure the effectiveness of advertising campaigns. Advertising networks usually place them with the website operator’s permission.

Personalization

Essential

These items allow the website to remember choices you make (such as your user name, language, or the region you are in) and provide enhanced, more personal features. For example, a website may provide you with local weather reports or traffic news by storing data about your current location.

Analytics

Essential

These items help the website operator understand how its website performs, how visitors interact with the site, and whether there may be technical issues. This storage type usually doesn’t collect information that identifies a visitor.

Confirm my preferences and close

Ground Truth: The Starting Point for Measuring Accuracy

Accurate benchmarking starts with an accurate baseline — what we call the ground truth.
It defines the correct data for every field in a document, allowing us to measure extraction accuracy objectively across your processing pipeline.
When a customer shares their labeled data, we use it as the standard reference. If not, Invofox helps define it so the comparisons are consistent and reproducible.

How Invofox Handles Complex Data in Accuracy Evaluation

Document data rarely looks identical, even when it’s correct. Our accuracy evaluation logic adapts to each data type to ensure comparisons remain fair and consistent:

Numbers

Compared within tolerance ranges

Dates

Standardized to avoid time-zone mismatches.

Booleans

Account for missing or unchecked states.

Arrays and Tables

Evaluated by content, not order (unless order is business-critical).

Texts/Strings

Compared at three levels:

Exact Match: For critical fields like totals, IDs, or contract numbers, values must be 100% identical.
Normalized Match: Formatting differences (case, spaces, punctuation) are cleaned before comparison to avoid false mismatches.
Similarity Match (Levenshtein Distance): For flexible fields like names or addresses, we calculate how similar two strings are on a scale from 0–1.

Numbers

Compared within tolerance ranges

Dates

Standardized to avoid time-zone mismatches.

Booleans

Account for missing or unchecked states.

Arrays and Tables

Evaluated by content, not order (unless order is business-critical).

Texts/Strings

Compared at three levels:

Exact Match: For critical fields like totals, IDs, or contract numbers, values must be 100% identical.
Normalized Match: Formatting differences (case, spaces, punctuation) are cleaned before comparison to avoid false mismatches.
Similarity Match (Levenshtein Distance): For flexible fields like names or addresses, we calculate how similar two strings are on a scale from 0–1.

Maintaining Benchmark Consistency When Your Schema Evolves

Adding or removing a field can make old benchmarks impossible to compare.
Invofox tracks schema versions and normalizes changes automatically, so your accuracy results remain valid over time.
When new keys are introduced, we flag affected documents to help you identify what’s changed and maintain clear visibility into your evolving data model.

Accuracy Evaluation Built on Transparency

We believe accuracy metrics should be verifiable, not subjective.
That’s why Invofox runs every eval in-house, using consistent parameters and transparent rules.
Each customer receives both summary metrics and the raw data used to calculate them — no black boxes, no hidden assumptions.
When clients share live feedback, Invofox applies the same evaluation criteria in real time, continuously computing our metrics and refining the models to improve accuracy.

Field

Client Accuracy

Client False Positives

Invofox Accuracy

Invofox False Positives

Document Number

89.4%

5.4%

99.3%

Tax Base Amount

87.9%

3.8%

98.8%

OrderRef

88.7%

6.2%

99.1%

How Invofox Measures Document Extraction Accuracy

Ground Truth: The Starting Point for Measuring Accuracy

How Invofox Handles Complex Data in Accuracy Evaluation

Numbers

Dates

Booleans

Arrays and Tables

Texts/Strings

Numbers

Dates

Booleans

Arrays and Tables

Texts/Strings

From Field Accuracy to Full-Document Reliability

Want to See How Your Vendor Compares?

Maintaining Benchmark Consistency When Your Schema Evolves

Accuracy Evaluation Built on Transparency

Frequently Asked Questions about Accuracy Evaluation

Ready to see how Invofox measures accuracy?