Hi Everyone,
I’m currently working on an invoice automation and reconciliation solution using Power Automate + AI Builder/OCR, where invoices are processed and converted into structured JSON for downstream SAP reconciliation.
We are facing major challenges because invoices come from more than 1000+ vendors, and every vendor has a different invoice layout, table structure, font style, alignment, tax section, and page format.
Our current production accuracy is around 60%, which is creating significant manual validation effort.
Some common issues we are facing:
- Incorrect extraction of invoice number, GSTIN, dates, and totals
- Line item tables breaking across pages
- Multi-line descriptions shifting columns
- Different tax structures (CGST/SGST/IGST/TDS)
- Low-quality scans and image-based PDFs
- OCR reading incorrect numeric values
- Header/footer duplication on multi-page invoices
One critical issue:
In some invoices, the OCR/model reads quantities incorrectly. For example, it gets confused between values like 5 and 6, along with several similar numeric recognition issues.
This causes reconciliation failures and incorrect invoice posting.
Currently, we are mainly using:
- AI Builder “Get text from document”
- Power Automate parsing
- JSON mapping logic
However, since this is primarily OCR-based extraction and not vendor-trained document understanding, accuracy varies heavily depending on invoice quality and layout.
I would appreciate guidance or suggestions on:
- Best approach for handling 1000+ vendor invoice formats
- Improving OCR accuracy in production
- Preprocessing techniques before AI Builder
- Better approaches for table extraction
- Whether Azure Document Intelligence/Form Recognizer performs better than standard AI Builder OCR
- Hybrid approaches using OCR + LLM + rule engine
- Confidence-score-based validation strategies
- Recommended architecture for scalable invoice-to-JSON extraction
Additionally, has anyone worked with for enterprise invoice OCR scenarios?
- Is ABBYY suitable for improving OCR reading accuracy in large-scale invoice processing?
- How does ABBYY compare with Azure Document Intelligence/Form Recognizer for numeric field accuracy, table extraction, and low-quality scanned invoices?
- Has anyone achieved better production accuracy using ABBYY in hybrid architectures with Power Platform or SAP integrations?
Also, if anyone has articles, documentation, benchmarks, or production learnings related to AI Builder extraction accuracy and large-scale invoice processing, that would be extremely helpful.
Would really appreciate recommendations and insights from teams handling enterprise-scale invoice processing systems.
Thanks in advance.