We’re setting up a flow where we receive a high volume of invoice PDFs (different suppliers, some scanned, some multi-page) and need to push structured data into Power Automate / Power Apps.
The fields we need are typical invoice fields:
invoice number
vendor name
invoice date
tax / VAT
totals
line items
We’re currently evaluating two approaches:
Using AI Builder directly inside Power Automate
Preprocessing invoices with an external invoice data extractor (for example, DigiParser or a similar tool) and then sending structured JSON into Power Automate
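To make the second approach concrete, here is a rough sketch of the kind of JSON we imagine sending to the flow. Every field name here is our own placeholder, and `build_payload` is a hypothetical helper, not the output format of DigiParser or any other tool. Amounts are kept as strings so nothing gets rounded on the way in.

```python
import json
from decimal import Decimal


def build_payload(invoice_number, vendor_name, invoice_date, tax, line_items):
    """Assemble a JSON-ready dict for one invoice.

    All keys are illustrative placeholders (an assumption), not a real
    extractor's schema. Money values are serialized as strings.
    """
    # Subtotal is derived from the line items so the flow can cross-check it.
    subtotal = sum(
        (Decimal(item["quantity"]) * Decimal(item["unit_price"]) for item in line_items),
        Decimal("0"),
    )
    total = subtotal + Decimal(tax)
    return {
        "invoice_number": invoice_number,
        "vendor_name": vendor_name,
        "invoice_date": invoice_date,  # ISO 8601 date string
        "tax": str(Decimal(tax)),
        "subtotal": str(subtotal),
        "total": str(total),
        "line_items": [
            {
                "description": item["description"],
                "quantity": item["quantity"],
                "unit_price": str(Decimal(item["unit_price"])),
            }
            for item in line_items
        ],
    }


# Made-up sample values, purely for illustration.
payload = build_payload(
    invoice_number="INV-0001",
    vendor_name="Example Supplier Ltd",
    invoice_date="2024-01-15",
    tax="10.00",
    line_items=[
        {"description": "Widget", "quantity": 2, "unit_price": "19.99"},
        {"description": "Shipping", "quantity": 1, "unit_price": "10.02"},
    ],
)
body = json.dumps(payload)  # this string would be the request body sent to the flow
```

The idea is that the flow would only need to parse one fixed shape, regardless of supplier layout or page count.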
Has anyone here tried either approach at scale?
Specifically curious about:
reliability with different invoice layouts
handling multi-page invoices
cost and performance trade-offs
whether external preprocessing simplifies flows long-term
Would appreciate hearing real-world experiences or recommendations.
Thanks!
