web
You’re offline. This is a read only version of the page.
close
Skip to main content

Notifications

Announcements

Community site session details

Community site session details

Session Id :
Power Platform Community / Forums / Power Automate / OCR action incorrectly...
Power Automate
Suggested Answer

OCR action incorrectly identifying text from PDF.

(0) ShareShare
ReportReport
Posted on by 2
Hey, 
 
I have been trying to create a flow in power automate to create extract some fields from my invoices dynamically, into excel. I am using the 'Recognize text in an image or document' action and input my invoice pdf via the 'Get file content' action. I have realized that some fields are not being extracted correctly such as in my invoice the number '156' is appearing as '951' in the output of the OCR. Another example includes the phrase 'GEL FRESH 700ML' being recognized as 'GEL FRESH JOUNL' as well the the number '38.50' being outputted as '18.50'. These are just some of the many incorrectly recognized texts I have provided. Anyone facing the same issue? Please provide a solution. 
 
Thanks. 
Categories:
I have the same question (0)
  • Suggested answer
    venturemavenwill Profile Picture
    1,189 Super User 2025 Season 2 on at
    Have you tried to use AI Prompt in the flow instead?
    This allows you to use the GPT4 model, which I find more reliable than standard OCR 
  • Michael E. Gernaey Profile Picture
    53,493 Super User 2025 Season 2 on at
     
    Are you training a model and using AI Builder to leverage your Model? There is an Invoice one OOB that you can try but I would train a model with some of your files so that you can see if you get better results.
     
  • CU05081759-2 Profile Picture
    2 on at
    I have used Invoice processing model previously, but the moment the position of any of my fields changes or in some cases where there are more of each, the identification fails. 
  • Suggested answer
    takolota1 Profile Picture
    4,974 Moderator on at
    You could try providing a GPT prompt both the raw file & the OCR text & see if that improves accuracy. But frankly I always tell my prompts to default to the OCR text values because I find them to be more accurate than whatever file reader they bolted on to GPT.
     
    Maybe in some cases your prompt is getting confused on which number goes to which label & returning the wrong number, in which case I find it helpful to provide the OCR text as a text replica that replicates a lot of the vertical & horizontal spacing of text in the file.

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Forum hierarchy changes are complete!

In our never-ending quest to improve we are simplifying the forum hierarchy…

Ajay Kumar Gannamaneni – Community Spotlight

We are honored to recognize Ajay Kumar Gannamaneni as our Community Spotlight for December…

Leaderboard > Power Automate

#1
Michael E. Gernaey Profile Picture

Michael E. Gernaey 519 Super User 2025 Season 2

#2
Tomac Profile Picture

Tomac 296 Moderator

#3
abm abm Profile Picture

abm abm 232 Most Valuable Professional

Last 30 days Overall leaderboard