Hello
I'm using a structured and have also used un-structured custom document processing model to capture table data from a pdf but as you can see below image it sometimes misses out the hyphen that i need in the amount column. Really can't see why it's doing this? There's no consistency to it..
If someone from MS can please advise as i need to get this over the line for a finance project.
Hello,
You are facing a known issue related to the OCR engine. Our team is working to improve that capability.
As a workaround, you could make the logic conditional or use GPT data extraction method.
This post should help you building that:
Custom model can no longer handle hyphens ( - ) in... - Power Platform Community (microsoft.com)
Hope that helps,
Samia
Hi Samia
Thanks for the quick response and suggestions provided. This has now been resolved bizarrely by playing around with the settings in print preview on the original Excel file. The OCR is now picking up the hyphen after quite a few tests. It's just really weird that it was picking up the hyphen some of the time before but as you say your team is looking into improving the algorithm.
Many Thanks
Rob
Under review
Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.
In our never-ending quest to improve we are simplifying the forum hierarchy…
We are honored to recognize Ajay Kumar Gannamaneni as our Community Spotlight for December…
These are the community rock stars!
Stay up to date on forum activity by subscribing.
WarrenBelz 757 Most Valuable Professional
Michael E. Gernaey 322 Super User 2025 Season 2
MS.Ragavendar 209 Super User 2025 Season 2