web
You’re offline. This is a read only version of the page.
close
Skip to main content

Notifications

Announcements

Community site session details

Community site session details

Session Id :
Power Platform Community / Forums / Power Apps / AI Builder misses text...
Power Apps
Answered

AI Builder misses text after a colon in document processing

(1) ShareShare
ReportReport
Posted on by 22

Hi!

 

We're using a custom AI model to process PDF documents that our clients sent to a mailbox. Each PDF document has 7 pages with a lot of fields that need to be filled in. One of the fields is a filenumber with the following format XX:000000

So this can be for example TD:123456. Some clients fill in the form a bit different, so sometimes it's being filled in as TD: 123456 or TD 123456.

When we train our model with this values and test the model after successful training, it seems impossible for the model to identify the full value when this is being filled in as TD: 123456. The only part that's being extracted is TD:

The other possible values are being detected correctly. It looks like it's struggling with the colon (:).

The accuracy score for the field is 99% and in the quick test it returns a confidence score of 99% as well, despite it's not detected correctly.

 

What can we do with the AI model to get this value correct?

 

Thanks in advance.

Categories:
I have the same question (0)
  • Verified answer
    bleupen Profile Picture
    22 on at

    The issue is resolved, there was to less difference in the documents trained in the different collections. Therefore the wrong collection was used and our trained documents in the collection weren't even used. The collection being used can be found in the Power Automate Flow action. Thanks for your help!

  • bleupen Profile Picture
    22 on at

    Hi plarrue,

     

    Thanks for your repsonse.

     

    We've added more documents with this filenumber, but getting the same results. There are 8 documents in this collection now. There are some more fields where a colon is used, and the AI consistently stops extracting/recognizing all text after this character. 

     

    Thanks again!

  • plarrue Profile Picture
    Moderator on at

    Hi @bleupen 

     

    Thank you for sharing your use case. One way this can improve is by adding more documents for training to your model.

    Could you please try by retraining your model with more documents where the field is TD: 123456 and see if you have a better result?

     

     

    Thanks!

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Forum hierarchy changes are complete!

In our never-ending quest to improve we are simplifying the forum hierarchy…

Ajay Kumar Gannamaneni – Community Spotlight

We are honored to recognize Ajay Kumar Gannamaneni as our Community Spotlight for December…

Leaderboard > Power Apps

#1
Kalathiya Profile Picture

Kalathiya 427

#2
WarrenBelz Profile Picture

WarrenBelz 360 Most Valuable Professional

#3
MS.Ragavendar Profile Picture

MS.Ragavendar 336 Super User 2025 Season 2

Last 30 days Overall leaderboard