Extraction of Repeating Fields in a pdf using AI Builder-Form Processing Model


I have PDFs in which a single field appears more than once, with a different value each time, and I'm trying to tag all of those values. Is it possible to tag more than one value for a single field in AI Builder?
If not, what would be an alternative?

 

  • antoinec

    Hi Prakak, if you are trying to extract a list of items that share the same structure, you probably want to look into tables: https://docs.microsoft.com/en-us/ai-builder/create-form-processing-model#tag-tables. Any item with a repeating structure can be extracted using a table, even if it doesn't look like a table. Do let us know if that works.

     

    Antoine

  • Prajakta05

    Hello,
    I tried capturing the repeating fields as a table using a single-page table. I wanted to extract the table from 2 pages, so I added 2 tables named Location schedule 1 and Location schedule 2. After I trained the model on 17 PDFs, it predicted both tables at the same location.

    [Screenshots: Prajakta05_0-1647251493195.png, Prajakta05_1-1647251504441.png]

    Help me with this.

  • antoinec

    Which of the values are you looking to extract? Could you highlight them on a screenshot to help understand what you're looking to achieve? Thanks!

  • Prajakta05

    Hello,

    I'm trying to extract a set of fields called 'Location Schedule' from pages 2 and 3. I tried extracting these sets as tables using advanced tagging, but the predictions are incorrect.

    I've attached screenshots for your reference.

    Thanks in advance!

     

    Table 1 from page 2

    Prajakta05_0-1648622065652.png

     

    Table 2 from page 3

     

    Prajakta05_1-1648622144964.png

     

  • JoeF-MSFT

    Hi @Prajakta05 - thanks for the additional info!

     

    What I would recommend is to use the following flow template, which processes the document page by page: https://docs.microsoft.com/en-us/ai-builder/form-processing-multipage#use-a-cloud-flow-to-process-all-pages-in-the-document That page has the link to the template as well as instructions on how to use it.

     

    By processing page by page, the model will be able to correctly extract the table on each page, and by the end of the flow run you will have captured all the tables.

     

    Keep us posted if this approach works for you or not. 
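
As a rough illustration of the "capture all tables at the end" step, here is a minimal Python sketch of how per-page table results might be combined into one list of rows. It is not Power Automate or AI Builder code: the function name `merge_page_tables`, the `location` column, and the sample rows are hypothetical and only show the merging logic a flow would perform with an array variable.

```python
from typing import Dict, List

Row = Dict[str, str]


def merge_page_tables(tables_by_page: Dict[int, List[Row]]) -> List[Row]:
    """Concatenate per-page table rows in page order, tagging each row with its page."""
    merged: List[Row] = []
    for page in sorted(tables_by_page):
        for row in tables_by_page[page]:
            # Record the source page so rows can be traced back later.
            merged.append({"page": str(page), **row})
    return merged


# Hypothetical extraction results for pages 2 and 3 of one document.
sample = {
    3: [{"location": "Site C"}],
    2: [{"location": "Site A"}, {"location": "Site B"}],
}
merged = merge_page_tables(sample)
for row in merged:
    print(row)
```

Keeping the page number on each row makes it easier to check which page a wrong prediction came from.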

  • somboNR

    Edit:

    I didn't realize we can specify which pages AI Builder should scan.

    Page Range

     

    We can get the number of pages in the PDF using the approach in this link:

    PDF Pages in Power Automate 

     

    We can then loop the document processor once per page, based on that page count.

     

    Hope this helps others looking for this.
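
The loop described above can be sketched in Python, assuming the page count is already known and that each run of the model accepts a single page number as its page range. The names `page_ranges`, `process_each_page`, and `run_model` are hypothetical stand-ins; in a real flow the call would be the AI Builder document processing action inside a loop.

```python
from typing import Callable, List


def page_ranges(page_count: int) -> List[str]:
    """Return one page-range string per page: "1", "2", ..., "N"."""
    if page_count < 1:
        raise ValueError("page_count must be at least 1")
    return [str(n) for n in range(1, page_count + 1)]


def process_each_page(page_count: int, run_model: Callable[[str], object]) -> list:
    """Call the (hypothetical) model runner once per page and collect the results."""
    return [run_model(page) for page in page_ranges(page_count)]


# Stand-in for the real extraction step; it only echoes the page it was given.
def fake_run_model(page: str) -> dict:
    return {"page": page, "rows": []}


results = process_each_page(3, fake_run_model)
print(results)
```

Running the model once per page avoids having to split the PDF into separate files.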

     

    Hi @JoeF-MSFT ,

     

    How do you propose we do this page by page? I couldn't find a free way to split the PDF within the flow.

     

    Thanks

  • takolota1 (Moderator)

    You could also try this template using AI Builder to extract any PDF data to a JSON object:

    https://powerusers.microsoft.com/t5/Power-Automate-Cookbook/Extract-Data-From-PDFs-and-Images-With-GPT/td-p/2201345
