Power Automate
Answered

Document Processing AI - Multiple Similar Pages in One PDF


Hi There,

 

I am trying to automate a flow that extracts the same information from each page of a multi-page document. It's a package of engineering drawings. Each page has exactly the same layout, with the key pieces of information (drawing number, document revision, job number, etc.) in the same location. However, the number of pages in each document may differ. Ideally I would like to add one row to a table for each page of the document. (I am starting with Excel, but am building up knowledge of SQL and other databases, depending on what my company funds.) The table will look something like this:

 

|             | Document Type            | DWG Package               | WO Number          | CRN                | Vendor DWG Number  | DWG Revision       | Customer Approval  |
|-------------|--------------------------|---------------------------|--------------------|--------------------|--------------------|--------------------|--------------------|
| Data source | From document collection | From file name or similar | From processing AI | From processing AI | From processing AI | From processing AI | From processing AI |
| Row 1       | Structural Drawing       | Package 1                 | xxxxxx             | DWG-STR-xxxxxx     | D-VEN-CUST22-xxx   | 0                  | [initials]         |
| Row 2       | Structural Drawing       | Package 1                 | xxxxxx             | DWG-STR-xxxxxx     | D-VEN-CUST22-xxx   | 1                  | [initials]         |
| Row 3       | Piping Isometric         | Package 2                 | xxxxxx             | DWG-PIP-xxxxxx     | D-VEN-CUST22-xxx   | 1                  | [initials]         |
| Row 4       | Structural Drawing       | Package 3                 | xxxxxx             | DWG-STR-xxxxxx     | D-VEN-CUST22-xxx   | 0                  | [initials]         |
| Row 5       | Piping Isometric         | Package 4                 | xxxxxx             | DWG-PIP-xxxxxx     | D-VEN-CUST22-xxx   | 0                  | [initials]         |

 

The red coloured headings are the fields I can reliably extract from the first page of each package with the document processing AI. These are the most important pieces of information for me to extract. The green 'customer approval' column would also be useful. Currently I extract this information as a table, along with the yellow information below, but once again only from the first page. The yellow information is not necessary, but as it sits in a table it seems easy to gather as one. Below is an example from a drawing; black items do not need to be captured and have been redacted.

 

[Image: Example Drawing Block 1.png]

 

Basically, what I want the AI to do is repeat the same field processing on each page of a PDF, adding one row to the table for each page.
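To make the intended per-page behaviour concrete, here is a minimal Python sketch of the loop described above. It is not a Power Automate or AI Builder API: `rows_for_package`, `extract_page` and `fake_extract` are hypothetical names, and `extract_page` stands in for whatever step runs the trained model on a single page. It assumes the model can be pointed at one page at a time and that the page count is known.

```python
from typing import Callable, Dict, List

# Column names taken from the table in this post.
AI_FIELDS = ["WO Number", "CRN", "Vendor DWG Number", "DWG Revision", "Customer Approval"]


def rows_for_package(
    document_type: str,
    package_name: str,
    page_count: int,
    extract_page: Callable[[int], Dict[str, str]],
) -> List[Dict[str, str]]:
    """Build one table row per page of a drawing package.

    extract_page is a hypothetical callback standing in for the step that
    runs the document processing model on one page (pages numbered from 1).
    """
    rows = []
    for page in range(1, page_count + 1):
        fields = extract_page(page)
        row = {"Document Type": document_type, "DWG Package": package_name}
        # Missing fields become empty strings so every row has the same columns.
        for name in AI_FIELDS:
            row[name] = fields.get(name, "")
        rows.append(row)
    return rows


def fake_extract(page: int) -> Dict[str, str]:
    # Stand-in data only; a real flow would call the trained model here.
    return {"Vendor DWG Number": f"D-VEN-CUST22-{page:03d}", "DWG Revision": "0"}


if __name__ == "__main__":
    for row in rows_for_package("Structural Drawing", "Package 1", 3, fake_extract):
        print(row)
```

In a flow, the same shape would be a loop over page numbers, with one "add a row" step per iteration. Document type and package name stay constant for the whole package, while the remaining columns come from each page.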

  • Verified answer
    JoeF-MSFT (Microsoft Employee)

    Hi @StruggleTownAUS - thanks for sharing your use case. It looks like automating it can save you a lot of time. 🙂

     

    When training the document processing model in AI Builder, did you tag all tables on the document as shown here? https://learn.microsoft.com/en-us/ai-builder/create-form-processing-model#multipage-tables

     

    Another thing you can try is selecting 'Unstructured documents' in the first step of the training process: https://learn.microsoft.com/en-us/ai-builder/create-form-processing-model#select-the-type-of-document. This uses a newer AI technology behind the scenes that performs better with multipage tables. Despite its name, it also works great on structured documents. 🙂
