
Data extraction || unstructured || Tables || Data not part of the table

Hello everyone,

I’m aware of some limitations in AI Builder, but I’m hoping someone here can help me work around a particular challenge.

I’m working with multipage PDFs, some of which are over 200 pages long. Each page contains one of eight different types of documents (e.g., time worked data, payroll data, etc.). While the documents differ slightly, I can extract the necessary information as tables using the AI Builder. However, I’m encountering an issue with associating two critical pieces of data with these tables:

  1. Company Name: This appears at the top center of the page and varies from page to page. A single PDF may contain documents from multiple companies.
  2. Date ("For the pay period ending"): This appears once before the table on the relevant pages.

I understand that one of the AI Builder’s limitations is that it only allows tagging data once per table. This means I can't tag data for each row of the table or tag the company name and date fields multiple times across the pages.

I’m considering using a separate AI model to extract the Page/Company Name/Date information and then running it in parallel with the main AI model that extracts the table data. However, I’m not sure how to set this up effectively.

Has anyone encountered a similar challenge, or does anyone have suggestions on how to approach this? Any guidance on running two models in parallel, or on other workarounds, would be greatly appreciated!

Thank you in advance for your help!

  • codeninja.sj
    If the data is not sensitive, can you upload a sample PDF file that was used to train the model?
  • CU19112040-0 (suggested answer)
    You could pair your table-extraction model with a lightweight classifier to pull the page-level company and date data, then merge both outputs afterward; if you need help structuring that flow, you can always Contact Pearl Lemon for guidance. This setup usually works well for large PDFs with mixed document types.
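
The "merge both outputs afterward" step could look something like the sketch below. It is not an AI Builder API, only an illustration of the join logic, and it assumes that every result from both models can be tagged with the page number it came from (for example, by processing the PDF one page at a time). The names `PageInfo` and `merge_by_page`, and the `page`, `company`, and `period_ending` fields, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PageInfo:
    """Page-level fields from the second model (hypothetical shape)."""
    page: int
    company: str
    period_ending: str


def merge_by_page(page_infos, table_rows):
    """Attach company name and pay-period date to each table row.

    Assumes every table row is a dict carrying a "page" key that matches
    PageInfo.page. Rows from pages with no page-level result get None.
    """
    by_page = {info.page: info for info in page_infos}
    merged = []
    for row in table_rows:
        info = by_page.get(row["page"])
        merged.append({
            **row,
            "company": info.company if info else None,
            "period_ending": info.period_ending if info else None,
        })
    return merged
```

Keying on the page number keeps the two models independent, so they can run in either order or in parallel; rows that come back with `None` point to pages where the page-level extraction needs a second look.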
