web
You’re offline. This is a read only version of the page.
close
Skip to main content

Announcements

News and Announcements icon
Community site session details

Community site session details

Session Id :
Power Platform Community / Forums / Power Automate / Extracting Data from PDF
Power Automate
Answered

Extracting Data from PDF

(0) ShareShare
ReportReport
Posted on by 5,331 Moderator
Practicing flows and trying to extract data from a PDF document of property information.

I am unable to find the PDF invoice column headings in the flow list of dynamic content.

PDF Invoice (The column headers are in bold) -



I've replicated the column headings that I want in my output in the flow below. My issue, none of the heading in the invoice appear in the flow choices of dynamic
content. How do I get the PDF column headers to appear in the list of dynamic content, or otherwise identify which content item to associate with my output
column heading?

Categories:
I have the same question (0)
  • Verified answer
    Robu1 Profile Picture
    1,621 Super User 2026 Season 1 on at
    Hi  ,
     
    Thank your for choosing Power Platform Community.
     
    This issue typically occurs because Power Automate doesn’t automatically recognize formatted elements in a PDF, such as bold column headers. Instead, it extracts raw text without structural data.
     
    Here’s how you can approach solving it:
     
    Possible Solutions:

    Use AI Builder for PDF Extraction
    AI Builder's Form Processing Model allows you to define key fields in a document, training it to recognize specific data points such as column headers.
    You can upload sample invoices to train the model, so it recognizes your headings consistently.
    Extract Text and Use Regular Expressions (RegEx)
    If using Extract text from PDF in Power Automate, parse the text output manually.
    Apply RegEx to identify headers based on expected formatting (e.g., headers that are always in uppercase or follow specific patterns).
    Convert the PDF to Another Format First
    If possible, convert the PDF to Excel or CSV via Power Automate or manual conversion.
    Extract column data directly from structured files rather than raw PDFs.
    Verify PDF Data Extraction Source
    Sometimes, headers may not be appearing because of document formatting issues (e.g., if the PDF is scanned, it might be treated as an image).
    Try opening the document in Adobe Acrobat or another editor to check whether the text can be copied and pasted.

    If any of these fixes the issue, please mark as resolved to help others with find it.
     
    Happy to help 
    Robu1
    SuperUser| Moderator
     

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Introducing the 2026 Season 1 community Super Users

Congratulations to our 2026 Super Users!

Kudos to our 2025 Community Spotlight Honorees

Congratulations to our 2025 community superstars!

Leaderboard > Power Automate

#1
Haque Profile Picture

Haque 523

#2
Valantis Profile Picture

Valantis 318

#3
David_MA Profile Picture

David_MA 235 Super User 2026 Season 1

Last 30 days Overall leaderboard