web
You’re offline. This is a read only version of the page.
close
Skip to main content

Notifications

Announcements

Community site session details

Community site session details

Session Id :
Power Platform Community / Forums / Power Automate / Extract PDF data and s...
Power Automate
Unanswered

Extract PDF data and save into sharepoint list

(0) ShareShare
ReportReport
Posted on by

Hello Friends,

I have one business case, where i have to get PDF attachment from outlook, then extract data from that PDF file and finally save extracted data into sharepoint list. AI Builder or third party tools are not allowed here. so anyone has any idea how to achieve this?  

Note: both cloud and desktop flow solutions are welcome

 

Thank you in advance!

Categories:
I have the same question (0)
  • Gopala_Krishna Profile Picture
    1,495 on at

    @Anonymous 

    You can make use of the PDF extraction actions available in desktop flow to extract data from the PDF document. There are also few other actions available such as OCR recognition which can also be leveraged to get the desired output.

     

    https://learn.microsoft.com/en-us/power-automate/desktop-flows/actions-reference/pdf

     

    If the information shared helps you please consider giving a thumbs up and mark solution as resolved

    Please follow my website PowerCards for more information related to Power Platform
  • takolota1 Profile Picture
    4,974 Moderator on at

    If the PDF data is in different formats or wordings across documents, then you may need something with AI Builder like this to extract any PDF data to a JSON object:

    https://powerusers.microsoft.com/t5/Power-Automate-Cookbook/Extract-Data-From-PDFs-and-Images-With-GPT/td-p/2201345

     

    Otherwise, if you have standardized PDF formats, you may be able to use the text conversion piece in that template and some more complicated regex.

  • takolota1 Profile Picture
    4,974 Moderator on at

    If the PDF data is in different formats or wordings across documents, then you may need something with AI Builder like this to extract any PDF data to a JSON object:

    https://powerusers.microsoft.com/t5/Power-Automate-Cookbook/Extract-Data-From-PDFs-and-Images-With-GPT/td-p/2201345

     

    Otherwise, if you have standardized PDF formats, you may be able to use the text conversion piece in that template and some more complicated regex.

  • jkrey Profile Picture
    26 on at

    I had a similar need to comb through multiple PDFs, extract specific data, and save the results to a Sharepoint list to use in a PowerApp. I ended up using Python's PyMuPDF (https://pypi.org/project/PyMuPDF/), writing Python code to extract the data fields to a .csv file. From there, data were imported to Excel and then exported to Sharepoint. NOTE: The PDF format is primarily for printing, so don't expect that blocks of text will be in a logical order. It can be painstaking to find out which block number on a page actually contains the data you want. Perhaps the newer PDF extraction to Excel tool in Power Automate would make this simpler.

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Forum hierarchy changes are complete!

In our never-ending quest to improve we are simplifying the forum hierarchy…

Ajay Kumar Gannamaneni – Community Spotlight

We are honored to recognize Ajay Kumar Gannamaneni as our Community Spotlight for December…

Leaderboard > Power Automate

#1
Michael E. Gernaey Profile Picture

Michael E. Gernaey 522 Super User 2025 Season 2

#2
Tomac Profile Picture

Tomac 364 Moderator

#3
abm abm Profile Picture

abm abm 243 Most Valuable Professional

Last 30 days Overall leaderboard