Power Automate - Power Automate Desktop
Unanswered

Extract Pdf Specific Data To Excel In power automate desktop

Posted on 23 Jan 2021 14:57:27 by

Suppose I have 100 PDFs. I need to extract the invoice number, bill address, and total amount from each PDF into an Excel sheet, row by row.

Please help me ASAP.

  • Diaaeldin78
    2 on 21 Oct 2023 at 15:59:45
    Re: Extract Pdf Specific Data To Excel In power automate desktop
  • takolota1
    4,859 Super User 2025 Season 1 on 15 Jun 2023 at 23:21:49
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    If anyone wants to extract data from a PDF or image without training a model for select documents, try this new GPT data extraction method: https://powerusers.microsoft.com/t5/Power-Automate-Cookbook/Extract-Data-From-PDFs-and-Images-With-GPT/td-p/2201345

     

    It doesn’t require specifying certain document areas, wordings, styles, etc. It just OCRs the file, converts it to a replica text (txt), and passes it to a GPT prompt where you can ask GPT to do whatever you want with the document data.
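The prompt step of that approach can be sketched roughly as below. This is only an illustration and not the code from the linked post: the field names, the prompt wording, and the helpers `build_prompt` and `parse_reply` are all hypothetical, and the call to the GPT service itself is left out, so the reply is treated as a plain string.

```python
import json

# Hypothetical field names, taken from the fields asked for in the question.
FIELDS = ["invoice_number", "bill_address", "total_amount"]


def build_prompt(document_text):
    """Wrap the OCR'd document text in an instruction that asks for JSON."""
    return (
        "Extract the following fields from the invoice text below and reply "
        "with a single JSON object using exactly these keys: "
        + ", ".join(FIELDS)
        + ". Use an empty string for any field you cannot find.\n\n"
        + "--- INVOICE TEXT ---\n"
        + document_text
    )


def parse_reply(reply_text):
    """Pull the first {...} span out of the reply and keep only known fields."""
    empty = {field: "" for field in FIELDS}
    start = reply_text.find("{")
    end = reply_text.rfind("}")
    if start == -1 or end <= start:
        return empty
    try:
        data = json.loads(reply_text[start:end + 1])
    except json.JSONDecodeError:
        return empty
    return {field: str(data.get(field, "")) for field in FIELDS}
```

Parsing defensively matters here because a model reply may wrap the JSON in extra words or omit a field; in both cases the row still gets every column.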

  • wintechchen
    12 on 13 Jan 2023 at 20:37:41
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Hi Shannonmatesic,

    Not with Power Automate Desktop, which is harder to use than the Power Automate online solution. I would suggest using Power Automate online, as it is much easier and offers more connectors. In addition, with the AI Builder form processing model, I am able to extract data from PDF invoices to Excel or other outputs easily.

  • momlo
    1,527 Super User 2024 Season 1 on 13 Jan 2023 at 20:15:09
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Well, first of all, asking people on a community forum to answer something ASAP is not the best way to ask a question 🙂

    Secondly, there are other posts discussing this topic in good detail. Did you search for those? What have you tried so far? Can you share your code and point out where it fails?

    Thirdly, you have left valuable information out of your question, so we can only guess what you can or cannot do.

    • Are those PDFs generated by some kind of system, with a text layer?
    • Are those PDFs scanned papers without a text layer?
    • Is it a mix of both?
    • Are they all in the same format, from the same vendor/system?
    • Are they from many different vendors/systems?

    When it comes to reading text from PDFs with a text layer, there are dedicated actions in PAD for that.

    If the invoices you want to read are standardised, in the same format, it is super easy: just use the "Extract text from PDF" or "Extract tables from PDF" actions and that is that. No need for AI Builder here.
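To illustrate what happens after the text has been read from a standardised invoice, here is a minimal sketch of pulling the three requested fields out of the text and writing one row per PDF. It is written in Python for illustration and is not a PAD flow. The regular expressions, the sample invoice wording, and the helper names are assumptions; real invoices will need patterns matched to their actual labels.

```python
import csv
import io
import re

# Hypothetical patterns for ONE standardised layout; adjust to the real labels.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:No\.?|Number)\s*[:#]?\s*(\S+)", re.IGNORECASE),
    "bill_address": re.compile(
        r"Bill\s*To\s*:?\s*(.+?)(?:\n\s*\n|\Z)", re.IGNORECASE | re.DOTALL),
    "total_amount": re.compile(
        r"Total\s*(?:Amount)?\s*:?\s*\$?\s*([\d,]+\.\d{2})", re.IGNORECASE),
}


def extract_fields(text):
    """Return one row (dict) per invoice; missing fields become empty strings."""
    row = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        value = match.group(1) if match else ""
        row[name] = " ".join(value.split())  # collapse line breaks and extra spaces
    return row


def rows_to_csv(rows):
    """Serialise the rows as CSV text, which Excel can open directly."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_PATTERNS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample text standing in for the output of a PDF text-extraction step.
SAMPLE_TEXT = """ACME Ltd
Invoice Number: INV-1001
Bill To:
Jane Doe
12 High Street, Leeds

Total Amount: $1,250.00
"""
```

Calling `extract_fields` on the text of each of the 100 PDFs and passing the collected rows to `rows_to_csv` gives the row-by-row sheet asked for in the question.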

    If the invoices are not all the same but come in a limited number of layouts, there is still no need for AIB: just read the text from the PDFs and create a dedicated routine for each layout.
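One way to picture "dedicated routines" is a small lookup that recognises which layout a document belongs to and then applies that layout's own patterns. The sketch below is in Python and only illustrative: the vendor names, marker strings, and patterns are invented, and in PAD the same idea would be built with conditions and subflows instead.

```python
import re

# Hypothetical layouts: a marker string that identifies the vendor,
# plus the patterns that work for that vendor's invoices.
LAYOUT_ROUTINES = [
    ("ACME Ltd", {
        "invoice_number": r"Invoice Number:\s*(\S+)",
        "total_amount": r"Total Amount:\s*\$?([\d,]+\.\d{2})",
    }),
    ("Globex GmbH", {
        "invoice_number": r"Rechnung Nr\.\s*(\S+)",
        "total_amount": r"Summe\s*([\d.]+,\d{2})",
    }),
]


def pick_routine(text):
    """Return (marker, patterns) for the first layout whose marker appears."""
    for marker, patterns in LAYOUT_ROUTINES:
        if marker in text:
            return marker, patterns
    return None, {}


def extract_with_routine(text):
    """Apply the matching layout's patterns; unknown layouts yield no fields."""
    marker, patterns = pick_routine(text)
    row = {"layout": marker or "unknown"}
    for field, pattern in patterns.items():
        match = re.search(pattern, text)
        row[field] = match.group(1) if match else ""
    return row
```

Rows tagged `unknown` can be routed to manual review, which keeps a new vendor's format from silently producing wrong values.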

    If they are scanned images without a text layer, you could use the built-in OCR, or use AIB, which costs extra.

     

    I would say please answer ASAP, but that would not be nice, so I will not go for that 🙂

  • shannonmatesic
    2 on 13 Jan 2023 at 18:35:25
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    I have the same problem with financial invoices created in a horizontal direction. Did you find a solution?

  • wintechchen
    12 on 02 May 2021 at 22:12:05
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Hi,

    I am able to use AI Builder to create a form processing model that extracts PDF invoice data and writes it to Excel. My challenge is that the financial invoice is laid out horizontally, unlike a purchase order invoice, which is laid out vertically. Also, the invoice often does not have all items on the list, which makes it hard to know which column name is at position 1. Any ideas on how to handle that situation?
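One possible way around the position problem, assuming the header row can be read together with the values, is to key each value by its column name and then lay the row out against a fixed list of expected columns. The sketch below is in Python for illustration only; the column names and the helper `align_row` are hypothetical and do not come from AI Builder.

```python
# Hypothetical full set of columns the Excel sheet should always have.
EXPECTED_COLUMNS = ["Item", "Quantity", "Unit Price", "Tax", "Amount"]


def align_row(headers, values, expected=EXPECTED_COLUMNS):
    """Place each value under its named column; absent columns stay empty."""
    found = {
        header.strip().lower(): value.strip()
        for header, value in zip(headers, values)
    }
    return [found.get(column.lower(), "") for column in expected]
```

For example, an invoice that only carries the `Item` and `Amount` columns still produces a five-cell row, with the three missing columns left blank, so every row lines up in the sheet regardless of which columns the invoice happened to include.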

  • JoeF-MSFT
    on 24 Jan 2021 at 16:40:20
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Hi @Anonymous,

     

    Building on the answer from @miketran13, the following Microsoft Learn module will teach you how you can use AI Builder to extract data from invoices and store the extracted results in an Excel file: Get started with invoice processing in AI Builder - Learn | Microsoft Docs

  • Community Power Platform Member
    on 24 Jan 2021 at 15:24:30
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Thank you for your information.

    Great answer. But in other RPA tools such as UiPath, I have observed that this task is very simple.

    My expectation is that, compared to other tools, Microsoft should provide a simplified basic solution for many scenarios.

     

     

  • miketran13
    720 on 24 Jan 2021 at 06:19:09
    Re: Extract Pdf Specific Data To Excel In power automate desktop

    Hi 

     

    In your case, if you just want to extract specific metadata from the PDFs, such as invoice number, bill address, and so on, and store it in a file, then you just need to create a cloud flow that includes the AI Builder form action. That way you can extract the metadata you need and store it somewhere in the cloud.

     

    If you need to extract data from PDFs and then perform other actions in Windows, such as updating SAP, Salesforce, or other applications, then you should use Power Automate Desktop. Doing that in Power Automate Desktop is a bit more difficult and complicated, though, since you have to use an OCR service. Right now there are OCR actions for Tesseract OCR, Google, Microsoft, and IBM available in Power Automate Desktop.

     

    Thanks and hope it can help you. 

    Mike

    ---------------------------------

    Did I answer your question? Please consider marking my post as a solution to guide others.

     

     
