How to extract text from PDF using PAD?


Dear community,

How can I extract data from PDFs and store it in Excel using PAD?

My flow is failing at the step 'Extract text with OCR' with the error message "Failed to extract text with OCR."

Steps used:

1-Create Tesseract OCR engine

2-Extract text with OCR

3-Write text to file (just for testing; eventually it will be an Excel sheet)

Please let me know whether I need any specific configuration.

  • JamesP_MSFT (Microsoft Employee)

    Hello @vaibhavtandon87,

    Right now, Power Automate Desktop has no way to extract text or images from a PDF file.
    The appropriate group of actions will be available in the near future.

     

    Best regards, 
    James

  • vaibhavtandon87

    Thanks, James. How near is the near future? A tentative timeline would help.

  • Verified answer
    JamesP_MSFT (Microsoft Employee)

    @vaibhavtandon87 

    The team has almost completed work on this feature, so it will be available really soon.

    Best regards, 
    James

  • Alexanderderv

    This sounds very interesting and will surely be useful!

    When you say it will be available really soon, could that be before the end of 2020 or at the beginning of 2021?

     

    Keep up the good work!

  • vaibhavtandon87

    @JamesP_MSFT ,

     

    I can now see actions such as extract from PDF, which is great!

    Could you please advise: if I extract a table on the first page, along with headers containing useful information, how can I pull that into Excel as separate pieces of information?

    At the moment it takes all the content from the PDF page and dumps it into a single cell. Can I decompose that content further into useful information, and if so, how?

     

     

  • Community Power Platform Member

    Hi, @JamesP_MSFT! Good evening! 🙂

    Any news on this CV working with PDF files? Do you know of a site where I can track future PAD updates? 🙂 For now I am able to work with the alternative "Extract text from PDF" and just use RegEx with some extra steps, but I would love to use this feature as soon as it is released!

    GK
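
The workaround mentioned above (plain text extraction followed by RegEx) can be illustrated outside PAD. The following is a minimal Python sketch, not a PAD flow: it assumes the extracted page text is already available as a string, and the sample text, field labels, and date formats are hypothetical. The names `extract_fields` and `extract_dates` are illustrative only.

```python
import re

# Hypothetical sample of text extracted from one PDF page (an assumption,
# not output from a real document).
SAMPLE_TEXT = """Invoice Number: INV-1042
Customer Name: Jane Doe
Invoice Date: 12/03/2020
Due Date: 2021-01-15
"""

# "Label: value" pairs, one per line. The label is letters and spaces only.
FIELD_PATTERN = re.compile(
    r"^[ \t]*([A-Za-z][A-Za-z ]*?)[ \t]*:[ \t]*(.+?)[ \t]*$",
    re.MULTILINE,
)

# Dates written as d/m/yyyy (or m/d/yyyy) or as yyyy-mm-dd.
DATE_PATTERN = re.compile(r"\b(?:\d{1,2}/\d{1,2}/\d{4}|\d{4}-\d{2}-\d{2})\b")


def extract_fields(text):
    """Return a dict mapping each label to its value."""
    return {label: value for label, value in FIELD_PATTERN.findall(text)}


def extract_dates(text):
    """Return every date-like string found in the text, in order."""
    return DATE_PATTERN.findall(text)


fields = extract_fields(SAMPLE_TEXT)
dates = extract_dates(SAMPLE_TEXT)

for label, value in fields.items():
    print(f"{label} -> {value}")
print(dates)
```

Each dictionary entry could then be written to its own Excel cell instead of dumping the whole page into one. Real PDFs rarely extract this cleanly, so the patterns would need adjusting to the actual text.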

  • Community Power Platform Member

    Hi,

     

    Any news on this?

  • Hiwm

    Can I extract information like names and dates with regex?
