web
You’re offline. This is a read only version of the page.
close
Skip to main content

Notifications

Announcements

Community site session details

Community site session details

Session Id :
Power Automate
Unanswered

Extract Text with OCR

(0) ShareShare
ReportReport
Posted on by

Hi all,

 

I would like to extract text from a website login page after entering the password.

This text is dynamic when the website is loaded each time.

 

Is there a way to identify the specific subregion of text that the OCR needs to read?

 

Regards

Hidayat

 

 

I have the same question (0)
  • DanielOlsson Profile Picture
    111 on at

    On action Extract text with OCR you can choose "Search mode" and add X and Y cords of the area.Screenshot_9.png

  • Community Power Platform Member Profile Picture
    on at

    Hi @DanielOlsson ,

     

    Thanks for the feedback.

    I tried this but received an error, Failed to extract text with OCR.

     

    For the image to select, I tried both with the text and without text.

     

    Image without text to capture

    Without Text.PNG

     

    Image with text to capture

    With Text.PNG

     

    Regards

    Hidayat

     

  • Verified answer
    geavgous Profile Picture
    Microsoft Employee on at

    Hi Hidayat, 

     

    Thanks for exploring the potential of Power Automate Desktop! Instead of OCR, you could use the Web Automation action , called "Extract Data from Web". This action allows you to select specific web element and get anything you want from it. 

    You can learn more by checking the documentation page in Web automation - Power Automate | Microsoft Docs

     

    In addtion, you could use the Image based recording for using OCR. You can find more details in Recording in a desktop flow - Power Automate | Microsoft Docs

     

    Let me know how it goes!

     

    Thanks, 

    George

  • MichaelAnnis Profile Picture
    5,727 Moderator on at

    I like using the "Create Tesseract OCR engine" and using the multipliers before using "Extract text with OCR"

    Mess with the multipliers until you get the result you need.  The higher the multiplier, the slower the bot; however, it greatly increases the accuracy of the OCR.

     

    Best of luck.

  • Community Power Platform Member Profile Picture
    on at

    Hi @DanielOlsson@geavgous , @MichaelAnnis ,

    Thank you for pointing me into the intended direction.

    I manage to get the solution by combining all of you guys feedback 😀

    And reading the documentation should instead be the first step. 😅

    Awesome work all!

    Overview of Workflow

    Capture.PNG

    Extract text with OCR step

    1) Search mode: Selected subregion relative to image

    2) Tolerance: Increased to 10

     

    Capture2.PNG

     

     

    Regards

    Hidayat

  • JAWL Profile Picture
    393 on at

    Hi Hidayat,

    1. How you actually identify the X1, X2, Y1, Y2 coordinates? Can we capture the coordinates from the position of mouse?
    2. I use Capture Image in the OCR action but it's display is blurred. Or you upload clear image instead?

    JAWL_0-1636078327530.png

     

  • DanielOlsson Profile Picture
    111 on at

    Hello, you can use the move mouse action to determine the X and Y cords of your picture. Not as an active action but while you develop your flow.  Your picture might be blurred as its zooms in when it take the snapshot but the quality is as well determined by the resolution of your screen as well as how you connected to the device you run your flow on, is it local or do you make some kind of remote connection? 

  • Salman222711 Profile Picture
    4 on at

    Hello Jawl, you can use move mouse to image activity>Advanced>Search Mode>Search on Specified subregion of screen or foreground Window.Then you will be getting option to select the region.By double tapping and dragging the area X1,X2,Y1,Y2 Positions will be reflecting.These positions you can note down and use in other OCR actions.Please Check the below image.

     

    Thanks Regards

    Salman

    Screenshot (15).png

     

  • JAWL Profile Picture
    393 on at

    Is it possible to use OCR action move mouse to text found based on the whole multipage pdf file (and not the active screen)?

  • RRRprogram Profile Picture
    5 on at

    Excellennttt brother...Since I am non Computer science background.. I was struggling to find this answer... thanks for your help.

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Forum hierarchy changes are complete!

In our never-ending quest to improve we are simplifying the forum hierarchy…

Ajay Kumar Gannamaneni – Community Spotlight

We are honored to recognize Ajay Kumar Gannamaneni as our Community Spotlight for December…

Leaderboard > Power Automate

#1
Michael E. Gernaey Profile Picture

Michael E. Gernaey 523 Super User 2025 Season 2

#2
Tomac Profile Picture

Tomac 406 Moderator

#3
abm abm Profile Picture

abm abm 245 Most Valuable Professional

Last 30 days Overall leaderboard