
Extract Text with OCR - Use another language to specify "trained" data

Hi,

 

I'm using the "Extract text with OCR" action to read text from a grid in an application running under Citrix. The results are OK-ish after following the suggestions in this post - https://powerusers.microsoft.com/t5/Power-Automate-Desktop/Move-mouse-to-text-on-Screen-OCR-worked-yesterday-now-dont-work/m-p/1197401#M5294 - but still not good enough.

 

The text that I need to read (and then compare to an input variable, so it needs to be exact) isn't made up of real words in any language. It also has lots of slashes, e.g. C560XL/XLS/IR. I was thinking that if I could train the engine with this text then I would get better results. Power Automate Desktop has the "Use Other Language" option and allows you to set the language data path. I can't find any instructions on how this works and I keep getting a "Failed to create the OCR engine" error.
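Because the comparison has to be exact and the values are codes rather than words, one option is to post-process the OCR output before comparing it. The following is a minimal sketch, not a feature of Power Automate Desktop: it assumes the set of valid codes is known in advance, that codes are upper-case, and that backslashes and pipes in the OCR output are misread forward slashes. Apart from C560XL/XLS/IR, the codes in `KNOWN_CODES` are hypothetical, as are the function names.

```python
import difflib

# Hypothetical list of valid grid values; only the first one comes from
# the post. In practice this would come from the application's data.
KNOWN_CODES = ["C560XL/XLS/IR", "C525/CJ1", "B737/NG"]

# Assumption: these characters are misreadings of a forward slash.
SLASH_LOOKALIKES = str.maketrans({"\\": "/", "|": "/"})


def normalize(raw: str) -> str:
    """Remove all whitespace, upper-case, and map slash look-alikes to '/'."""
    return "".join(raw.split()).upper().translate(SLASH_LOOKALIKES)


def match_code(raw: str, known=KNOWN_CODES, cutoff: float = 0.8):
    """Return the closest known code, or None if nothing is close enough."""
    cleaned = normalize(raw)
    if cleaned in known:
        return cleaned
    # Fall back to a similarity match for single-character misreads.
    candidates = difflib.get_close_matches(cleaned, known, n=1, cutoff=cutoff)
    return candidates[0] if candidates else None
```

The value returned by `match_code` can then be compared to the input variable with a plain equality check. The cutoff of 0.8 is an arbitrary starting point and would need tuning if the valid codes are very similar to each other.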

 

Can anyone give me more information on this feature? Will it do what I want it to do? How do I specify the data file?

 

I've tried downloading the language data from here - https://github.com/tesseract-ocr/langdata - but I'm clearly doing something wrong...
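One thing worth checking: Tesseract loads compiled `<lang>.traineddata` files, and the langdata repository holds training source material (word lists, training text and similar) rather than compiled models, which are published in the separate tessdata repositories. Whether this is the cause of the error here is not confirmed. The sketch below assumes the "Language data path" should point at a folder that directly contains the `.traineddata` file for the chosen language code; the function names are hypothetical.

```python
from pathlib import Path
from typing import List, Optional


def find_traineddata(data_dir: str, lang: str) -> Optional[Path]:
    """Return the path of '<lang>.traineddata' inside data_dir, if present."""
    candidate = Path(data_dir) / f"{lang}.traineddata"
    return candidate if candidate.is_file() else None


def available_languages(data_dir: str) -> List[str]:
    """List the language codes that have a compiled model in data_dir."""
    return sorted(p.stem for p in Path(data_dir).glob("*.traineddata"))
```

If `available_languages` returns an empty list for the folder configured in the action, the folder holds no compiled models and the engine would have nothing to load.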

 

[Screenshot: cjibb02_0-1635373445893.png]

 

Any help will be much appreciated!

  • fraenK

    For the additional language did you try this? https://docs.microsoft.com/en-us/power-automate/desktop-flows/how-to/ocr-multilingual-documents

    BUT the built-in OCR functionality based on Tesseract is not that great.

    Did you try the Microsoft cognitive action for OCR?

    https://docs.microsoft.com/en-us/power-automate/desktop-flows/actions-reference/microsoftcognitive#ocrmicrosoft

     

    Or would there be a chance to export the grid content as text from the application itself, to take a screenshot and run it through a more advanced third-party OCR tool, or to install PAD within Citrix?

     

    Unfortunately, other RPA products are more advanced with Citrix-based automation.

  • cjibb02

    Thanks, yes, I've followed the steps in that link but I always get "Failed to create OCR engine", even when I use the language files downloaded from GitHub - https://github.com/tesseract-ocr/langdata. I have a support ticket open with Microsoft for this and will update with what they come back with.

     

    I haven't tried the Microsoft OCR Cognitive action yet but will take some screenshots and run these through that service. There will be challenges identifying and cropping the part of the screen I want to run the OCR on, saving it as an image and then parsing the results. So I'm not sure if it will be practical, but it will be interesting to see how it compares to Tesseract over the whole screen image.
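A possible way around the cropping step is to run OCR on the whole screenshot and keep only the words whose bounding boxes fall inside the grid area. The sketch below assumes the service returns JSON shaped as regions, lines and words, each word carrying a `boundingBox` string of the form "x,y,width,height"; that shape, the sample data and the function name are assumptions to verify against the actual response.

```python
def words_in_region(ocr_result: dict, region: tuple) -> list:
    """Return the text of words lying fully inside region (x, y, w, h)."""
    rx, ry, rw, rh = region
    found = []
    for reg in ocr_result.get("regions", []):
        for line in reg.get("lines", []):
            for word in line.get("words", []):
                # Assumed format: "x,y,width,height" in pixels.
                x, y, w, h = (int(v) for v in word["boundingBox"].split(","))
                inside = (x >= rx and y >= ry
                          and x + w <= rx + rw and y + h <= ry + rh)
                if inside:
                    found.append(word["text"])
    return found


# Hypothetical response fragment, for illustration only.
SAMPLE = {
    "language": "en",
    "regions": [{
        "boundingBox": "0,0,500,300",
        "lines": [{
            "boundingBox": "10,10,440,260",
            "words": [
                {"boundingBox": "10,10,120,20", "text": "C560XL/XLS/IR"},
                {"boundingBox": "400,250,50,20", "text": "Other"},
            ],
        }],
    }],
}

grid_words = words_in_region(SAMPLE, (0, 0, 200, 100))
```

The region coordinates would have to match the position of the grid in the screenshot, which only works if the Citrix window is always the same size and in the same place.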
