Copilot Studio - General
Unanswered

Copilot responses from PDF files

Posted on 22 Sep 2024 06:48:56 by 6
We have uploaded PDF files to SharePoint as a data source.
Are there any limitations on reading text from those PDFs? Some of the PDFs also contain embedded images.
 
 
 
  • Suggested answer
    Vinoth Selvam
    1,527 Super User 2025 Season 1 on 23 Oct 2024 at 08:06:15
    Currently, Copilot should be able to read the text in PDF documents without any issues, as long as the PDF file is properly formatted.
     
    Copilot cannot currently process images embedded in PDFs. However, Microsoft has announced that this capability is coming soon.
     
    But for now, you can check these possibilities,
     
     
     
     
    Thanks.
  • Suggested answer
    SaiRT14
    1,966 Super User 2025 Season 1 on 22 Oct 2024 at 16:46:11
    Please note the following:
    • Power Apps and Power Automate have limitations when extracting text from PDF files, especially when the PDF contains embedded images or non-selectable text.
    • The native Power Automate PDF actions do not directly support extracting text from PDFs that contain images or scanned pages.
    • If your PDFs contain embedded images or are scanned documents (that is, the text is part of an image), text extraction will not work unless Optical Character Recognition (OCR) is used.
    • Power Automate doesn’t have a built-in OCR feature, but you can integrate it with services such as AI Builder or third-party OCR tools.
    • AI Builder (part of the Microsoft Power Platform) can extract text from PDFs, including image-based PDFs, via OCR.
    • SharePoint and Power Automate may run into issues with large PDF files or PDFs with highly complex formatting. Processing times may increase, or extraction may fail for large documents.
    Let me know if you need more details.
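As a rough illustration of the OCR point above, here is a minimal Python sketch for triaging pages before upload. It assumes you have already pulled each page's text layer and image count with a PDF library of your choice. The `PageInfo` type, the function names, and the `min_chars` threshold are all hypothetical and not part of Copilot Studio, Power Automate, or AI Builder.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class PageInfo:
    """Hypothetical per-page summary produced by whatever PDF library you use."""
    number: int       # 1-based page number
    text: str         # text taken from the page's selectable text layer (may be empty)
    image_count: int  # number of embedded images found on the page


def needs_ocr(page: PageInfo, min_chars: int = 20) -> bool:
    """Flag a page that has images but little or no selectable text.

    min_chars is an illustrative threshold, not a documented limit.
    """
    has_text_layer = len(page.text.strip()) >= min_chars
    return page.image_count > 0 and not has_text_layer


def pages_needing_ocr(pages: list[PageInfo], min_chars: int = 20) -> list[int]:
    """Return the page numbers that probably need OCR before indexing."""
    return [p.number for p in pages if needs_ocr(p, min_chars)]


# Example: page 2 is a scan (an image with no text layer), so only it is flagged.
sample = [
    PageInfo(1, "Quarterly report summary for the finance team.", 0),
    PageInfo(2, "", 1),
    PageInfo(3, "Figure 2 shows the overall architecture in detail.", 2),
]
flagged = pages_needing_ocr(sample)
```

Note that page 3 is not flagged because it has a text layer, yet any words that appear only inside its two images would still be missed by plain text extraction.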
  • Mahesh Chintha
    158 on 23 Sep 2024 at 17:20:52
    We have seen that the GPT model behind Copilot Studio was able to read images and company logos and to perform OCR on high-quality images, but OCR quality has degraded since last week.
     
    I recommend uploading documents with high-quality images and testing against the current version.
