Skip to main content

Notifications

Copilot Studio - General
Unanswered

Copilot responses from PDF files

Posted on by 2
we have uploaded PDF files to sharepoint as data source
is there any limitation in reading text from those pdf's/images are also embedded within PDF
 
 
 
Categories:
  • Suggested answer
    Vinoth Selvam Profile Picture
    Vinoth Selvam 541 on at
    Copilot responses from PDF files
     
    Currently Copilot should be able to read the text present in the PDF documents without any issues. We just need to make sure that PDF file is properly formatted.
     
    Regarding embedded images with PDF, currently copilot cannot process this. But there is announcement from Microsoft that this feature will be out soon, Copilot will soon be able to process the Embedded images inside PDF also.
     
    But for now, you can check these possibilities,
     
     
     
     
    Thanks.
  • Suggested answer
    SaiRT14 Profile Picture
    SaiRT14 336 on at
    Copilot responses from PDF files
    Pls try the following:
    • Power Apps and Power Automate have limitations when it comes to extracting text from PDF files, especially when the PDF contains embedded images or non-selectable text.
    • Native Power Automate PDF actions do not directly support extracting text from PDF files that contain images or scanned documents.
    • If your PDFs contain embedded images or are scanned documents (e.g., the text is part of the image), extracting text will not work unless Optical Character Recognition (OCR) is used.
    • OCR is required to extract text from image-based PDFs or PDFs with embedded images. Power Automate doesn’t have a built-in OCR feature, but you can integrate with services like AI Builder or third-party OCR tools.
    • AI Builder (a part of the Microsoft Power Platform) can be used to extract text from PDFs, including handling image-based PDFs via OCR.
    • SharePoint and Power Automate may encounter issues with large PDF files or PDFs with highly complex formatting. Processing times may increase, or extraction may fail for large documents.
    let me know if you need more details. 
  • Mahesh Chintha Profile Picture
    Mahesh Chintha 137 on at
    Copilot responses from PDF files
    We have seen GPT behind Copilot Studio was able to read images and company logos and does the OCR on high quality images, but we see the OCR is degraded from last week.
     
    I recommend uploading documents with high quality images and test the current version.

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

September 2024 Newsletter…

September 2024 Community Newsletter…

Community Update Sept 16…

Power Platform Community Update…

Tuesday Tip #2 Global Search…

Welcome to a brand new series, Tuesday Tips…

Leaderboard

#1
WarrenBelz Profile Picture

WarrenBelz 142,076

#2
RandyHayes Profile Picture

RandyHayes 76,308

#3
Pstork1 Profile Picture

Pstork1 63,535

Leaderboard