Copilot Studio
Suggested Answer

Review upload images capability

Posted by Microsoft Employee
Hello! I’d love some guidance on how to let my agent review uploaded images, specifically to evaluate the text in them. This capability works in my custom Copilot 365 agent, but I haven’t been able to figure out how to set it up in Copilot Studio. If I wanted a walkthrough with a person, would it be possible to set that up? Thank you!
  • Suggested answer
    Haque

    Hi @JM-20021632-0

    Let's see if these steps help:

    To set up text evaluation from images in a copilot, we first need to enable image uploads. To do so:
     
    1. Make sure the CSE (Copilot Studio Environment) allows image uploads as input to the agent. For this, we need to configure the input schema or interface to accept image files (PNG/JPG).
     
    2. Put an OCR service or API in place to extract text from the image. Possible options are Azure Cognitive Services Computer Vision OCR (CV-OCR), the Microsoft Read API, or any open-source OCR library (if running locally). For this, we need to set up API keys and permissions for the OCR service.
     
    3. In Copilot Studio, implement the image processing logic (backend or workflow) that receives the uploaded image, sends it to the OCR service, and receives the extracted text.
     
    4. Once the text is extracted, send it to the agent's evaluation logic. Here we can use prompt engineering or custom logic to analyze, summarize, or validate the extracted text.
     
    5. To return the results to the user, format the agent's output based on the text evaluation. We can provide feedback or insights derived from the image text.
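
For anyone who finds the flow easier to follow as code, here is a minimal Python sketch of steps 1 to 5. It is not Copilot Studio configuration and does not call any real OCR service. It only illustrates the logic a backend or workflow would need. All names (`detect_image_type`, `evaluate_text`, `review_image`) are hypothetical, the `ocr` argument stands in for whichever OCR service is chosen in step 2, and the "required terms" check is an assumed example of an evaluation rule.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Leading bytes ("magic numbers") for the two formats named in step 1.
PNG_MAGIC = b"\x89PNG\r\n\x1a\n"
JPEG_MAGIC = b"\xff\xd8\xff"


def detect_image_type(data: bytes) -> Optional[str]:
    """Step 1: accept only PNG/JPG, judged by the file's leading bytes."""
    if data.startswith(PNG_MAGIC):
        return "png"
    if data.startswith(JPEG_MAGIC):
        return "jpg"
    return None


@dataclass
class Evaluation:
    ok: bool
    message: str


def evaluate_text(text: str, required_terms: List[str]) -> Evaluation:
    """Step 4 (assumed rule): every required term must appear in the text."""
    lowered = text.lower()
    missing = [t for t in required_terms if t.lower() not in lowered]
    if missing:
        return Evaluation(False, "Missing: " + ", ".join(missing))
    return Evaluation(True, "All required terms found.")


def review_image(
    data: bytes,
    ocr: Callable[[bytes], str],  # placeholder for the OCR call (steps 2-3)
    required_terms: List[str],
) -> str:
    """Run the whole pipeline and return the user-facing reply (step 5)."""
    if detect_image_type(data) is None:
        return "Please upload a PNG or JPG image."
    text = ocr(data)
    result = evaluate_text(text, required_terms)
    status = "PASS" if result.ok else "FAIL"
    return f"{status}: {result.message}"
```

Passing the OCR call in as a function keeps the sketch independent of any one provider, so a real service client can be swapped in later without touching the evaluation or formatting steps.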
     
    Once the above pipeline works, we can test it further:
     
    • To check OCR accuracy, test with a range of image qualities and text styles.
    • Handle errors such as OCR failures and unreadable images.
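
The two testing points above could be handled along these lines. This is a rough Python sketch under assumptions, not a definitive implementation: `UnreadableImageError`, `safe_extract_text`, `user_message`, and the `min_chars` threshold are all hypothetical names and values, and `ocr` is again a stand-in for whichever OCR service is used.

```python
from typing import Callable


class UnreadableImageError(Exception):
    """Raised when the OCR call fails or returns no usable text."""


def safe_extract_text(
    data: bytes,
    ocr: Callable[[bytes], str],  # placeholder for the real OCR call
    min_chars: int = 3,           # assumed threshold for "readable" text
) -> str:
    """Call OCR, normalise whitespace, and reject failures or empty results."""
    try:
        text = ocr(data)
    except Exception as exc:
        # Wrap any service error so callers handle one exception type.
        raise UnreadableImageError(f"OCR call failed: {exc}") from exc
    cleaned = " ".join(text.split())
    if len(cleaned) < min_chars:
        raise UnreadableImageError("No readable text found in the image.")
    return cleaned


def user_message(data: bytes, ocr: Callable[[bytes], str]) -> str:
    """Turn success or failure into a reply the agent can show the user."""
    try:
        return "Extracted text: " + safe_extract_text(data, ocr)
    except UnreadableImageError as err:
        return f"Sorry, I could not read that image ({err}). Please try a clearer picture."
```

Because the OCR call is injected, the failure paths can be exercised with small fake functions (one that raises, one that returns only whitespace) before any real images or API keys are involved.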
     
    Please let me know if this helps.
     
     
     
     
  • JM-20021632-0 (Microsoft Employee)
    Hey, Haque! Thank you so much for the thoughtful response. I'll definitely investigate and give it a shot. Since I'm a novice here, I'm not sure I know how to follow the steps you provided, especially around the API connections. Outside of community chats, is there a service or group that can help employees like me walk through these kinds of requests?
