web

You’re offline. This is a read only version of the page.

Skip to main content

Power Platform Community

Cancel

Search

Announcements

Welcome to the Power Platform Communities

News and Announcements icon

Community site session details

Session Id :

Power Platform Community / Forums / Copilot Studio / Best approach for gene...

Copilot Studio

Suggested Answer

Best approach for generating output with AI Builder with Knowledge Base

edit

Subscribe (0)

Share

Report

Report

Posted on by YJ-02010636-0

40

I’m building a Copilot Studio agent in Teams where users upload DOCX and PDF files for analysis (summarization, extraction, comparison, etc.).

I’ve been using AI Builder so far, but I’m running into practical limitations:

File size constraints

Token limits for larger documents or batches of files

Limited flexibility when users upload many documents in a single session

I’m exploring alternative architectures and would appreciate guidance from others who’ve solved this at scale.

A few specific questions:

Dataverse as file storage – Is Dataverse a recommended approach for storing uploaded documents for Copilot Studio scenarios?
- Are there best practices for using Dataverse file columns vs alternatives (e.g., SharePoint, Azure Blob + references)?

Cost considerations – How expensive does Dataverse become for unstructured file storage at scale?
- In practice, does Dataverse file storage get costly compared to external storage options?

Processing patterns – For large or many files, is the preferred pattern to:
- Pre‑process documents outside Copilot Studio (e.g., chunking, indexing, embeddings), then
- Let Copilot Studio orchestrate over processed outputs rather than raw files?

Categories:

Building Copilot Studio chatbots in Microsoft Teams

I have the same question (0)

All responses Img

All responses (1)

Answers Img

Answers (0)

Sort by

Suggested answer

MichaelFP 2,001 Moderator on at

Like
a
(0)

Report
Copy link

Link copied!

For your scenario I would recommend to use RAG, because you will have less token consumption, because they will search using small chunking and embedding to find the information what you want.

Was this reply helpful? Yes No

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Join the conversation

Helpful resources

News and Announcements

Welcome to the Power Platform Communities

Quick Links

Season of Sharing Community Challenge Launch!

Jump in, show your community spirit, and win prizes!

Kudos to our 2025 Community Spotlight Honorees

Expanding mentorship, skilling, and AI innovation

Congratulations to the May Top 10 Community Leaders!

These are the community rock stars!

Subscribe to this forum!

Stay up to date on forum activity by subscribing.

Select categories

Autonomous agents

Bot administration

Bot analytics

Bot extensibility

Building Copilot Studio chatbots in Microsoft Teams

Calling actions from Copilot Studio

Copilot Studio pre-built agents/templates

Copilot Studio skills development

General topics

Model context protocol

Publish & channel management

Topic creation & management

Leaderboard > Copilot Studio

#1

#2

#3

sannavajjala87 156 Super User 2026 Season 1

Last 30 days Overall leaderboard

Featured topics

Announcing the "Microsoft Copilot Studio ❤️ MCP" lab

Block PII/PCI in Copilot Studio agent user prompt

Product updates

Microsoft Power Platform Community release plans

© Microsoft

Manage Cookies
Privacy & cookies
Terms of use
Trademarks

Your Privacy Choices Consumer Health Privacy

Messages

Profile
Messages
My activity
Sign out