web
You’re offline. This is a read only version of the page.
close
Skip to main content

Announcements

News and Announcements icon
Community site session details

Community site session details

Session Id :
Power Platform Community / Forums / Copilot Studio / Best file format for S...
Copilot Studio
Suggested Answer

Best file format for SharePoint knowledge source when mirroring SQL databases?

(0) ShareShare
ReportReport
Posted on by

Hi everyone! 

I'm building an agent in Copilot Studio and I need it to be able to query data that currently lives in external SQL databases. The problem is that due to our company's data architecture and security policies, we can't connect the agent directly to the SQL server or to the database itself (no direct connectors or custom plugins to external SQL are available for us).

As a workaround, our BI team has offered to regularly export and update the database contents into files stored in a SharePoint site, so the agent can use them as a knowledge source.

My questions are:


  1. What file format should the BI team use when exporting the data to SharePoint? (e.g., .xlsx, .csv, .docx, .json, .pdf, etc.) — Which format does Copilot Studio handle best for structured/tabular data as a knowledge source?


  2. Are there any size or row limitations we should be aware of? The databases have several thousand rows each.


  3. Is there a recommended way to structure the files so the agent can retrieve accurate answers? (e.g., split by category, use a specific layout, avoid merged cells, etc.)


  4. For those who have tried something similar: does the agent actually perform well querying tabular data from SharePoint knowledge sources, or would you recommend a different approach entirely (e.g., using Power Automate flows as actions/tools to query the files in real time)?

Any guidance, best practices, or lessons learned would be super appreciated! 🙏

Thanks in advance!

I have the same question (0)
  • Assisted by AI
    David_MA Profile Picture
    14,771 Super User 2026 Season 1 on at
    I don't have a ton of experience with Copilot Studio yet. But if your BI team can export the SQL data to a SharePoint list (we do this using Layer2 Cloud Connector), you could create a workflow that the agent can call to pull the data from the SharePoint list.
     
    It's funny I came across this post, as I was writing up a message to a co-worker who is wondering how Copilot could be used and the confusion between Copilot and Power Automate. I just passed the idea by Copilot for M365 to verify my idea:
     
    A company could build an agent for its sales team. It could answer questions about how salespeople should contact customers, give them tips for organizing meetings, what they should say when visiting a customer who is interested in a certain product etc.. The agent could be configured to call a Power Automate workflow as well. This could be useful if the salesperson says, next month I will be in Las Vegas. What customers do we have in the area that I can reach out to so I can stay in touch with them to see if they have any needs. Copilot could ask what ZIP code they will be visiting and then kick off a workflow to access a SharePoint list to pull contact information for the customers in the ZIP code the salesperson will be visiting and e-mail them the list with contact details.
     
    Is this the type of scenario you have for the SQL data? Copilot felt it was a strong example of using the two features together. I will not include what Copilot wrote as the reply is very long, but if you want to see what it had to say let me know. I've checked the box that this reply is AI-assisted even though nothing I've responded with was written by AI.
  • Suggested answer
    Vish WR Profile Picture
    3,483 on at
     

    Q1. What file format should the BI team use?

     

    Use PDF or DOCX — not CSV or XLSX.

     

    Copilot Studio's knowledge index is built for natural language retrieval, not spreadsheet parsing. CSV/XLSX can be indexed but the agent often mismatches column headers with values across many rows.

     

    Best practice: Ask your BI team to export data as a well-structured PDF or DOCX with clear table headers. Even better — convert rows into short prose summaries per record or category for much higher retrieval accuracy.

     

    Q2. Are there size or row limitations?

     

    Yes. Key limits to know:

     

    - Max file size: 512 MB per file

     

    - Practical limit: Large flat files with thousands of rows dilute retrieval quality significantly

     

    - Recommendation: Split files by category, region, or entity type — smaller focused files always perform better

     

    Q3. How should files be structured?

     

    Follow these guidelines for best results:

     

    - Use a clear, consistent header row on every file

     

    - Add a short title/summary at the top describing what the file contains

     

    - Group related data together (e.g., one file per table or category)

     

    - Avoid merged cells — they confuse the parser

     

    - Avoid one massive flat dump of all tables combined

     

    Q4. Does the agent actually perform well on tabular data?

     

    Honest answer: partially.

     

    - General Q&A about the data           → Works well

     

    - Specific lookups (find record X)     → Inconsistent

     

    - Aggregations / filters (value > X)   → Not reliable

     

    Recommended hybrid approach:

     

    - Use SharePoint as a knowledge source for general context and descriptive questions

     

    - Use a Power Automate flow as an agent action/tool for precise, real-time data lookups

     

  • Vish WR Profile Picture
    3,483 on at
    Wanted to check if you are able to resolve your problem? Let me know if you need any clarification 
     
     
    When replying, please don't forget to mention my userid so that it will be notified 
     
    Vishnu WR
     
    Please  Does this answer your question if my post helped you solve your issue. This will help others find it more readily. It also closes the item. If the content was useful in other ways, please consider answering Yes to Was this reply helpful? or give it a Like 
  • Romain The Low-Code Bearded Bear Profile Picture
    2,868 Super User 2026 Season 1 on at
    Many good idea, just a quick tips from similar experiences :
     
    If you have some sofware engineer skill, there is an advanced way :
    Get a CSV inside a sharepoint doc lib
    Inject CSV inside a dataverse (small base, you could erase rebuild, big database, need to create a routine to only add new line, update other, etc.) <--- this part could go from easy to ultra complicated with complexe database
    Use dataverse as a knowledge source

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Season of Sharing Community Challenge Launch!

Jump in, show your community spirit, and win prizes!

Kudos to our 2025 Community Spotlight Honorees

Expanding mentorship, skilling, and AI innovation

Congratulations to the April Top 10 Community Leaders!

These are the community rock stars!

Leaderboard > Copilot Studio

#1
Valantis Profile Picture

Valantis 244

#2
Romain The Low-Code Bearded Bear Profile Picture

Romain The Low-Code... 221 Super User 2026 Season 1

#3
Vish WR Profile Picture

Vish WR 196

Last 30 days Overall leaderboard