Best file format for SharePoint knowledge source when mirroring SQL databases?

(0) Share

Report

Posted on by VC-27051507-0

Hi everyone!

I'm building an agent in Copilot Studio and I need it to be able to query data that currently lives in external SQL databases. The problem is that due to our company's data architecture and security policies, we can't connect the agent directly to the SQL server or to the database itself (no direct connectors or custom plugins to external SQL are available for us).

As a workaround, our BI team has offered to regularly export and update the database contents into files stored in a SharePoint site, so the agent can use them as a knowledge source.

My questions are:

What file format should the BI team use when exporting the data to SharePoint? (e.g., .xlsx, .csv, .docx, .json, .pdf, etc.) — Which format does Copilot Studio handle best for structured/tabular data as a knowledge source?

Are there any size or row limitations we should be aware of? The databases have several thousand rows each.

Is there a recommended way to structure the files so the agent can retrieve accurate answers? (e.g., split by category, use a specific layout, avoid merged cells, etc.)

For those who have tried something similar: does the agent actually perform well querying tabular data from SharePoint knowledge sources, or would you recommend a different approach entirely (e.g., using Power Automate flows as actions/tools to query the files in real time)?

Any guidance, best practices, or lessons learned would be super appreciated! 🙏

Thanks in advance!

Categories:

Building Copilot Studio chatbots in Microsoft Teams

Copilot Studio skills development

I have the same question (0)

All responses (4)

Answers (0)

Sort by

Assisted by AI

David_MA 14,771 Super User 2026 Season 1 on at

Like (0)

Report
Copy link

Link copied!

I don't have a ton of experience with Copilot Studio yet. But if your BI team can export the SQL data to a SharePoint list (we do this using Layer2 Cloud Connector), you could create a workflow that the agent can call to pull the data from the SharePoint list.

It's funny I came across this post, as I was writing up a message to a co-worker who is wondering how Copilot could be used and the confusion between Copilot and Power Automate. I just passed the idea by Copilot for M365 to verify my idea:

A company could build an agent for its sales team. It could answer questions about how salespeople should contact customers, give them tips for organizing meetings, what they should say when visiting a customer who is interested in a certain product etc.. The agent could be configured to call a Power Automate workflow as well. This could be useful if the salesperson says, next month I will be in Las Vegas. What customers do we have in the area that I can reach out to so I can stay in touch with them to see if they have any needs. Copilot could ask what ZIP code they will be visiting and then kick off a workflow to access a SharePoint list to pull contact information for the customers in the ZIP code the salesperson will be visiting and e-mail them the list with contact details.

Is this the type of scenario you have for the SQL data? Copilot felt it was a strong example of using the two features together. I will not include what Copilot wrote as the reply is very long, but if you want to see what it had to say let me know. I've checked the box that this reply is AI-assisted even though nothing I've responded with was written by AI.

Was this reply helpful? Yes No
Suggested answer

Vish WR 3,483 on at

Like (1)

Report
Copy link

Link copied!

@VC-27051507-0

Q1. What file format should the BI team use?

Use PDF or DOCX — not CSV or XLSX.

Copilot Studio's knowledge index is built for natural language retrieval, not spreadsheet parsing. CSV/XLSX can be indexed but the agent often mismatches column headers with values across many rows.

Best practice: Ask your BI team to export data as a well-structured PDF or DOCX with clear table headers. Even better — convert rows into short prose summaries per record or category for much higher retrieval accuracy.

Q2. Are there size or row limitations?

Yes. Key limits to know:

- Max file size: 512 MB per file

- Practical limit: Large flat files with thousands of rows dilute retrieval quality significantly

- Recommendation: Split files by category, region, or entity type — smaller focused files always perform better

Q3. How should files be structured?

Follow these guidelines for best results:

- Use a clear, consistent header row on every file

- Add a short title/summary at the top describing what the file contains

- Group related data together (e.g., one file per table or category)

- Avoid merged cells — they confuse the parser

- Avoid one massive flat dump of all tables combined

Q4. Does the agent actually perform well on tabular data?

Honest answer: partially.

- General Q&A about the data → Works well

- Specific lookups (find record X) → Inconsistent

- Aggregations / filters (value > X) → Not reliable

Recommended hybrid approach:

- Use SharePoint as a knowledge source for general context and descriptive questions

- Use a Power Automate flow as an agent action/tool for precise, real-time data lookups

Was this reply helpful? Yes No
Vish WR 3,483 on at

Like (0)

Report
Copy link

Link copied!

@VC-27051507-0

Wanted to check if you are able to resolve your problem? Let me know if you need any clarification

When replying, please don't forget to mention my userid so that it will be notified

Vishnu WR

LinkedIn - My Tech Space

Please ✅ Does this answer your question if my post helped you solve your issue. This will help others find it more readily. It also closes the item. If the content was useful in other ways, please consider answering Yes to Was this reply helpful? or give it a Like ♥

Was this reply helpful? Yes No
Romain The Low-Code... 2,868 Super User 2026 Season 1 on at

Like (0)

Report
Copy link

Link copied!

Many good idea, just a quick tips from similar experiences :

If you have some sofware engineer skill, there is an advanced way :
Get a CSV inside a sharepoint doc lib

Inject CSV inside a dataverse (small base, you could erase rebuild, big database, need to create a routine to only add new line, update other, etc.) <--- this part could go from easy to ultra complicated with complexe database

Use dataverse as a knowledge source

Was this reply helpful? Yes No