Hello,
In general, the files and content on the SharePoint site are indexed, which probably means (I'm not 100% sure how it works, as I don't have access to the codebase :-)) that everything is converted to embeddings: numeric vectors that let the machine represent what the data is about. Then, when you prompt the model:
1. It converts your prompt to an embedding.
2. It searches the indexed knowledge for the nearest neighbour of your prompt embedding.
3. It outputs the result, if it finds anything.
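To make the three steps above concrete, here is a minimal sketch in Python. It is not how SharePoint or the bot actually does it (I can't see that code); the word-count "embedding", the `min_score` threshold and the sample documents are all my own assumptions, just to show the idea of nearest-neighbour search.

```python
import math
from collections import Counter


def embed(text):
    # Toy stand-in for a real embedding model (assumption): plain word counts.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(prompt, documents, min_score=0.2):
    # Step 1: convert the prompt to an embedding.
    query = embed(prompt)
    # Step 2: find the nearest neighbour among the indexed documents.
    best_title, best_score = None, 0.0
    for title, content in documents.items():
        score = cosine(query, embed(content))
        if score > best_score:
            best_title, best_score = title, score
    # Step 3: output it, but only if something is close enough.
    return best_title if best_score >= min_score else None


# Hypothetical indexed content.
documents = {
    "Leave policy": "how to request annual leave and vacation days",
    "Expense guide": "how to submit travel expense reports",
}

print(search("request vacation days", documents))
print(search("printer is broken", documents))
```

The first prompt shares words with one document and returns it, while the second matches nothing and returns `None`. A real embedding model is much better at matching meaning than this toy, but the result still depends on how the prompt is worded.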
Given that, the result depends on how you prompt it, so it won't be very stable, since users phrase their prompts in different ways. For now, the best way to make it more stable is to use a different data source. In your case you could do something like:
1. Define a SharePoint list with rows containing the title, description and a link to the file.
2. Use generative orchestration with the Get items action and instruct the model to fill in the OData filter based on the user prompt.
OR
2. Use the classic approach and create a topic with trigger phrases, questions and a Get items action to fetch the data.
That way the bot will be able to filter the data first, before presenting it to the user.
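To give an idea of what the filter for Get items could look like, here is a small Python sketch that builds an OData filter string from keywords. The column names `Title` and `Description` and the helper names are hypothetical, and I'm assuming both columns are single line of text, as multi-line text columns may not be usable in a filter. In the bot, the model or the topic would produce this string instead.

```python
def odata_quote(value):
    # OData string literals use single quotes; a quote inside is doubled.
    return "'" + value.replace("'", "''") + "'"


def build_filter(keywords, columns=("Title", "Description")):
    # Every keyword must appear in at least one of the columns.
    clauses = []
    for keyword in keywords:
        per_column = [
            f"substringof({odata_quote(keyword)},{column})" for column in columns
        ]
        clauses.append("(" + " or ".join(per_column) + ")")
    return " and ".join(clauses)


# Example: keywords picked out of the user prompt (hypothetical).
print(build_filter(["leave", "policy"]))
```

For the keywords above this gives `(substringof('leave',Title) or substringof('leave',Description)) and (substringof('policy',Title) or substringof('policy',Description))`, which you can compare with what the model puts in the Filter Query field of Get items.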
In case of any other questions, let me know. If the answer helped you, mark it, so that others can benefit from it.
Best regards,
Artur Stepniak