Announcements
Hello everyone,
We have deployed an agent within our team to help them answer their specific questions. Agent is deployed to Teams The agent is powered by the GPT‑4.1 model, but unexpectedly, some users can no longer use it and are seeing the following error message during conversations:
“An error has occurred. Error code: OpenAIModelTokenLimit Conversation ID: … ”
I tried clearing the conversation, and it works for the first question, but the error appears again right after.
From what I understand, each model has a token limit per conversation ? Does the retrieval of many sharepoint/files consume token as well for each request ?
Should I upgrade to GPT‑5 to increase this limit? optimize sources ?
EDIT : after some research and from my udnerstanding, each request consume token and there is x thousand of tokens allowed to my model
But is it allowed per agent ? per model ? How can I know how much token is left ?
thank you
Under review
Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.
Congratulations to our 2026 Super Users!
Congratulations to our 2025 community superstars!
These are the community rock stars!
Stay up to date on forum activity by subscribing.
Valantis 618
Haque 147
Vish WR 140