
Hello Everyone,
Could we have a AI model to give us insight about how fluent and grammatically correct the user is in a language suppose english, from a audio input ? and display the details in a canvas app. Input could be any audio file maybe in .mp4, .wbmp , or .wav format.
How we could achieve it ?
Hi @BhumikaB in AI Builder there is unfortunately no model that could do that. AI Builder is primarily focused AI applications on text or images. Videos and audio is unfortunately not possible. If I think out loud, you might look into Azure Cognitive services and combine a ''speech to text'' (to extract te text) solution with Azure OpenAI (to review the extracted text based on correct grammer). This proposed solution however is based on solutions outside of AI Builder - so be aware of that!
---
If this answer helped you, would you be so kind to mark this answer as ''solved'' and give it a like? This will help other people (who have the same question) a lot! 😃