Improved Smart Endpointing and Turn Detection Models | Voters

Improved Smart Endpointing and Turn Detection Models

complete

A. Salim

The current state of the smart endpointing model is not as great and sometimes just delays the response by almost 1.5 secs when quick words like "Yes" or "Yeah" are spoken. 
I think adding a more advanced endpointing and turn detection model models are a big request by the community right and would really improve the calls quality. 
Livekit has implemented a perfect open source plugin that does this for it's text-to-speech models : 
https://github.com/livekit/agents/blob/main/livekit-plugins/livekit-plugins-turn-detector
https://blog.livekit.io/using-a-transformer-to-improve-end-of-turn-detection/
I hope we get something similar or an improved version of the Smart Endpointing model we have :)

February 15, 2025

This post was marked as

complete

Nikhil Gupta

marked this post as

planned

This post was marked as

complete

This post was marked as

planned

Prime Mind

Def high on the list!!

Henry Williams

marked this post as

under review

We are investigating this request right now.