Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

How are you doing semantic end-of-turn detection without adding latency to the critical path? Is it a separate lightweight model or integrated into the LLM stream?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: