
Google Voice actually uses the same technology and same dataset, AFAIK, which is why I was so confused by reading this article.

Google's stuff is pretty good some of the time, but they've hardly solved this problem to the degree the article suggests (as anyone who has actually used this for more than 5 minutes could tell you).



Long-form transcription is a pretty different problem for language models than parsing search queries. There's lots of audio-processing overlap, sure, but parsing a voicemail definitely has different, harder challenges.



