Hacker News new | past | comments | ask | show | jobs | submit login
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels (github.com/mpc001)
41 points by yagizdegirmenci 9 days ago | hide | past | favorite | 1 comment





Not referenced in the README, here's a great video demonstration of this type of AVSR network running in real time:

https://m.youtube.com/watch?v=XDO8OYnmkNY&t=120s




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: