Ask HN: Why don't we use subtitled films/tv to train speech recognition?
30 points by sycren on Aug 30, 2011 | 34 comments
There are thousands of films and TV episodes with subtitles throughout their duration, and millions of songs with published lyrics. Would it not be possible to use this material to train speech recognition? That would make it possible to train on the many different dialects and accents of a particular language. Speech recognition as a technology has always appeared to move slowly, although with the rise of mobile devices it is becoming increasingly popular. Is anyone doing anything like this?
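
To make the idea concrete, here is a rough sketch of the first step: turning a subtitle file into timed text segments that could later be paired with the matching slice of audio. It assumes the subtitles are in the common SubRip (.srt) layout; the names `Cue` and `parse_srt` are made up for illustration. Subtitles are often paraphrased and only loosely timed, so these segments would be weak labels at best and would probably need a forced-alignment pass before being used for training.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle cue: a candidate (audio span, transcript) pair."""
    start: float  # seconds from the start of the film
    end: float    # seconds from the start of the film
    text: str


# SubRip timestamps look like 00:01:02,500 (hours:minutes:seconds,millis).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _seconds(stamp: str) -> float:
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad timestamp: {stamp!r}")
    hours, minutes, secs, millis = (int(g) for g in match.groups())
    return hours * 3600 + minutes * 60 + secs + millis / 1000


def parse_srt(srt: str) -> list[Cue]:
    """Split SubRip text into cues, dropping markup such as <i>...</i>."""
    cues = []
    for block in re.split(r"\n\s*\n", srt.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        # The timing line is the one containing the arrow.
        idx = next((i for i, ln in enumerate(lines) if "-->" in ln), None)
        if idx is None:
            continue
        start, end = lines[idx].split("-->", 1)
        # Ignore optional position data after the end timestamp.
        end = end.split()[0]
        text = re.sub(r"<[^>]+>", "", " ".join(lines[idx + 1:])).strip()
        if text:
            cues.append(Cue(_seconds(start), _seconds(end), text))
    return cues


SAMPLE = """1
00:00:01,000 --> 00:00:03,500
Hello there.

2
00:00:04,000 --> 00:00:06,250
<i>How are you?</i>
"""

if __name__ == "__main__":
    for cue in parse_srt(SAMPLE):
        print(f"{cue.start:8.3f} {cue.end:8.3f}  {cue.text}")
```

Each cue gives a time window and a transcript, so cutting the audio track at `start` and `end` would yield a labelled clip. How usable those clips are depends heavily on how closely the subtitles follow the spoken words.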

Over time you could build up a database of voice prints and grammars not just for celebrities and politicians, but also for criminals (for automatic identification).
I had this idea almost 4 years ago and submitted it to the company, but it wasn't taken seriously.
If anybody is interested in this, let me know!