What are the state of the art open solutions to local voice recognition? Prefera... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

colechristensen on March 9, 2022 | parent | context | favorite | on: DeepSpeech 60x Smaller, 9x faster, and 2x accuracy

What are the state of the art open solutions to local voice recognition? Preferably with available models that a small org can also train themselves without millions in hardware.

rileyphone on March 9, 2022 | [–]

I will add https://github.com/coqui-ai/STT, which is a continuation of DeepSpeech. Also, I've been messing around with https://github.com/ideasman42/nerd-dictation, which works on a VOSK backend - accuracy is decent, especially with the bigger model.

trowngon on March 9, 2022 | | [–]

Vosk https://github.com/alphacep/vosk-api

albertzeyer on March 9, 2022 | [–]

Kaldi, K2, ESPNet.

misc1234 on March 9, 2022 | [–]

Nemo

Join us for AI Startup School this June 16-17 in San Francisco!
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact