Some of that stuff isn't too hard, if you can narrow down the domain of words yo...

blackkettle · on April 22, 2016

Actually, somebody even cross-compiled pocketsphinx to JavaScript with Emscripten for this purpose:

https://syl22-00.github.io/pocketsphinx.js/live-demo.html

This works pretty well, all in the browser, especially if you drop in some better acoustic models.

IshKebab · on April 24, 2016

Yeah, I wouldn't call that "pretty well": I said "not a number" and it output "one two one" on the digits example.

Maybe it just wasn't trained well enough to reject non-number inputs, but yeah, it doesn't exactly change my experience that Sphinx is awful.
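
To illustrate the kind of failure I mean, here is a toy sketch. It is not how pocketsphinx actually decodes: edit distance on spelling stands in for an acoustic score, and `decode` and `max_cost` are names made up for this example. The point is only that a decoder limited to a digits vocabulary with no rejection path has to answer with digits, whatever you say to it.

```python
# Toy closed-vocabulary "recognizer": every input word is forced onto the
# nearest digit word. Spelling edit distance is a stand-in for an acoustic
# score; this is an illustration, not pocketsphinx's algorithm.
DIGITS = ["zero", "one", "two", "three", "four",
          "five", "six", "seven", "eight", "nine"]


def edit_distance(a, b):
    """Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def decode(words, max_cost=None):
    """Map each word to its closest digit word.

    max_cost is a hypothetical rejection threshold on the length-normalized
    distance. With max_cost=None nothing is ever rejected, so out-of-domain
    words still come back as digits.
    """
    out = []
    for w in words:
        best = min(DIGITS, key=lambda d: edit_distance(w, d))
        cost = edit_distance(w, best) / max(len(w), len(best))
        out.append(None if max_cost is not None and cost > max_cost else best)
    return out


forced = decode("not a number".split())                  # three digit words
rejected = decode("not a number".split(), max_cost=0.3)  # all None
```

Without a threshold, "not a number" comes back as three digit words. With one, all three words are rejected while real digits still pass. A real recognizer would presumably need something comparable, such as a garbage model or a confidence cutoff, to avoid the behavior I saw.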

blackkettle · on April 24, 2016

You have to use a decent acoustic model, not the one in the demo. If you do, I think it works 'pretty well' as a proof of concept. That said, I'm not recommending Sphinx as a recognition framework; it is way behind the times in 2016. But this is the only 'in the wild' demo of this I've seen on the web, so I felt it was worth mentioning.