Audapolis: An editor for spoken-word audio with automatic transcription

unraveller · 2023-08-24T09:32:21

Seems to use the Vosk voice recognition model under the hood for most languages, but doesn't really say that upfront or how it fares against the state of the art. Vosk has a significantly higher word error rate than Whisper models in English, but since it's an editor of sorts it probably doesn't matter that much. They are working on adding Whisper as an option.

https://scribe.rip/version-1/analysis-of-automatic-speech-re...

billconan · 2023-08-23T16:59:35

Can it update the audio once the transcript has been changed, i.e. does it do voice cloning?