Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
arkobel's comments
login
arkobel
7 days ago
|
parent
|
context
|
next
[–]
| on:
My accent costs me 30 IQ points on Zoom. So we bui...
The lack of parallel accent data makes this fundamentally unsupervised. Curious if this leans more on latent disentanglement than direct supervision.
reply
arkobel
51 days ago
|
parent
|
context
|
prev
[–]
| on:
Show HN: Sparrow-1 – Audio-native model for human-...
Have you compared with Krisp-TT models?
https://krisp.ai/blog/krisp-turn-taking-v2-voice-ai-viva-sdk...
Krisp LLC also shares an End-of-Turn Test dataset. Did you test your model on that?
https://huggingface.co/datasets/Krisp-AI/turn-taking-test-v1
And can you share some information about the model size and FLOPS?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
reply