Hacker News new | past | comments | ask | show | jobs | submit login

However, this looks like it only works with speech - i.e. you can't ask it, "What's the tune I'm humming?" or "Why is my car making this noise?"

I could be wrong but I haven't seen any non-speech demos.




Fwiw, the live demo[0] included different kinds of breathing, and getting feedback on it.

[0]: https://youtu.be/DQacCB9tDaw?t=557


What about the breath analysis?


I did see that, though my interpretation is that breathing is included in its voice tokenizer which helps it understand emotions in speech (the AI can generate breath sounds after all). Other sounds, like bird songs or engine noises, may not work - but I could be wrong.


I suspect that like images and video, their audio system is or will become more general purpose. For example it can generate the sound of coins falling onto a table.


allegedly google assistant can do the "humming" one but i have never gotten it to work. I wish it would because sometimes i have a song stuck in my head that i know is sampled from another song.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: