However, this looks like it only works with speech - i.e. you can't ask it, "Wha...

cube2222 · on May 13, 2024

Fwiw, the live demo[0] included different kinds of breathing, and getting feedback on it.

throwaway11460 · on May 13, 2024

What about the breath analysis?

pants2 · on May 13, 2024

I did see that, though my interpretation is that breathing is included in its voice tokenizer which helps it understand emotions in speech (the AI can generate breath sounds after all). Other sounds, like bird songs or engine noises, may not work - but I could be wrong.

CooCooCaCha · on May 13, 2024

I suspect that like images and video, their audio system is or will become more general purpose. For example it can generate the sound of coins falling onto a table.

genewitch · on May 13, 2024

allegedly google assistant can do the "humming" one but i have never gotten it to work. I wish it would because sometimes i have a song stuck in my head that i know is sampled from another song.