We've been cooking up a new experiment where you can record yourself singing or talking and the app will generate vocals to match your words and timings. It's backed by an end-to-end latent diffusion model that generates audio conditioned on both the style and the lyric timings, and it's quite fast. Your actual voice and melody are never used, only the transcription and timing, and we don't store the recording.
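For the curious, here's a rough PyTorch sketch of what "conditioned on both the style and the lyric timings" could look like in a diffusion denoiser. To be clear, this is not our actual architecture: the module names, dimensions, and the FiLM-plus-cross-attention conditioning choices are all illustrative assumptions.

```python
# Hypothetical sketch: a denoiser conditioned on a style embedding (via FiLM)
# and on lyric-timing tokens (via cross-attention). All names and sizes are
# invented for illustration.
import torch
import torch.nn as nn

class LyricEncoder(nn.Module):
    """Embeds (phoneme id, start time, end time) triples into tokens."""
    def __init__(self, n_phonemes=64, d=128):
        super().__init__()
        self.phone_emb = nn.Embedding(n_phonemes, d)
        self.time_proj = nn.Linear(2, d)  # start/end times in seconds

    def forward(self, phones, times):
        # phones: (B, T) long, times: (B, T, 2) float
        return self.phone_emb(phones) + self.time_proj(times)

class Denoiser(nn.Module):
    """Predicts the noise in an audio latent, conditioned on style + timing."""
    def __init__(self, d_latent=64, d=128, d_style=32):
        super().__init__()
        self.in_proj = nn.Linear(d_latent, d)
        self.t_proj = nn.Linear(1, d)                # diffusion timestep
        self.film = nn.Linear(d_style, 2 * d)        # style -> scale/shift
        self.cross = nn.MultiheadAttention(d, num_heads=4, batch_first=True)
        self.out = nn.Linear(d, d_latent)

    def forward(self, z_t, t, style, lyric_tokens):
        h = self.in_proj(z_t) + self.t_proj(t[:, None, None].float())
        scale, shift = self.film(style).chunk(2, dim=-1)
        h = h * (1 + scale[:, None]) + shift[:, None]     # style conditioning
        attn, _ = self.cross(h, lyric_tokens, lyric_tokens)  # timing conditioning
        return self.out(h + attn)

# Toy forward pass for one training step's noise prediction.
B, frames = 2, 100
enc, net = LyricEncoder(), Denoiser()
z_t = torch.randn(B, frames, 64)          # noisy audio latents
t = torch.randint(0, 1000, (B,))          # diffusion timesteps
style = torch.randn(B, 32)                # style embedding
phones = torch.randint(0, 64, (B, 20))    # phoneme ids from the transcription
times = torch.rand(B, 20, 2)              # per-phoneme start/end times
eps_hat = net(z_t, t, style, enc(phones, times))
print(eps_hat.shape)  # torch.Size([2, 100, 64])
```

The key point the sketch tries to capture is that only the transcription and its timings flow into the model; the raw recording itself is never an input.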
We've found it's a really natural way to shape the output you want and dream up a song concept. Curious to hear what you think!
This, plus the Music ControlNet post from yesterday, gives me some hope that audio AI will head in the direction of creative tools rather than dystopian full-song generation.