There is one single app I've been able to find that offers Parakeet-v3 for free locally and it's called Spokenly. They have paid cloud models available as well, but the local Parakeet-v3 implementation is totally free and is the best STT has to offer these days regardless. Super fast and accurate. I consider single-user STT basically a solved problem at this point.
I've been running a variation of this for the past 3 weeks. I swapped out the default pi agent back to Claude Code because I didn't like the smaller feature set. Bought a phone line and communicate with my agent via iMessage on a clamshelled mac. A Tailscale network connect the head agent to all the computers on my network including my laptop, a few raspberry pi's, steam deck, and all the IoT devices in my house. As I discover new uses, I ask it to make skills and it is remarkable what it's been able to handle all through the single chat interface because it has 24/7 access to all my computers' file systems and my home network. It's been really fun to see how far I can take it, and the skills framework built into CC/Codex now make it feel infinitely extensible.
I should note, a lot of the functionality I built into my agent was custom after-the-fact because (at least three weeks ago) the clawdis repo was in a state that I found very broken and with tons of false information. Luckily it's easy work for Claude to get things working for you, but really the key unlock was the phone line through iMessage and the unrestricted access to all my systems. It really does feel like I'm able to work with any of my files anywhere now, while hardly requiring much of my attention at all. I would recommend something like this at the bare minimum if you intend to implement a system like this: https://github.com/kenryu42/claude-code-safety-net
Both clawdbot and pi have improved and expanded functionality a lot during the last 3 weeks, maybe worth another look? What you have described sounds a lot like the experience I’m having.
At this point I really don't want to even imagine the headache of implementing their codebase updates into all the custom scaffolding that my own fork relies on now. I think my plan going forward is to cherry pick features that sound interesting and re-implement on my own using my agent that has proper documentation on my personal configuration. Will check out what's new though.
I listened to it when you posted before. Better than most of the others I have listened wich were all much more "cold".
The visual stuff also helps to make it more powerfull and cohesive.
The bad part is that it wanders a lot to get nowhere and it does not create a climax that bridges with the second part. The same sounds and ambient with a producer behind that creates an arragment for it would be much more powerful.
I'm not the person you responded to, but these are some examples from someone I know that had accompanying music videos (actual video production) made for them:
I feel I must push back on this dang. I was being kind and not snarky, but critisim was earned as I listened to the tracks that where suggested. Had I said these are wonderful no flag. OP stated something that would lead someone to believe they where as good as grandfather comment description. It was in fact not audioly pleasent despite the great visuals.
reply