Greetings!
YakGPT is a simple, frontend-only ChatGPT UI that you can use either to chat normally or, more excitingly, to chat hands-free using your mic and OpenAI's Whisper API.
Some features:
* A few fun characters pre-installed
* No tracking or analytics; OpenAI is the only thing it calls out to
* Optimized for mobile use via hands-free mode and cross-platform compressed audio recording
* Your API key and chat history are stored in browser local storage only
* Open-source: you can either use the deployed version on Vercel or run it locally
Planned features:
* Integrate Eleven Labs & other TTS services to enable full hands-free conversation
* Implement LangChain and/or plugins
* Integrate more ASR services that allow for streaming
Source code: https://github.com/yakGPT/yakGPT
I’d love for you to try it out, and I’d love to hear your feedback!
Most people can talk faster than they can type, but they can read faster than other people can talk. So an interface where I speak but read the response is an ideal way of interfacing with ChatGPT.
What would be nice is if I didn't have to press the mic button to speak -- if it could just tell when I was speaking (perhaps by saying "hey YakGPT"). But I see how that might be hard to implement.
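For what it's worth, the simpler half of this (noticing that someone is talking at all, without a wake phrase) can be roughly approximated with an energy threshold. The sketch below is only an illustration and is not taken from YakGPT: the names `rms` and `EnergyVad`, the threshold of 0.02, and the hangover count are all assumptions made for the example. It expects frames of samples in the range [-1, 1], which in a browser might come from the Web Audio API. Recognizing a phrase like "hey YakGPT" is a harder problem and is not covered here.

```typescript
// Hypothetical sketch of energy-based voice activity detection.
// Names and default values are illustrative assumptions, not YakGPT code.

/** Root-mean-square level of one frame of samples in the range [-1, 1]. */
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  const sumSquares = frame.reduce((acc, sample) => acc + sample * sample, 0);
  return Math.sqrt(sumSquares / frame.length);
}

class EnergyVad {
  private threshold: number;      // RMS level at or above which a frame counts as speech
  private hangoverFrames: number; // quiet frames tolerated before speech is considered over
  private speaking = false;
  private quietFrames = 0;

  constructor(threshold = 0.02, hangoverFrames = 3) {
    this.threshold = threshold;
    this.hangoverFrames = hangoverFrames;
  }

  /** Feed one frame; returns true while the speaker is considered active. */
  process(frame: Float32Array): boolean {
    if (rms(frame) >= this.threshold) {
      // Loud frame: start (or continue) speech and reset the quiet counter.
      this.speaking = true;
      this.quietFrames = 0;
    } else if (this.speaking) {
      // Quiet frame during speech: only stop after enough consecutive quiet frames,
      // so short pauses between words do not end the recording.
      this.quietFrames += 1;
      if (this.quietFrames > this.hangoverFrames) {
        this.speaking = false;
      }
    }
    return this.speaking;
  }
}
```

An app could start recording when `process` first returns true and send the audio for transcription once it returns false again. A fixed threshold like this tends to misfire in noisy rooms, which is part of why this is harder than it looks.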
Would love to hook this up to some smart glasses with a heads-up display where I could speak and read the response.