Show HN: Monika – Open-Source AI Assistant with Whisper STT and Emotional TTS (github.com/aymanelotfi)
2 points by aymanelotfi 6 months ago
Hi HN,

I'm sharing Monika, an open-source AI assistant I've built. The main focus was on running the speech components locally to enhance privacy and create more natural-sounding interactions.

Key components:
* Speech-to-Text: Uses OpenAI's Whisper running locally.
* Text-to-Speech: Uses RealtimeTTS with the Orpheus model for emotional expression, also running locally.
* NLP: Uses Google Gemini on the backend.
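
To make the data flow between these three components concrete, here is a minimal sketch of one conversational turn. It is not taken from the Monika codebase: `VoicePipeline`, `handle_turn`, and the three callables are hypothetical names, and each stage is passed in as a plain function so the sketch does not depend on any particular Whisper, Gemini, or RealtimeTTS call.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """One turn: audio in -> text -> reply text -> audio out.

    All three stages are injected, so nothing here assumes a real
    library API. The names are illustrative, not Monika's own.
    """
    transcribe: Callable[[bytes], str]  # local STT stage (e.g. Whisper)
    respond: Callable[[str], str]       # remote LLM stage (e.g. Gemini)
    speak: Callable[[str], bytes]       # local TTS stage (e.g. RealtimeTTS)

    def handle_turn(self, audio: bytes) -> bytes:
        text = self.transcribe(audio).strip()
        if not text:
            # Nothing intelligible was heard: skip the LLM call entirely.
            return b""
        reply = self.respond(text)
        return self.speak(reply)
```

With this layout, the only data handed to the remote stage is the transcribed text; the raw audio goes in and out of the local stages only, which appears to be the privacy argument behind doing STT and TTS locally.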

It includes Voice Activity Detection (VAD) and a basic web interface using Flask. The idea was to see how well local STT and expressive local TTS could work together for a conversational agent.
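
For readers unfamiliar with VAD, the sketch below shows the simplest form of the idea: mark a frame as speech when its RMS energy crosses a threshold, and tolerate a few quiet frames before closing a segment. This is an assumption-laden illustration only. The post does not say which VAD method Monika uses, and `rms`, `speech_segments`, `threshold`, and `hangover` are hypothetical names and values.

```python
import math
from typing import List, Sequence, Tuple


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def speech_segments(
    frames: Sequence[Sequence[float]],
    threshold: float = 0.02,  # assumed energy cutoff, needs tuning per mic
    hangover: int = 2,        # quiet frames tolerated inside one segment
) -> List[Tuple[int, int]]:
    """Return (start, end) frame indices of speech, with end exclusive."""
    segments: List[Tuple[int, int]] = []
    start = None
    silent = 0  # consecutive quiet frames seen since the last loud one
    for i, frame in enumerate(frames):
        if rms(frame) >= threshold:
            if start is None:
                start = i
            silent = 0
        elif start is not None:
            silent += 1
            if silent > hangover:
                # Close the segment just after the last loud frame.
                segments.append((start, i - silent + 1))
                start, silent = None, 0
    if start is not None:
        # Audio ended mid-segment: drop any trailing quiet frames.
        segments.append((start, len(frames) - silent))
    return segments
```

Only the frames inside a returned segment would need to be passed on to the STT step, so silence is never transcribed.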

Tech stack: Python, Flask, Whisper, Gemini, RealtimeTTS.

Video Demo: [https://www.youtube.com/watch?v=_vdlT1uJq2k]

The project is MIT licensed. I'd appreciate any feedback, thoughts on the approach, or suggestions you might have!


