Uses Groq + Llama vision models for fast inference.
Feel free to play around, appreciating all feedback!