jaykr_'s comments

jaykr_ · on Feb 9, 2025

Seems like Gemini 2.0 Flash Thinking got silently updated in AI Studio to accept audio input, as well as image and text, making it the first reasoning model I'm aware of that works across audio. Trying it out on a few audio tasks (transcription, sound analysis) seems to perform a little better than 2.0 Flash or even Pro. Curious what you guys make of it!

jaykr_ · on Nov 21, 2024

This is awesome! I really appreciate the time you took to document everything!

PaulPauls · on Nov 22, 2024

Thank you for saying that! I have a much, much harder time documenting everything and writing out each decision in continuous text than actually writing the code. So it took a look time for me to write all of this down - so I'm happy you appreciate it! =)