Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Did you try setting thinkingLevel to minimal?

thinkingConfig: { thinkingLevel: "low", }

More about it here https://ai.google.dev/gemini-api/docs/gemini-3#new_api_featu...





Yes I tried it with minimal and it's roughly 3 seconds for prompts that take flash 2.5 1 second.

On that note it would be nice to get these benchmark numbers based on the different reasoning settings.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: