Hacker News new | past | comments | ask | show | jobs | submit login

Suppose I give Gemini a 10 minute video. Will it spend 10 minutes “watching” it if I ask it to extract something? Or does it know how to speed up the video? I assume it must do some sort of preprocessing like extracting keyframes; it surely (?) can’t be looking at the raw encoded video bytes, after all.





It won't take 10 minutes, but it might still take a minute or two (for Pro) - though Flash and Flash 8B should be significantly faster.

It does process a version of the raw video but it can run that faster than the default video playback rate.

That is quite a bit of detail here: https://ai.google.dev/gemini-api/docs/vision?lang=python#pro...

"The File API service extracts image frames from videos at 1 frame per second (FPS) and audio at 1Kbps, single channel, adding timestamps every second."




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: