
That would make each API call cost at least $3 ($3 is the price per million input tokens), so a 10-message interaction would run $30+. Is that what you would expect?
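To make the arithmetic concrete, here's a rough sketch, assuming a ~1M-token book that gets resent as input on every turn (the per-turn token count is a guess, not a measured value):

    # Rough cost of a 10-turn chat over a ~1M-token book at $3/million
    # input tokens. BOOK_TOKENS and TURN_TOKENS are illustrative guesses.
    PRICE_PER_M = 3.00
    BOOK_TOKENS = 1_000_000
    TURN_TOKENS = 500  # question + answer added to the transcript each turn

    total = 0.0
    for turn in range(1, 11):
        # every call re-sends the book plus the conversation so far
        input_tokens = BOOK_TOKENS + turn * TURN_TOKENS
        total += input_tokens / 1_000_000 * PRICE_PER_M

    print(f"~${total:.2f} in input tokens")  # ~$30.08

And that's input tokens alone; output tokens are billed on top, at a higher rate.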



Gemini 1.5 Pro charges $0.35/million tokens for prompts up to one million tokens, or $0.70/million tokens for prompts longer than that, and it supports a multi-million-token context window.

Substantially cheaper than $3/million, but I guess Anthropic’s prices are higher.


You're looking at the pricing for Gemini 1.5 Flash. Pro is $3.50/million tokens for prompts under 128k tokens, $7/million otherwise.
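If I've read the tiering right (the whole prompt is billed at the higher rate once it crosses 128k tokens), the input cost works out like this; the rates are worth re-checking against the pricing page:

    def gemini_pro_input_cost(prompt_tokens: int) -> float:
        # $3.50/M up to 128k tokens, $7/M for the whole prompt beyond that
        rate = 3.50 if prompt_tokens <= 128_000 else 7.00
        return prompt_tokens / 1_000_000 * rate

    print(gemini_pro_input_cost(100_000))    # 0.35
    print(gemini_pro_input_cost(1_000_000))  # 7.0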


Ah... oops. For some reason, that page isn't rendering properly in my browser: https://imgur.com/a/XLFBPMI

When I glanced at the pricing earlier, I didn't notice there was a dropdown at all.


It is also much worse.


Is it, though? In my limited tests, Gemini 1.5 Pro (through the API) is very good at tasks involving long context comprehension.

Google's user-facing implementations of Gemini are pretty consistently bad when I try them out, so I understand why people might have a bad impression about the underlying Gemini models.


Maybe they're summarizing/processing the documents in a specific format instead of chatting? If they needed chat, it might be easier to build using RAG?
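Something like this, as a minimal sketch (sentence-transformers for embeddings; the chunk size, top-k, book file, and prompt format are all arbitrary choices here):

    import numpy as np
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")

    def chunk(text: str, size: int = 1000) -> list[str]:
        return [text[i:i + size] for i in range(0, len(text), size)]

    book_chunks = chunk(open("book.txt").read())
    chunk_vecs = model.encode(book_chunks, normalize_embeddings=True)

    def retrieve(question: str, k: int = 5) -> list[str]:
        q = model.encode([question], normalize_embeddings=True)[0]
        scores = chunk_vecs @ q  # cosine similarity; vectors are normalized
        return [book_chunks[i] for i in np.argsort(scores)[-k:][::-1]]

    # Send only the retrieved chunks to the API instead of the whole book:
    context = "\n---\n".join(retrieve("Who betrays the protagonist?"))

That keeps each question down to a few thousand input tokens instead of re-sending the entire book every turn.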


So do it locally after predigesting the book, so that you have the entire KV-cache for it.

Then load that KV-cache and add your prompt.
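A minimal sketch of that with Hugging Face transformers (the model name and book file are placeholders; a real version would also persist the cache to disk, and copy it per question since generate mutates it):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = "mistralai/Mistral-7B-v0.1"  # placeholder; any local causal LM
    tok = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(
        name, torch_dtype=torch.float16, device_map="auto"
    )

    # Predigest: run the book through the model once, keep the KV cache.
    book_ids = tok(open("book.txt").read(),
                   return_tensors="pt").input_ids.to(model.device)
    with torch.no_grad():
        kv_cache = model(book_ids, use_cache=True).past_key_values

    # Per question: pass book + prompt ids but reuse the cache, so only
    # the new prompt tokens actually get processed.
    prompt_ids = tok("\n\nQ: Who betrays the protagonist?\nA:",
                     return_tensors="pt").input_ids.to(model.device)
    full_ids = torch.cat([book_ids, prompt_ids], dim=-1)
    out = model.generate(full_ids, past_key_values=kv_cache,
                         max_new_tokens=100)
    print(tok.decode(out[0, full_ids.shape[-1]:], skip_special_tokens=True))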


This might be a case where it's better not to use the API and just pay for the flat-rate subscription.



