I’ve been playing around with the OpenAI APIs for an AI project. My GPT-4 limit is 10k tokens per minute, which seems to be the current default for new accounts. I’m running into it constantly just in development, and that’s for a pretty low-key use case.
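To at least keep development moving under the per-minute cap, I’ve been wrapping calls in a simple retry with exponential backoff on rate-limit errors. This is just a minimal sketch assuming the v1+ Python openai client; the model name and retry parameters are placeholders for whatever you’re actually using:

```python
import time
from openai import OpenAI, RateLimitError

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def chat_with_backoff(messages, model="gpt-4", max_retries=5, base_delay=2.0):
    """Retry a chat completion with exponential backoff when rate limited."""
    delay = base_delay
    for attempt in range(max_retries):
        try:
            return client.chat.completions.create(model=model, messages=messages)
        except RateLimitError:
            # Out of retries: surface the error instead of looping forever.
            if attempt == max_retries - 1:
                raise
            time.sleep(delay)
            delay *= 2

resp = chat_with_backoff([{"role": "user", "content": "Hello"}])
print(resp.choices[0].message.content)
```

It doesn’t raise the ceiling, obviously, it just smooths out bursts so a dev loop doesn’t die the moment it hits the limit.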
It seems like using it in production for even a modestly successful product would require something like 1000x higher limits.
According to OpenAI’s docs, they are not considering rate limit increases for GPT-4 at all. Azure OpenAI has 2x higher starting limits, but that’s still nowhere near enough, and they aren’t considering rate limit increases at the moment either.
I’m sure this will change with time, and I understand OpenAI’s reasons for rolling out access gradually. Still, I see services out there that appear to lean heavily on GPT-4, or at least find ways of getting comparable quality, that seem like they shouldn’t be possible without far higher limits. I’m curious whether anyone can speak in general terms about how this is being accomplished. Do they have higher limits from earlier stages of the rollout? Special deals with OpenAI/MSFT? Are they using fine-tuning and other strategies to get 3.5 (or other models) close to GPT-4-level quality? Or TOS-violating hacks like rotating requests across many API keys?
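On the fine-tuning angle, the mechanics at least are straightforward if fine-tuning is available for the model you want; whether it actually closes the quality gap is the open question. A minimal sketch with the v1+ Python client, where the training file name is just a placeholder:

```python
from openai import OpenAI

client = OpenAI()

# Upload a JSONL file of chat-formatted training examples
# (each line: {"messages": [{"role": ..., "content": ...}, ...]}).
training_file = client.files.create(
    file=open("train_examples.jsonl", "rb"),
    purpose="fine-tune",
)

# Kick off a fine-tuning job against 3.5 Turbo.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id, job.status)
```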
Building in a way that lets users make any required GPT-4 calls locally with their own API key seems like a possibility as well, depending on the app, but that obviously limits the audience and isn’t great for UX or onboarding.
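For what it’s worth, the bring-your-own-key pattern is about as simple as it gets with the Python client; a sketch (the function name and plumbing are just illustrative):

```python
from openai import OpenAI

def make_user_client(user_api_key: str) -> OpenAI:
    # Requests through this client are billed to, and rate-limited
    # against, the user's own account rather than yours.
    return OpenAI(api_key=user_api_key)

client = make_user_client("sk-...")  # key supplied by the user at runtime
resp = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello"}],
)
```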
For my use case, GPT-4 is just barely reaching the point of viability: not perfect, but good enough to provide significant value, whereas 3.5 Turbo is woefully inadequate. It doesn’t necessarily seem like a bad idea to build within the limits for now and be well positioned when they finally get increased, but I’m mainly wondering whether everyone’s in the same boat on this or whether people are finding legitimate workarounds that don’t require some insider connection.