GPT Rate Limits

Rate limits are restrictions that an API imposes on how often a user or client can access the server within a specified timeframe. An organization's current limits are listed under the rate limits section of its account management page.

As GPT-4 is rolled out, it will have stricter rate limits so that capacity can keep up with demand. The current limits can be viewed under the rate limits section of the account page. Because capacity is limited, requests for rate limit increases cannot be accommodated at this time. The initial priority is granting general access to GPT-4; rate limits will be raised gradually as capacity permits.

OpenAI applies rate limits to the requests made to its API. The limits may be enforced on a per-minute basis across several measures: requests, tokens, or, in the case of image models, images.
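To make the per-minute idea concrete, here is a minimal client-side sketch in Python that tracks both a request budget and a token budget over the last 60 seconds. It is only an illustration: the class name MinuteWindowLimiter and its methods are hypothetical, and the sliding-window bookkeeping is an assumption about how a client might stay under its limits, not a description of how OpenAI enforces them on the server.

```python
import time
from collections import deque


class MinuteWindowLimiter:
    """Hypothetical client-side tracker for per-minute request and token budgets."""

    def __init__(self, rpm, tpm, clock=time.monotonic):
        self.rpm = rpm          # allowed requests per minute
        self.tpm = tpm          # allowed tokens per minute
        self.clock = clock      # injectable clock, handy for testing
        self.events = deque()   # (timestamp, tokens) for each admitted request

    def _evict_old(self, now):
        # Drop requests that are at least 60 seconds old.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens):
        """Return True and record the request if it fits both budgets, else False."""
        now = self.clock()
        self._evict_old(now)
        used_tokens = sum(t for _, t in self.events)
        if len(self.events) + 1 > self.rpm or used_tokens + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        return True
```

A caller would check try_acquire(estimated_tokens) before sending a request and wait briefly when it returns False. The token count passed in is the caller's own estimate.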

Default Organization Rate Limits

Rate limits are measured in two ways: RPM (requests per minute) and TPM (tokens per minute). 

MODEL                                 RPM     TPM

CHAT
gpt-3.5-turbo                         3,500   90,000
gpt-3.5-turbo-0301                    3,500   90,000
gpt-3.5-turbo-0613                    3,500   90,000
gpt-3.5-turbo-16k                     3,500   180,000
gpt-3.5-turbo-16k-0613                3,500   180,000
gpt-4                                 200     40,000
gpt-4-0314                            200     40,000
gpt-4-0613                            200     40,000

TEXT
ada                                   3,000   250,000
ada-code-search-code                  3,000   250,000
ada-code-search-text                  3,000   250,000
ada-search-document                   3,000   250,000
ada-search-query                      3,000   250,000
ada-similarity                        3,000   250,000
babbage                               3,000   250,000
babbage-code-search-code              3,000   250,000
babbage-code-search-text              3,000   250,000
babbage-search-document               3,000   250,000
babbage-search-query                  3,000   250,000
babbage-similarity                    3,000   250,000
code-davinci-edit-001                 20      150,000
code-search-ada-code-001              3,000   250,000
code-search-ada-text-001              3,000   250,000
code-search-babbage-code-001          3,000   250,000
code-search-babbage-text-001          3,000   250,000
curie                                 3,000   250,000
curie-instruct-beta                   3,000   250,000
curie-search-document                 3,000   250,000
curie-search-query                    3,000   250,000
curie-similarity                      3,000   250,000
davinci                               3,000   250,000
davinci-instruct-beta                 3,000   250,000
davinci-search-document               3,000   250,000
davinci-search-query                  3,000   250,000
davinci-similarity                    3,000   250,000
text-ada-001                          3,000   250,000
text-babbage-001                      3,000   250,000
text-curie-001                        3,000   250,000
text-davinci-001                      3,000   250,000
text-davinci-002                      3,000   250,000
text-davinci-003                      3,000   250,000
text-davinci-edit-001                 20      150,000
text-embedding-ada-002                3,000   1,000,000
text-search-ada-doc-001               3,000   250,000
text-search-ada-query-001             3,000   250,000
text-search-babbage-doc-001           3,000   250,000
text-search-babbage-query-001         3,000   250,000
text-search-curie-doc-001             3,000   250,000
text-search-curie-query-001           3,000   250,000
text-search-davinci-doc-001           3,000   250,000
text-search-davinci-query-001         3,000   250,000
text-similarity-ada-001               3,000   250,000
text-similarity-babbage-001           3,000   250,000
text-similarity-curie-001             3,000   250,000
text-similarity-davinci-001           3,000   250,000

MODERATION
text-moderation-latest                1,000   1,250
text-moderation-stable                1,000   1,250

IMAGE                                 IMG / MIN
DALL·E 2                              50

AUDIO
whisper-1                             50      25,000,000

OTHER
Default limits for all other models   3,000   250,000
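
Because each model has both an RPM and a TPM limit, whichever is exhausted first sets the practical ceiling. The short Python sketch below works through that arithmetic for two rows of the table. The function name max_requests_per_minute and the assumption that every request uses the same number of tokens are illustrative simplifications, not part of any official API.

```python
# Default limits copied from the table above: model -> (RPM, TPM).
DEFAULT_LIMITS = {
    "gpt-3.5-turbo": (3500, 90000),
    "gpt-4": (200, 40000),
}


def max_requests_per_minute(rpm, tpm, tokens_per_request):
    """Requests per minute that fit under both limits, assuming uniform request size."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    # The token budget allows tpm // tokens_per_request requests; RPM caps the rest.
    return min(rpm, tpm // tokens_per_request)


for model, (rpm, tpm) in DEFAULT_LIMITS.items():
    print(model, max_requests_per_minute(rpm, tpm, tokens_per_request=500))
# gpt-3.5-turbo 180
# gpt-4 80
```

With 500-token requests, the token limit is the binding constraint for both models. With very small requests (for example 10 tokens each), gpt-4 would instead be capped by its 200 RPM limit.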