GPT Rate Limits - ChatGPT 5

GPT Rate limits, imposed by an API to control the frequency of user or client server access within a specified timeframe, are accessible under the rate limits section on the account management page for any given organization.

As GPT-4 is rolled out, it will exhibit stricter rate limits in order to meet the demand. The current rate limits can be inspected under the rate limits section on the account page. However, due to limited capacity, requests for increases in rate limits cannot be entertained at this moment. Initially, the priority lies in granting general access to GPT-4. Later, as the capacity permits, the rate limits will be gradually increased.

OpenAI applies rate limits to the number of requests made to its API. The limitations could be enforced on a per-minute basis across several parameters like requests, tokens, or in the scenario of image models, images.

Default Organization Rate Limits

Rate limits are measured in two ways: RPM (requests per minute) and TPM (tokens per minute).

MODEL	RPM	TPM
CHAT
gpt-3.5-turbo	3,500	90,000
gpt-3.5-turbo-0301	3,500	90,000
gpt-3.5-turbo-0613	3,500	90,000
gpt-3.5-turbo-16k	3,500	180,000
gpt-3.5-turbo-16k-0613	3,500	180,000
gpt-4	200	40,000
gpt-4-0314	200	40,000
gpt-4-0613	200	40,000
TEXT
ada	3,000	250,000
ada-code-search-code	3,000	250,000
ada-code-search-text	3,000	250,000
ada-search-document	3,000	250,000
ada-search-query	3,000	250,000
ada-similarity	3,000	250,000
babbage	3,000	250,000
babbage-code-search-code	3,000	250,000
babbage-code-search-text	3,000	250,000
babbage-search-document	3,000	250,000
babbage-search-query	3,000	250,000
babbage-similarity	3,000	250,000
code-davinci-edit-001	20	150,000
code-search-ada-code-001	3,000	250,000
code-search-ada-text-001	3,000	250,000
code-search-babbage-code-001	3,000	250,000
code-search-babbage-text-001	3,000	250,000
curie	3,000	250,000
curie-instruct-beta	3,000	250,000
curie-search-document	3,000	250,000
curie-search-query	3,000	250,000
curie-similarity	3,000	250,000
davinci	3,000	250,000
davinci-instruct-beta	3,000	250,000
davinci-search-document	3,000	250,000
davinci-search-query	3,000	250,000
davinci-similarity	3,000	250,000
text-ada-001	3,000	250,000
text-babbage-001	3,000	250,000
text-curie-001	3,000	250,000
text-davinci-001	3,000	250,000
text-davinci-002	3,000	250,000
text-davinci-003	3,000	250,000
text-davinci-edit-001	20	150,000
text-embedding-ada-002	3,000	1,000,000
text-search-ada-doc-001	3,000	250,000
text-search-ada-query-001	3,000	250,000
text-search-babbage-doc-001	3,000	250,000
text-search-babbage-query-001	3,000	250,000
text-search-curie-doc-001	3,000	250,000
text-search-curie-query-001	3,000	250,000
text-search-davinci-doc-001	3,000	250,000
text-search-davinci-query-001	3,000	250,000
text-similarity-ada-001	3,000	250,000
text-similarity-babbage-001	3,000	250,000
text-similarity-curie-001	3,000	250,000
text-similarity-davinci-001	3,000	250,000
MODERATION
text-moderation-latest	1,000	1,250
text-moderation-stable	1,000	1,250
IMAGE		IMG / MIN
DALL·E 2	∞	50
AUDIO
whisper-1	50	25,000,000
OTHER
Default limits for all other models	3,000	250,000

Default GPT Rate Limits