What is the maximum token limit for response generation in DSPY #1223
Replies: 1 comment
The 128k figure is the limit on the combined total of input and output tokens (the context window). If you need the model to produce 3k tokens of output, you can provide up to 125k tokens of input (including system and user messages); input plus output can never exceed 128k. Separately, the current model is restricted to a maximum of 4k (more precisely, 4096) tokens for its output, so `max_tokens` cannot exceed 4096.
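The budget arithmetic above can be sketched in a few lines. This is an illustration of the two limits described in the answer, not an official OpenAI API; the constant values come from the thread (128k context window, 4096-token completion cap for GPT-4o):

```python
# Two separate limits, as described above:
CONTEXT_WINDOW = 128_000   # combined input + output tokens
COMPLETION_CAP = 4_096     # maximum output (completion) tokens

def max_input_tokens(desired_output_tokens: int) -> int:
    """Largest prompt that still leaves room for the desired output."""
    if desired_output_tokens > COMPLETION_CAP:
        # Mirrors the API's 400 error for an oversized max_tokens
        raise ValueError(
            f"max_tokens is too large: {desired_output_tokens}. "
            f"This model supports at most {COMPLETION_CAP} completion tokens."
        )
    return CONTEXT_WINDOW - desired_output_tokens

print(max_input_tokens(3_000))  # 125000 tokens left for the prompt
```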
I am using the GPT-4o LM for my project. I checked that its token limit is 128,000. However, when I set max_tokens=50000, I get the following error:
```
   1029             log.debug("Re-raising status error")
-> 1030             raise self._make_status_error_from_response(err.response) from None
   1031
   1032         return self._process_response(

BadRequestError: Error code: 400 - {'error': {'message': 'max_tokens is too large: 50000. This model supports at most 4096 completion tokens, whereas you provided 50000.', 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': None}}
```
Can someone explain why this is happening?
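One way to avoid this error is to clamp the requested budget to the model's completion cap before making the request. A minimal sketch, assuming the 4096-token cap reported in the error message (the helper name is hypothetical, and the commented `dspy.LM` usage assumes DSPy's standard LM constructor):

```python
# Clamp the requested completion budget so the request never
# triggers the 400 BadRequestError shown above.
GPT4O_COMPLETION_CAP = 4_096  # from the error message

def safe_max_tokens(requested: int, cap: int = GPT4O_COMPLETION_CAP) -> int:
    """Return a max_tokens value the model will accept."""
    return min(requested, cap)

# Hypothetical usage when configuring the LM:
# lm = dspy.LM("openai/gpt-4o", max_tokens=safe_max_tokens(50_000))
print(safe_max_tokens(50_000))  # 4096
```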