External Publication
Visit Post

Switching APIs from 4.1 mini to 5.1 mini - a lot more tokens generated

OpenAI Developer Community March 9, 2026
Source

You can set a max_completion_tokens to force it, though you might want to experiment with your prompt to make the model more succinct. Literally ask it to summarise, be succinct etc.

Discussion in the ATmosphere

Loading comments...