Switching APIs from 4.1 mini to 5.1 mini - a lot more tokens generated
OpenAI Developer Community
March 9, 2026
You can set a max_completion_tokens to force it, though you might want to experiment with your prompt to make the model more succinct. Literally ask it to summarise, be succinct etc.
Discussion in the ATmosphere