Gpt-5.4 ignores reasoning_effort="none" when max_completion_tokens is used
OpenAI Developer Community
April 26, 2026
Thanks for retesting. This looks like a separate follow-up from the original reasoning_effort: "none" + max_completion_tokens issue.
If reasoning_tokens is now 0 and visible text is returned, the original reasoning-token bug looks fixed. The remaining gap you’re seeing is different: usage reports the full token cap, but the delivered text appears to tokenize to fewer visible tokens.
Since the docs say max_completion_tokens / max_output_tokens include generated output tokens, including reasoning tokens, this is worth checking with the server-side trace rather than guessing from local tokenization.
Could you share just one affected request ID and its usage block? That should be enough to verify what was counted versus returned.
Discussion in the ATmosphere