External Publication

GPT 5.2 Extended thinking webchat has unworkably slow token (4 tps) generation

OpenAI Developer Community February 5, 2026

If you can time token generation after start of first token (so not including thinking time, which is a different kettle of fish), that would help give some specifics on the problem. The modes are Auto, Instant, Thinking, and Extended Thinking on my web client. When I saw the problem, it only occurred on Thinking and Extended Thinking.

Discussion in the ATmosphere