External Publication

Codex Rate Limits Discussion Thread

OpenAI Developer Community May 20, 2026

I’ve tried hard to adapt to the new Codex limits instead of just complaining about them.

I wrapped my use cases in skills, optimized my AGENTS.md, added agent-specific npm scripts with limited output, narrowed task scopes, and tried older models like GPT-5.4 mini with low effort/intelligence settings.

But even then, the first message of a brand-new Codex chat can instantly consume around 10% of the hourly limit. That makes the limit feel unpredictable and very hard to work around, even when using the product carefully.

I used to promote Codex to colleagues because the pricing felt reasonable compared to Claude Code. I can’t honestly say that anymore. For the first time, moving more of this workflow to local models like Qwen actually starts to make practical sense.

I’m not asking for unlimited usage. I’m asking for predictable, usable limits and clearer visibility into what a task is going to cost before it consumes a significant part of the allowance. The change in token usage changed; this suddenly made existing accounts less usable. I am hoping someone from OpenAI reads this and reconsiders.

Discussion in the ATmosphere