External Publication
Visit Post

Rate limits for omni-moderation based on tier?

OpenAI Developer Community March 27, 2026
Source

The rate limits are displayed here: omni-moderation Model | OpenAI API

The limiting factor is “tokens per minute”, not requests per minute

As you can see even on highest tier, it only allows 500,000 tokens per minute. Compare that to the regular LLM models which allow far higher rate limits. There must be at least some Tier 5 users/developers complaining about this, no? Not to mention, it is not reasonable to expect someone to pay $1,000 up front just to be able to use a moderation API with a decent amount of traffic.

P.S. if you look at that documentation, text-moderation-latest and text-moderation-stable have much more reasonable/generous rate limits than omni-moderation. However in all their wisdom OpenAI appears to have sunset these models without warning anybody, because when trying to request them, it says the model does not exist.

Discussion in the ATmosphere

Loading comments...