External Publication
Visit Post

New feature: moderation scores in Chat API responses, by parameter

OpenAI Developer Community June 21, 2026
Source
Thanks for raising this and for sharing the detailed context. The moderation scores returned with Responses and Chat Completions API calls are intended to help developers enforce their own application policies and decide whether generated content should be shown to users, especially when the output is flagged or exceeds a chosen threshold for the supported moderation categories. As noted in the documentation: > The model still generates normally. Review the moderation results before you show the output to a user or take downstream actions. In other words, moderation results included with a generation response do not prevent the model from processing the input. They are designed to provide visibility into the input and/or output so your application can take appropriate action before displaying content or triggering downstream workflows. If your goal is to screen user input before sending it for generation, I’d recommend using the standalone moderation endpoint directly. I also appreciate the note about the response object documentation and the concern that returning moderation scores alongside generation responses can feel redundant or potentially confusing. I’ll pass this feedback along to the team and follow up once there are updates or clarifications to share.

Discussion in the ATmosphere

Loading comments...