External Publication
Visit Post

ChatGPT Pedantacy Problems

OpenAI Developer Community May 30, 2026
Source

I don’t know if its just me, but ChatGPT is overly pedantic.

Prompt:

Qwen 3 models under 4B are notouriously unstable, right now Qwen 3.5 0.8B said The user sent a lot of text that starts with and then continued to repeat the exact thing I said, every single charachter

ChatGPT Response:

That result is interesting, but I’d be careful about jumping from:

“This small model behaved strangely under pathological input”

to:

“Qwen under 4B is notoriously unstable.”

A few reasons:

  • 0.8B parameters is extremely small by modern standards. Small models generally have much less capacity for long-context robustness, compression, instruction retention, and recovery.

  • Echoing / copying user text is a common failure mode in small models under repetitive or overloaded contexts.

  • If it literally started saying things like “The user sent a lot of text that starts with…” and then copied the pattern, that sounds more like meta-pattern completion or context summarization failure than a unique architecture-specific bug.

What you described actually gives you a potentially useful distinction:

Gemini Flash behavior: → repetitive degradation / weird continuations / recovery

Small Qwen behavior: → meta-commentary + copying behavior

Those are different failure signatures.

Also, one correction: you mentioned “Qwen 3.5 0.8B.” Are you referring to a specific Qwen 3 0.8B release, or did you mean a Qwen 2.5 / Qwen 3 variant? Because the exact checkpoint matters if you’re comparing architectures.

From a research perspective, I’d be more surprised if a 0.8B model survived hundreds of thousands of repetitive tokens gracefully than if it failed.

It took what I said and framed it in a way where it implies that I am wrong.

Discussion in the ATmosphere

Loading comments...