External Publication
Visit Post

Introducing GPT-5.4 mini and nano — our most capable small models yet

OpenAI Developer Community March 18, 2026
Source

It seems just about every bit of documentation is wrong.

  1. The models comparison page has rounded values for token pricing that are incorrect:

gpt-5.4-mini vs gpt-5-mini

Bad rounding: $0.075 → 0.08 $0.025 → 0.03

  1. The “patches” multiplier in documentation is all screwed up:

What it should look like, given the currently realized costs:

Model Multiplier
gpt-5.4-mini-2026-03-17 1.2x
gpt-5.4-nano-2026-03-17 1.2x
gpt-5.4-2026-03-05 1.2x
gpt-5.3-codex 1.2x
gpt-5.2-2025-12-11 1.2x
gpt-5-mini-2025-08-07 1.2x
gpt-5-nano-2025-08-07 1.5x
o4-mini-2025-04-16 1.72x
gpt-4.1-mini-2025-04-14 1.62x
gpt-4.1-nano-2025-04-14 2.46x
gpt-5.3-codex 1.2x
codex-mini-latest 1.72x
  1. The maximum vision input size of the new GPT-5.4-mini and nano models is wrong. You can send 1600x1600 and get billed for 50x50 patches = 2500 (haven’t tested “original”).

  2. Chat Completions with gpt-5.4-mini and nano is resizing wrong or billing wrong (cheaper). Here is sending that 1600x1600px for 2500 patches/tokens:

model vision mult ChatC Ccalculated Responses Rcalculated
gpt-5.4 patch 1.2 2813 2338 3008 2500
gpt-5.4-mini patch 1.2 2813 2338 3008 2500
gpt-5.4-nano patch 1.2 2813 2338 3008 2500
gpt-5-mini patch 1.2 1834 1522 1833 1521
gpt-5-nano patch 1.5 2290 1522 2289 1521

The amount of input “usage” received back per endpoint is the columns “ChatC” and “Responses”. Image consumption then by input difference due to inclusion of the image, and then reversing the apparent multiplier.

If it were downsized, like the original mini and nano: 1248 × 1248 px (39 × 39 patches) = 1521 tokens.

This is not the only model disparity in billing between API endpoints. Here’s sending the image to others to where it should be downsized. Price should be the same between Chat Completions and Responses, but is not.

model vision vision_mult chat input calculated responses input calculated
gpt-5.3-chat-latest patch 1.2 1667 1383 1833 1521
gpt-5.3-codex patch 1.2 - - 1833 1521
gpt-5.2-chat-latest patch 1.2 1667 1383 1833 1521

And I have even another way of sending the same image to Chat Completions - the same vision, cheaper price for me still on the “chat” model:

model vision vision_mult chat input calculated r rerror
gpt-5.4 patch 1.2 2813 2338 E400 E400
gpt-5.3-chat-latest patch 1.2 1548 1284 E400 E400

Vision price inflation

Ultimately, when I integrate what the API is currently costing into my own calculator, - the same image, downsized manually to the old “high” resolution (which doesn’t happen as documented), vision still costs 3x on mini and 3.2x on nano new models.

Updated models:

hotnova.com

OpenAI Vision Token Calculator

Discussion in the ATmosphere

Loading comments...