← Back to API models

GPT-5.4 mini

by OpenAI · budget tier · GPT-5.4 family

The cheaper sibling of GPT-5.4 — same 272K window, an eighth of the output price, built for well-defined tasks.

Input $0.25 / 1M
Output $2 / 1M
Context 272K tokens
Family GPT-5.4
OpenAI Platform ↗ Updated June 10, 2026
§ API pricing

Per-token rates.

Input
$0.25/1M tokens
Prompt tokens
  • A tenth of full GPT-5.4's rate
  • Matches Gemini 3.1 Flash-Lite
  • Vision inputs billed as tokens
Output
$2/1M tokens
Completion tokens
  • An eighth of full GPT-5.4's $15
  • Above Flash-Lite's $1.50
  • 5× GPT-5.4 nano's $0.40
Context
272Ktokens
Window
  • Same window as full GPT-5.4
  • No budget-tier window cut
  • ~200K words of practical input
Subscription
Freetier model
ChatGPT
  • Default model on ChatGPT Free
  • Included in Go ($8) and Plus ($20)
  • API is pay-as-you-go

Why GPT-5.4 mini exists

GPT-5.4 mini is the middle child of the GPT-5.4 family: smarter than nano, an eighth of the output price of full GPT-5.4, with the same 272K context window. OpenAI also uses it as the default model on ChatGPT's free tier, which tells you the positioning — good enough for everyday tasks at a cost OpenAI can give away.

On the API, the pitch is "well-defined tasks": summarization, extraction, classification, formulaic generation. When the prompt fully specifies the job, mini does it at 8–10× less than the full model. When the task needs planning or multi-step reasoning, the savings evaporate in retries — that's full GPT-5.4 territory.

Capabilities

Mini handles the same multimodal surface as its bigger sibling (vision in, structured output, tool calling) and is fast enough for interactive products. The honest weakness: it follows instructions more literally and plans less. Agent loops longer than a couple of steps, ambiguous prompts, and subtle code work all favor the full model.

Typical use cases

  • Summarization and extraction pipelines at volume
  • Classification, tagging, and routing
  • Templated content generation
  • Chat features where cost-per-message matters
  • First-pass triage before escalating to GPT-5.4 or 5.5

Sibling and rival comparison

ModelInput / 1MOutput / 1MContext
GPT-5.4 mini$0.25$2272K
GPT-5.4$2.50$15272K
GPT-5.4 nano$0.05$0.40272K
Gemini 3.1 Flash-Lite$0.25$1.501M
Claude Haiku 4.5$1$5200K
Mistral Small 3.1$0.20$0.60128K

Cross-family, Gemini 3.1 Flash-Lite matches mini's input price with cheaper output and a 1M window; Mistral Small undercuts it on both rates. Mini's draw is ecosystem: same API shape, tooling, and behavior as the rest of the GPT stack, so a GPT-5.5 product can route easy traffic to mini with one parameter change.

← See all OpenAI / ChatGPT plans