
Mistral Small 3.1

by Mistral AI · budget tier

Mistral's volume workhorse — the cheapest European-hosted general model on our ledger, with an open-weights sibling.

Input $0.20 / 1M
Output $0.60 / 1M
Context 128K tokens
Open weights Yes
La Plateforme · Updated June 10, 2026

API pricing

Per-token rates.

Input
$0.20/1M tokens
Prompt tokens
  • Under GPT-5.4 mini's $0.25
  • A tenth of Large 3's rate
  • Vision inputs supported
Output
$0.60/1M tokens
Completion tokens
  • Under a third of GPT-5.4 mini's $2
  • A tenth of Large 3's $6
  • Cheapest EU-hosted output here
Context
128K tokens
Window
  • Half of Large 3's 256K
  • ~96K words of practical input
  • Enough for most documents
Open weights
Apache-licensed sibling
Self-hosting
  • Downloadable open-weights release
  • Run on your own GPUs
  • Same family, no per-token fees

Why Small 3.1 exists

Small 3.1 is Mistral's answer for the 90% of API traffic that doesn't need a flagship: $0.20 input and $0.60 output per 1M tokens, undercutting GPT-5.4 mini on both rates. For teams with EU data-residency requirements, it's often the only model in this price class that ticks the compliance box without a US or Chinese provider in the loop.

The other thing no rival here offers: an open-weights sibling. If your volume grows to where per-token pricing hurts, you can move the same family onto your own GPUs — the API becomes a prototyping stage rather than a permanent bill.
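To make that break-even point concrete, here is a rough sketch. The API rates come from this page; the self-hosting budget (`gpu_monthly_usd`), the traffic volumes, and the output share are hypothetical placeholders rather than measured figures, and the sketch ignores engineering time and GPU utilization limits.

```python
# Rates from this page (USD per 1M tokens).
INPUT_RATE = 0.20
OUTPUT_RATE = 0.60


def api_monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Monthly API bill in USD for the given token volumes."""
    return (input_tokens / 1_000_000) * INPUT_RATE + (output_tokens / 1_000_000) * OUTPUT_RATE


def breakeven_tokens(gpu_monthly_usd: float, output_share: float) -> float:
    """Total monthly tokens at which the API bill equals a fixed GPU bill.

    output_share is the fraction of all tokens that are completion tokens.
    """
    blended_rate = (1 - output_share) * INPUT_RATE + output_share * OUTPUT_RATE
    return gpu_monthly_usd / blended_rate * 1_000_000


# Hypothetical workload: 2B input and 500M output tokens per month.
bill = api_monthly_cost(2_000_000_000, 500_000_000)
# Hypothetical self-hosting budget of $3,000/month with 20% of tokens as output.
threshold = breakeven_tokens(3_000.0, 0.20)
print(f"API bill: ${bill:,.2f}")
print(f"Break-even volume: {threshold / 1e9:.1f}B tokens/month")
```

Below the break-even volume the API is the cheaper option under these assumptions; above it, a fixed GPU budget starts to win, provided the hardware can actually serve that load.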

Capabilities

Small 3.1 is a competent generalist with vision support: classification, extraction, summarization, routine drafting, and solid multilingual coverage across European languages. Tool calling works, and so do simple agent loops.

The honest weakness: the 128K window is the smallest in its class, and hard reasoning is out of scope — that's Large 3 territory, or a different provider entirely.
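A quick way to check whether a document is likely to fit the 128K window is sketched below. It uses the 0.75 words-per-token ratio implied by the "~96K words" figure on this page, which is only a rough heuristic and not a real tokenizer. The `reserve_for_output` default is a hypothetical value, and the sketch assumes the window is shared between prompt and completion.

```python
CONTEXT_TOKENS = 128_000  # window from this page
WORDS_PER_TOKEN = 0.75    # implied by "~96K words" for 128K tokens; rough heuristic


def estimated_tokens(text: str) -> int:
    """Rough token estimate from a whitespace word count (not a real tokenizer)."""
    words = len(text.split())
    return round(words / WORDS_PER_TOKEN)


def fits_in_window(text: str, reserve_for_output: int = 4_000) -> bool:
    """True if the estimated prompt leaves room for the reserved completion tokens."""
    return estimated_tokens(text) + reserve_for_output <= CONTEXT_TOKENS


# Synthetic documents of 90K and 96K words.
print(fits_in_window("word " * 90_000))
print(fits_in_window("word " * 96_000))
```

For anything close to the limit, counting with the model's actual tokenizer is the safer choice, since non-English text and code often use more tokens per word.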

Typical use cases

  • EU data-residency-compliant volume pipelines
  • Classification, extraction, and summarization at scale
  • Multilingual European-language products
  • Prototype on API, graduate to self-hosted open weights
  • Cost-floor chat features

Sibling and rival comparison

Model                   Input / 1M   Output / 1M   Context
Mistral Small 3.1       $0.20        $0.60         128K
Mistral Large 3         $2           $6            256K
GPT-5.4 mini            $0.25        $2            272K
Gemini 3.1 Flash-Lite   $0.25        $1.50         1M
DeepSeek V4-Flash       $0.14        $0.28         1M

On pure price, only DeepSeek V4-Flash beats it, and that means Chinese-hosted infrastructure. Against GPT-5.4 mini and Gemini 3.1 Flash-Lite, Small 3.1 trades context window for cheaper output and EU jurisdiction. If sovereignty matters, this is the budget pick; if window size matters, look at the 1M rivals.
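The trade-off can be checked against your own traffic with a short sketch like this one. The rates and windows are copied from the comparison table; treating "1M" as exactly 1,000,000 tokens is an assumption, and the workload numbers are hypothetical.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_rate: float   # USD per 1M input tokens
    output_rate: float  # USD per 1M output tokens
    context: int        # window in tokens ("1M" taken as 1,000,000)


# Figures copied from the comparison table on this page.
MODELS = [
    Model("Mistral Small 3.1", 0.20, 0.60, 128_000),
    Model("Mistral Large 3", 2.00, 6.00, 256_000),
    Model("GPT-5.4 mini", 0.25, 2.00, 272_000),
    Model("Gemini 3.1 Flash-Lite", 0.25, 1.50, 1_000_000),
    Model("DeepSeek V4-Flash", 0.14, 0.28, 1_000_000),
]


def cost(m: Model, input_tokens: int, output_tokens: int) -> float:
    """Bill in USD for the given token volumes on model m."""
    return (input_tokens * m.input_rate + output_tokens * m.output_rate) / 1_000_000


def rank(input_tokens: int, output_tokens: int, min_context: int = 0) -> list[Model]:
    """Models whose window is at least min_context, cheapest first."""
    eligible = [m for m in MODELS if m.context >= min_context]
    return sorted(eligible, key=lambda m: cost(m, input_tokens, output_tokens))


# Hypothetical workload: 100M input and 20M output tokens.
for m in rank(100_000_000, 20_000_000):
    print(f"{m.name}: ${cost(m, 100_000_000, 20_000_000):.2f}")
```

Passing `min_context=200_000` drops Small 3.1 from the list, which mirrors the point above: the price advantage only applies when the workload fits in 128K. Hosting jurisdiction is not modeled here and would need a separate filter.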
