The price sheet lies

List price tells you almost nothing about what AI really costs.

Kimi K2.6 is listed 50% cheaper than Nova 2.0 Pro Preview (medium). It costs 106% more to actually run.
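
Both percentages can be checked against the figures reported further down this page ($1.71 vs. $3.44 per million tokens at list, $839 vs. $407 measured). The following is a minimal sketch of that arithmetic; the helper name `pct_diff` is illustrative and not part of any published tooling.

```python
def pct_diff(value: float, baseline: float) -> float:
    """Percentage by which `value` differs from `baseline` (negative = lower)."""
    return (value - baseline) / baseline * 100.0

# Figures as reported on this page.
list_gap = pct_diff(1.71, 3.44)  # Kimi K2.6 list price vs. Nova 2.0 Pro Preview (medium)
run_gap = pct_diff(839, 407)     # measured cost to run the Intelligence Index

print(f"list price: {list_gap:+.0f}%")  # list price: -50%
print(f"run cost:   {run_gap:+.0f}%")   # run cost:   +106%
```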

The flip

Same models, two orderings

Each model is ranked by list price on the left and, on the right, by what it actually costs to run Artificial Analysis's Intelligence Index. Follow a line across: where it climbs, the sticker price lied.

Chart: list price ($ per million tokens) vs. actual cost ($ to run the Intelligence Index)

Model | List price ($/Mtok) | Actual cost to run the Index | Cost per task
gpt-oss-20B (high) | $0.09 | $30 | $0.0178
MiniMax-M2.7 | $0.53 | $144 | $0.0742
MiniMax-M3 | $0.53 | $235 | $0.1567
MiMo-V2.5-Pro | $0.54 | $99 | $0.0322
Gemini 3.1 Flash-Lite | $0.56 | $95 | $0.0430
Qwen3.5 122B A10B (Reasoning) | $1.10 | $447 | $0.2412
Nemotron 3 Ultra 550B A55B (Reasoning) | $1.18 | $444 | $0.2446
Qwen3.5 397B A17B (Reasoning) | $1.35 | $528 | $0.3331
Kimi K2.6 | $1.71 | $839 | $0.3146
GLM-5.1 (Reasoning) | $2.15 | $674 | $0.2404
Gemini 3.5 Flash (high) | $3.38 | $1,142 | $0.6811
Nova 2.0 Pro Preview (medium) | $3.44 | $407 | $0.1728
GPT-5.5 (xhigh) | $11.25 | $2,588 | $0.8261

Legend: Cheaper on paper, pricier to run · Looks pricey, secretly a bargain · Every other measured model

13 models with both a list price and a measured run cost

Why the sticker lies

You don't buy tokens. You buy answers.

A price sheet quotes dollars per million tokens. But a task isn't a fixed number of tokens — and that's where the bill hides.

List price is per token

The number on the pricing page is $ per million input/output tokens. It says nothing about how many tokens a model will spend to finish your task.
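
The list prices on this page are quoted as a blended 3:1 input:output rate. A minimal sketch of how such a blend might be computed, assuming it is a simple weighted average of the two per-token prices (the exact formula is not stated here); the function name and the example prices are hypothetical.

```python
def blended_price(input_per_mtok: float, output_per_mtok: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average of input and output prices, in $ per million tokens.

    Assumes the blend is a plain weighted mean; 3:1 is the ratio this page cites.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_per_mtok + output_parts * output_per_mtok) / total_parts

# Hypothetical prices for illustration only, not taken from any real price sheet.
print(blended_price(1.00, 4.00))  # 3 parts at $1.00 plus 1 part at $4.00 -> 1.75
```

Whatever the exact formula, the blend is still a price per token, so it carries no information about how many tokens a given task will consume.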

Reasoning models are verbose

A "thinking" model can emit 10–50× more tokens chewing through the same problem. Cheap per token × a mountain of tokens = an expensive answer.

The real unit is the task

Run the same fixed benchmark across models and the cheap-looking ones often cost the most. Rank by the finished job, not the sticker.
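
The three points above reduce to one multiplication: price per token times tokens spent. A minimal sketch with two hypothetical models; the prices and token counts are made up for illustration and are not measurements.

```python
def cost_per_task(price_per_mtok: float, tokens_used: int) -> float:
    """Dollar cost of one task at a flat price per million tokens."""
    return price_per_mtok * tokens_used / 1_000_000

# Hypothetical models: one pricier but terse, one cheap but verbose.
terse = cost_per_task(price_per_mtok=3.00, tokens_used=2_000)
verbose = cost_per_task(price_per_mtok=1.00, tokens_used=40_000)  # 20x the tokens

print(f"terse model:   ${terse:.4f} per task")
print(f"verbose model: ${verbose:.4f} per task")
print("verbose model costs more per task:", verbose > terse)
```

In this made-up case the verbose model is a third of the price per token yet several times the price per finished task, which is the pattern the measured figures on this page show.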

Receipts

Where list price misleads the most

Sorted by how far a model moves when you re-rank by real cost. ▲ means it's pricier than its sticker suggests; ▼ means it's a quiet bargain.

Model | Provider | List $/Mtok | Actual run cost | Re-rank
Nova 2.0 Pro Preview (medium) | Amazon | $3.44 | $407 | ▼ 6 cheaper
Gemini 3.1 Flash-Lite | Google | $0.56 | $95 | ▼ 3 cheaper
Kimi K2.6 | Kimi | $1.71 | $839 | ▲ 2 pricier
MiniMax-M3 | MiniMax | $0.53 | $235 | ▲ 2 pricier
Qwen3.5 122B A10B (Reasoning) | Alibaba | $1.10 | $447 | ▲ 2 pricier
MiniMax-M2.7 | MiniMax | $0.53 | $144 | ▲ 2 pricier
Qwen3.5 397B A17B (Reasoning) | Alibaba | $1.35 | $528 | ▲ 1 pricier
Gemini 3.5 Flash (high) | Google | $3.38 | $1,142 | ▲ 1 pricier
MiMo-V2.5-Pro | Xiaomi | $0.54 | $99 | ▼ 1 cheaper
Nemotron 3 Ultra 550B A55B (Reasoning) | NVIDIA | $1.18 | $444 | even

List price is the blended 3:1 input:output rate. "Actual run cost" is what it costs to run the full Artificial Analysis Intelligence Index, as measured by Artificial Analysis. Because the task suite is fixed, the only variable is how each model behaves.
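
The re-rank column can be reproduced from the list prices and run costs reported on this page. A minimal sketch, assuming the re-rank is simply each model's position when sorted by run cost minus its position when sorted by list price, with models tied on list price kept in the order the page shows them (the page does not state its tie-breaking rule); `rank_shifts` is an illustrative name.

```python
# (model, list price in $/Mtok, measured $ to run the Intelligence Index),
# as reported on this page, in the page's list-price order.
MODELS = [
    ("gpt-oss-20B (high)", 0.09, 30),
    ("MiniMax-M2.7", 0.53, 144),
    ("MiniMax-M3", 0.53, 235),
    ("MiMo-V2.5-Pro", 0.54, 99),
    ("Gemini 3.1 Flash-Lite", 0.56, 95),
    ("Qwen3.5 122B A10B (Reasoning)", 1.10, 447),
    ("Nemotron 3 Ultra 550B A55B (Reasoning)", 1.18, 444),
    ("Qwen3.5 397B A17B (Reasoning)", 1.35, 528),
    ("Kimi K2.6", 1.71, 839),
    ("GLM-5.1 (Reasoning)", 2.15, 674),
    ("Gemini 3.5 Flash (high)", 3.38, 1142),
    ("Nova 2.0 Pro Preview (medium)", 3.44, 407),
    ("GPT-5.5 (xhigh)", 11.25, 2588),
]

def rank_shifts(models):
    """Return {name: cost_rank - list_rank}; positive = pricier than the sticker suggests."""
    # sorted() is stable, so models tied on list price keep their input order.
    by_list = sorted(models, key=lambda m: m[1])
    by_cost = sorted(models, key=lambda m: m[2])
    list_rank = {m[0]: i for i, m in enumerate(by_list)}
    cost_rank = {m[0]: i for i, m in enumerate(by_cost)}
    return {name: cost_rank[name] - list_rank[name] for name in list_rank}

shifts = rank_shifts(MODELS)
# Largest movers first, mirroring the table's sort order.
for name, shift in sorted(shifts.items(), key=lambda kv: -abs(kv[1])):
    if shift == 0:
        label = "even"
    else:
        label = f"{abs(shift)} {'pricier' if shift > 0 else 'cheaper'}"
    print(f"{name}: {label}")
```

Under these assumptions the shifts match the table: Nova 2.0 Pro Preview (medium) drops six places, Gemini 3.1 Flash-Lite drops three, and Kimi K2.6 climbs two.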

Citation

Use this in your work

Every figure here is open data. Here's a ready-made citation, plus the methodology behind the numbers and the full State of AI.

Copy a citation

Free to use and cite under CC BY 4.0. See how this is measured.

APA

Champlin Enterprises. (2026). The AI Price Sheet Lies (MyTokenTracker) [Data set]. MyTokenTracker. Retrieved June 20, 2026, from https://mytokentracker.io/price-vs-cost

BibTeX
@misc{mytokentracker-price-vs-cost,
  title        = {The AI Price Sheet Lies (MyTokenTracker)},
  author       = {{Champlin Enterprises}},
  year         = {2026},
  howpublished = {MyTokenTracker, \url{https://mytokentracker.io/price-vs-cost}},
  note         = {Accessed June 20, 2026. Licensed CC BY 4.0.},
  url          = {https://mytokentracker.io/price-vs-cost}
}

Need a fixed point in time? Every day’s data is permanently archived in the open-data repository, so you can cite a specific date by linking that day’s committed file.
