tax

Anthropic's +35% tokenizer tax on code and JSON

Published: 2026-05-27

Claude Opus 4.7 ships with a quietly retrained tokenizer that splits structural payloads — JSON, Python, TypeScript, regex — into noticeably more tokens than Opus 4.6 did on the same input. Our regression fixtures across 200 mixed prompts show a consistent +35% token count when the body contains structural markers ({, }, def, function, type annotations).

At the sticker price of $5/Mtok input and $25/Mtok output, the underlying rate looks competitive with GPT-5.5 — but on JSON-heavy agentic workloads, the effective cost is closer to $6.75/Mtok input and $33.75/Mtok output. That's a hidden 20–30% premium versus what a buyer would estimate from the price page alone.

The TokenFloor calculator detects this automatically: if the input contains structural markers and the model is in the Claude 4 family, the cost row is shown in EU gold with a tooltip explaining the multiplier. Anthropic has not published a changelog entry for the tokenizer change. For now: budget +35% on code-heavy Claude 4 traffic, or route the same workload to a sibling model.