Interactive LLM Leaderboard 2026: Compare AI Models, API Cost & ROI
Compare 350+ LLMs in 2026: live API pricing, ROI scores, coding, reasoning, speed, and context. Built for teams in the US, Canada, and Australia evaluating OpenAI, Anthropic, Google Gemini, and open-weight models.
Why teams use this live LLM comparison table in 2026
This leaderboard blends normalized benchmarks with transparent estimated monthly API spend so you can shortlist models fast. Whether you procure from the United States, Canada, or Australia, you get one place to compare flagship chat models, long-context SKUs, and cost-optimized tiers—then jump into comparisons or embed the same data on your site.
Workload & pricing toggles
Same three scenarios as the main AI API calculator: moderate traffic, large RAG-style context, or per-request max tokens with a lower request count.
Include Vision / Image Processing
Off — no image fees in cost estimates for vision-capable models.
Turn On to include image fees.
Use Cached Pricing
Enable to get 50% off input tokens where cached rates apply
Deep Reasoning / Thinking Mode
Model hidden reasoning / extended thinking charged like output tokens when enabled.
Batch Pricing
Enable for 50% off input & output where batch/async pricing applies
Cached / batch est. monthly values only change after the pipeline sets supports_caching or supports_batch in Supabase. The toggles here narrow the table to models whose catalog or provider typically supports those modes.
Magic quadrant (top 15)
X: est. monthly · Y: Overall · Dot: provider color · Hover for rank, model & detailsFull leaderboard
Showing 48 of 327 models.
| Pick | Model | Est. monthly | ROI score | Coding | Reasoning | Speed | Math | Context | Overall |
|---|---|---|---|---|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $2.60 | 84 | 96 | 97 | 85 | 98 | 131K | 97 | |
| Grok 3 | $270.00 | 61 | 93 | 93 | 55 | 96 | 131K | 94 | |
| Google: Gemma 4 31B | $9.00 | 72 | 97 | 92 | 70 | 97 | 262K | 94 | |
| Z.ai: GLM 5 | $44.80 | 64 | 99 | 93 | 55 | 84 | 203K | 92 | |
| Z.ai: GLM 4.7 | $32.60 | 64 | 94 | 87 | 85 | 96 | 203K | 91 | |
| Z.ai: GLM 5.1 | $77.00 | 61 | 92 | 89 | 70 | 85 | 203K | 89 | |
| Qwen: Qwen3.5 397B A17B | $39.00 | 62 | 85 | 89 | 60 | 92 | 262K | 89 | |
| Qwen: Qwen3 32B | $5.60 | 73 | 85 | 89 | 60 | 92 | 41K | 89 | |
| Qwen: Qwen3 Max | $70.20 | 61 | 93 | 87 | 70 | 89 | 262K | 89 | |
| Mistral: Mistral Medium 3 | $36.00 | 63 | 92 | 87 | 70 | 91 | 131K | 89 | |
| Qwen: Qwen3.5-27B | $23.40 | 64 | 80 | 91 | 70 | 92 | 262K | 88 | |
| Baidu: ERNIE 4.5 21B A3B | $5.60 | 72 | 85 | 89 | 60 | 87 | 120K | 88 | |
| Qwen: Qwen3 VL 32B Instruct | $8.32 | 69 | 88 | 88 | 65 | 87 | 131K | 88 | |
| Qwen: Qwen3.5-35B-A3B | $19.50 | 64 | 76 | 89 | 70 | 95 | 262K | 87 | |
| AllenAI: Olmo 3 32B Think | $11.00 | 67 | 90 | 84 | 50 | 88 | 66K | 87 | |
| Qwen: Qwen3.5-122B-A10B | $31.20 | 62 | 81 | 90 | 55 | 85 | 262K | 87 | |
| Anthropic: Claude Opus 4.5 | $450.00 | 57 | 85 | 84 | 55 | 95 | 200K | 87 | |
| Upstage: Solar Pro 3 | $12.00 | 66 | 85 | 89 | 65 | 80 | 128K | 86 | |
| Amazon: Nova Premier 1.0 | $225.00 | 57 | 89 | 89 | 55 | 77 | 1.0M | 86 | |
| Xiaomi: MiMo-V2-Pro | $70.00 | 59 | 85 | 89 | 60 | 80 | 1.0M | 86 | |
| Elephant | Free | 78 | 90 | 83 | 70 | 88 | 262K | 86 | |
| Anthropic: Claude Opus 4 | $1,350.00 | 55 | 85 | 81 | 55 | 95 | 200K | 86 | |
| Arcee AI: Trinity Large Thinking | $17.30 | 64 | 85 | 89 | 55 | 80 | 262K | 86 | |
| Meta: Llama 3.3 70B Instruct | $7.20 | 69 | 88 | 89 | 70 | 77 | 131K | 86 | |
| MoonshotAI: Kimi K2 0711 | $45.80 | 60 | 90 | 87 | 60 | 80 | 131K | 86 | |
| Deep Cogito: Cogito v2.1 671B | $62.50 | 59 | 85 | 89 | 60 | 80 | 128K | 86 | |
| OpenAI: GPT-5.1 Chat | $150.00 | 58 | 91 | 88 | 85 | 77 | 128K | 86 | |
| AionLabs: Aion-1.0-Mini | $42.00 | 61 | 85 | 85 | 95 | 88 | 131K | 86 | |
| xAI: Grok 4.20 | $140.00 | 57 | 88 | 87 | 55 | 76 | 2.0M | 85 | |
| OpenAI: GPT-5.2 | $210.00 | 57 | 85 | 89 | 65 | 78 | 400K | 85 | |
| Claude Sonnet 4.6 | $270.00 | 56 | 93 | 75 | 70 | 96 | 1.0M | 85 | |
| Claude Opus 4.6 | $450.00 | 56 | 85 | 79 | 55 | 95 | 1.0M | 85 | |
| NVIDIA: Nemotron Nano 9B V2 | $3.20 | 75 | 72 | 84 | 85 | 98 | 131K | 85 | |
| OpenAI: GPT-5.1-Codex-Mini | $30.00 | 61 | 84 | 82 | 85 | 92 | 400K | 85 | |
| OpenAI: GPT-5.5 | $500.00 | 55 | 92 | 88 | 55 | 71 | 1.1M | 85 | |
| xAI: Grok 3 Mini Beta | $17.00 | 63 | 90 | 87 | 95 | 77 | 131K | 85 | |
| Z.ai: GLM 5V Turbo | $88.00 | 58 | 88 | 83 | 70 | 80 | 203K | 84 | |
| xAI: Grok 3 Mini | $17.00 | 63 | 88 | 86 | 92 | 76 | 131K | 84 | |
| DeepSeek: DeepSeek V4 Flash | $8.40 | 67 | 90 | 83 | 95 | 80 | 1.0M | 84 | |
| Nous: Hermes 4 70B | $9.20 | 66 | 85 | 81 | 60 | 88 | 131K | 84 | |
| Auto Router | VARIABLE | 77 | 84 | 83 | 70 | 86 | 2.0M | 84 | |
| Qwen: Qwen3 235B A22B Thinking 2507 | $20.93 | 62 | 74 | 89 | 50 | 83 | 131K | 84 | |
| GLM 5 Turbo | $88.00 | 57 | 88 | 91 | 70 | 60 | 203K | 83 | |
| Tencent: Hunyuan A13B Instruct | $11.30 | 64 | 64 | 86 | 55 | 94 | 131K | 83 | |
| xAI: Grok 4 | $270.00 | 55 | 87 | 80 | 70 | 87 | 256K | 83 | |
| Xiaomi: MiMo-V2.5-Pro | $70.00 | 57 | 78 | 78 | 70 | 95 | 1.0M | 82 | |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | $8.00 | 66 | 74 | 79 | 70 | 97 | 131K | 82 | |
| OpenAI: GPT-5.2 Pro | $2,520.00 | 52 | 85 | 85 | 55 | 73 | 400K | 82 |
Need a shareable artifact?
Download a print-ready PDF from the leaderboard and workload above. No email step—lead capture is off.
PDF Breakdown
Receive a comprehensive native vector PDF of this leaderboard: your workload, filters, top rankings, and a table snapshot.
By submitting, you agree to our Privacy Policy and Terms.
Whitelabel Overall Leaderboard
for your site
Embed the interactive overall view on your own domain — whitelabel branding, lead capture, and the same workload sliders your prospects already use on LeadsCalc.