Best Coding LLMs 2026: AI Models for Software Engineering
Rank the best coding LLMs in 2026 using SWE-bench signals, HumanEval benchmarks, and live API pricing. Compare autonomous editing and repo-scale tasks for US, Canadian, and Australian engineering orgs.
Compare coding LLMs with benchmarks and real API pricing in 2026
The coding tab prioritizes models that perform on real software engineering tasks—not trivia—then layers estimated monthly API cost for the same workload. Engineering leads from San Francisco to Toronto to Sydney use it to pair a daily-driver model with a cheaper tier for bulk codegen, while keeping vendor options defensible for security and procurement reviews.
Workload & pricing toggles
Same three scenarios as the main AI API calculator: moderate traffic, large RAG-style context, or per-request max tokens with a lower request count.
Include Vision / Image Processing
Off — no image fees in cost estimates for vision-capable models.
Turn On to include image fees.
Use Cached Pricing
Enable to get 50% off input tokens where cached rates apply
Deep Reasoning / Thinking Mode
Model hidden reasoning / extended thinking charged like output tokens when enabled.
Batch Pricing
Enable for 50% off input & output where batch/async pricing applies
Cached / batch est. monthly values only change after the pipeline sets supports_caching or supports_batch in Supabase. The toggles here narrow the table to models whose catalog or provider typically supports those modes.
Magic quadrant (top 15)
X: est. monthly · Y: Coding · Dot: provider color · Hover for rank, model & detailsFull leaderboard
Showing 48 of 327 models.
| Pick | Model | Est. monthly | ROI score | Coding | Reasoning | Speed | Math | Context | Overall |
|---|---|---|---|---|---|---|---|---|---|
| Z.ai: GLM 5 | $44.80 | 64 | 99 | 93 | 55 | 84 | 203K | 92 | |
| Morph: Morph V3 Large | $55.00 | 56 | 98 | 78 | 85 | 65 | 262K | 80 | |
| Google: Gemma 4 31B | $9.00 | 72 | 97 | 92 | 70 | 97 | 262K | 94 | |
| OpenAI: gpt-oss-20b | $2.60 | 84 | 96 | 97 | 85 | 98 | 131K | 97 | |
| Morph: Morph V3 Fast | $44.00 | 57 | 96 | 78 | 90 | 70 | 82K | 80 | |
| Z.ai: GLM 4.7 | $32.60 | 64 | 94 | 87 | 85 | 96 | 203K | 91 | |
| Qwen: Qwen3 Coder Next | $13.60 | 62 | 93 | 73 | 65 | 85 | 262K | 81 | |
| Claude Sonnet 4.6 | $270.00 | 56 | 93 | 75 | 70 | 96 | 1.0M | 85 | |
| Qwen: Qwen3 Max | $70.20 | 61 | 93 | 87 | 70 | 89 | 262K | 89 | |
| Grok 3 | $270.00 | 61 | 93 | 93 | 55 | 96 | 131K | 94 | |
| Z.ai: GLM 5.1 | $77.00 | 61 | 92 | 89 | 70 | 85 | 203K | 89 | |
| Mistral: Mistral Small 3.2 24B | $5.00 | 69 | 92 | 83 | 85 | 69 | 128K | 82 | |
| Mistral: Mistral Medium 3 | $36.00 | 63 | 92 | 87 | 70 | 91 | 131K | 89 | |
| OpenAI: GPT-5.5 | $500.00 | 55 | 92 | 88 | 55 | 71 | 1.1M | 85 | |
| OpenAI: GPT-5.1 Chat | $150.00 | 58 | 91 | 88 | 85 | 77 | 128K | 86 | |
| AllenAI: Olmo 3 32B Think | $11.00 | 67 | 90 | 84 | 50 | 88 | 66K | 87 | |
| DeepSeek: DeepSeek V4 Flash | $8.40 | 67 | 90 | 83 | 95 | 80 | 1.0M | 84 | |
| Elephant | Free | 78 | 90 | 83 | 70 | 88 | 262K | 86 | |
| OpenAI: GPT-5 Mini | $30.00 | 58 | 90 | 78 | 85 | 74 | 400K | 80 | |
| MoonshotAI: Kimi K2 0711 | $45.80 | 60 | 90 | 87 | 60 | 80 | 131K | 86 | |
| xAI: Grok 3 Mini Beta | $17.00 | 63 | 90 | 87 | 95 | 77 | 131K | 85 | |
| Amazon: Nova Premier 1.0 | $225.00 | 57 | 89 | 89 | 55 | 77 | 1.0M | 86 | |
| Xiaomi: MiMo-V2.5 | $36.00 | 57 | 89 | 75 | 85 | 75 | 1.0M | 78 | |
| GLM 5 Turbo | $88.00 | 57 | 88 | 91 | 70 | 60 | 203K | 83 | |
| Z.ai: GLM 5V Turbo | $88.00 | 58 | 88 | 83 | 70 | 80 | 203K | 84 | |
| xAI: Grok 4.20 | $140.00 | 57 | 88 | 87 | 55 | 76 | 2.0M | 85 | |
| xAI: Grok 3 Mini | $17.00 | 63 | 88 | 86 | 92 | 76 | 131K | 84 | |
| Mistral: Mistral Small Creative | $7.00 | 66 | 88 | 83 | 90 | 69 | 33K | 81 | |
| Pareto Code Router | VARIABLE | 74 | 88 | 73 | 85 | 80 | 200K | 78 | |
| Meta: Llama 3.3 70B Instruct | $7.20 | 69 | 88 | 89 | 70 | 77 | 131K | 86 | |
| Qwen: Qwen3 VL 32B Instruct | $8.32 | 69 | 88 | 88 | 65 | 87 | 131K | 88 | |
| xAI: Grok 4 Fast | $13.00 | 63 | 88 | 83 | 85 | 76 | 2.0M | 82 | |
| Mistral: Saba | $14.00 | 60 | 88 | 77 | 85 | 69 | 33K | 78 | |
| Mistral: Mistral Small 4 | $12.00 | 63 | 88 | 83 | 55 | 69 | 262K | 81 | |
| ByteDance Seed: Seed 1.6 Flash | $6.00 | 66 | 87 | 76 | 85 | 77 | 262K | 79 | |
| Mistral Large 2411 | $140.00 | 56 | 87 | 85 | 70 | 72 | 131K | 82 | |
| xAI: Grok 4 | $270.00 | 55 | 87 | 80 | 70 | 87 | 256K | 83 | |
| Mistral: Mistral Large 3 2512 | $35.00 | 59 | 87 | 85 | 60 | 72 | 262K | 82 | |
| OpenAI: GPT-5.4 Image 2 | $470.00 | 53 | 86 | 87 | 70 | 65 | 272K | 81 | |
| OpenAI: GPT-5.4 | $250.00 | 52 | 86 | 79 | 55 | 64 | 1.1M | 77 | |
| OpenAI: GPT-4.1 | $160.00 | 55 | 86 | 87 | 55 | 65 | 1.0M | 81 | |
| OpenAI: GPT-5 | $150.00 | 54 | 86 | 78 | 55 | 76 | 400K | 79 | |
| OpenAI: GPT-5.3 Chat | $210.00 | 53 | 85 | 86 | 70 | 61 | 128K | 79 | |
| Qwen: Qwen3.5 397B A17B | $39.00 | 62 | 85 | 89 | 60 | 92 | 262K | 89 | |
| ByteDance Seed: Seed-2.0-Mini | $8.00 | 60 | 85 | 66 | 85 | 70 | 262K | 72 | |
| Upstage: Solar Pro 3 | $12.00 | 66 | 85 | 89 | 65 | 80 | 128K | 86 | |
| OpenAI: GPT-5.2 | $210.00 | 57 | 85 | 89 | 65 | 78 | 400K | 85 | |
| Relace: Relace Search | $70.00 | 52 | 85 | 73 | 95 | 65 | 256K | 74 |
Need a shareable artifact?
Download a print-ready PDF from the leaderboard and workload above. No email step—lead capture is off.
PDF Breakdown
Receive a comprehensive native vector PDF of this leaderboard: your workload, filters, top rankings, and a table snapshot (sorted: Coding).
By submitting, you agree to our Privacy Policy and Terms.
Whitelabel Coding Leaderboard
for your site
Embed the interactive coding view on your own domain — whitelabel branding, lead capture, and the same workload sliders your prospects already use on LeadsCalc.