Best Reasoning LLMs 2026: AI Models for Logic & Multi-Step Tasks
Find top reasoning LLMs in 2026: logic, instruction-following, native thinking modes, and estimated API cost. Ideal for agents, analytics, and complex workflows across the US, Canada, and Australia.
Reasoning-focused LLM rankings with transparent API economics in 2026
Reasoning models power agents, planning, and long chains of tool use, but they usually generate (and bill) more output tokens when “thinking” is enabled. This view ranks logic-heavy capability alongside the estimated monthly cost for your token mix, helping operators in the US, Canada, and Australia avoid surprise overages.
Workload & pricing toggles
Same three scenarios as the main AI API calculator: moderate traffic, large RAG-style context, or per-request max tokens with a lower request count.
Include Vision / Image Processing
Off — no image fees in cost estimates for vision-capable models.
Turn On to include image fees.
Use Cached Pricing
Enable to apply 50% off input tokens where cached rates apply.
Deep Reasoning / Thinking Mode
When enabled, estimates include hidden reasoning / extended thinking tokens, charged like output tokens.
Batch Pricing
Enable for 50% off input & output where batch/async pricing applies
Cached and batch est. monthly values change only after the pipeline sets supports_caching or supports_batch in Supabase. The toggles here narrow the table to models whose catalog entry or provider typically supports those modes.
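The toggle rules above can be sketched as a small estimator. This is a minimal illustration, not the calculator's actual implementation: the `Workload` and `Pricing` classes, the per-million-token rates, and the `thinking_tokens` field are hypothetical names, and the sketch assumes every input token gets the cached rate and that cached and batch discounts stack, neither of which the page confirms.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    requests_per_month: int
    input_tokens: int         # per request
    output_tokens: int        # per request, visible answer only
    thinking_tokens: int = 0  # per request, hidden reasoning (hypothetical field)


@dataclass
class Pricing:
    input_per_mtok: float     # USD per 1M input tokens
    output_per_mtok: float    # USD per 1M output tokens
    supports_caching: bool = False
    supports_batch: bool = False


def estimate_monthly(w, p, *, cached=False, thinking=False, batch=False):
    """Estimated monthly USD cost under the toggle rules described above."""
    input_rate = p.input_per_mtok
    output_rate = p.output_per_mtok
    # Cached pricing: 50% off input tokens, only where the model supports it.
    if cached and p.supports_caching:
        input_rate *= 0.5
    # Batch pricing: 50% off input and output, only where supported.
    # Assumption: this stacks with the cached discount.
    if batch and p.supports_batch:
        input_rate *= 0.5
        output_rate *= 0.5
    # Thinking mode: hidden reasoning tokens are charged like output tokens.
    billed_output = w.output_tokens + (w.thinking_tokens if thinking else 0)
    per_request = (w.input_tokens * input_rate + billed_output * output_rate) / 1_000_000
    return round(per_request * w.requests_per_month, 2)
```

With made-up rates of $1 input and $4 output per million tokens, 10,000 requests of 1,000 input and 500 output tokens come to $30.00 per month; adding 1,500 thinking tokens per request raises that to $90.00, which is why the thinking toggle tends to dominate the estimate.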
Magic quadrant (top 15)
X: est. monthly · Y: Reasoning · Dot: provider color · Hover for rank, model & details
Full leaderboard
Showing 48 of 327 models.
| Model | Est. monthly | ROI score | Coding | Reasoning | Speed | Math | Context | Overall |
|---|---|---|---|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $2.60 | 84 | 96 | 97 | 85 | 98 | 131K | 97 |
| Z.ai: GLM 5 | $44.80 | 64 | 99 | 93 | 55 | 84 | 203K | 92 |
| Grok 3 | $270.00 | 61 | 93 | 93 | 55 | 96 | 131K | 94 |
| Google: Gemma 4 31B | $9.00 | 72 | 97 | 92 | 70 | 97 | 262K | 94 |
| GLM 5 Turbo | $88.00 | 57 | 88 | 91 | 70 | 60 | 203K | 83 |
| Qwen: Qwen3.5-27B | $23.40 | 64 | 80 | 91 | 70 | 92 | 262K | 88 |
| Qwen: Qwen3.5-122B-A10B | $31.20 | 62 | 81 | 90 | 55 | 85 | 262K | 87 |
| Z.ai: GLM 5.1 | $77.00 | 61 | 92 | 89 | 70 | 85 | 203K | 89 |
| Qwen: Qwen3.5 397B A17B | $39.00 | 62 | 85 | 89 | 60 | 92 | 262K | 89 |
| Upstage: Solar Pro 3 | $12.00 | 66 | 85 | 89 | 65 | 80 | 128K | 86 |
| OpenAI: GPT-5.2 | $210.00 | 57 | 85 | 89 | 65 | 78 | 400K | 85 |
| Xiaomi: MiMo-V2-Pro | $70.00 | 59 | 85 | 89 | 60 | 80 | 1.0M | 86 |
| Qwen: Qwen3 32B | $5.60 | 73 | 85 | 89 | 60 | 92 | 41K | 89 |
| Arcee AI: Trinity Large Thinking | $17.30 | 64 | 85 | 89 | 55 | 80 | 262K | 86 |
| Baidu: ERNIE 4.5 21B A3B | $5.60 | 72 | 85 | 89 | 60 | 87 | 120K | 88 |
| Meta: Llama 3.3 70B Instruct | $7.20 | 69 | 88 | 89 | 70 | 77 | 131K | 86 |
| Deep Cogito: Cogito v2.1 671B | $62.50 | 59 | 85 | 89 | 60 | 80 | 128K | 86 |
| Qwen: Qwen3.5-35B-A3B | $19.50 | 64 | 76 | 89 | 70 | 95 | 262K | 87 |
| Amazon: Nova Premier 1.0 | $225.00 | 57 | 89 | 89 | 55 | 77 | 1.0M | 86 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $20.93 | 62 | 74 | 89 | 50 | 83 | 131K | 84 |
| Anthropic: Claude Opus Latest | $450.00 | 52 | 74 | 88 | 55 | 60 | 1.0M | 78 |
| OpenAI: GPT-5.1 Chat | $150.00 | 58 | 91 | 88 | 85 | 77 | 128K | 86 |
| OpenAI: GPT-5.5 | $500.00 | 55 | 92 | 88 | 55 | 71 | 1.1M | 85 |
| Qwen: Qwen3 VL 32B Instruct | $8.32 | 69 | 88 | 88 | 65 | 87 | 131K | 88 |
| xAI: Grok 4.20 | $140.00 | 57 | 88 | 87 | 55 | 76 | 2.0M | 85 |
| OpenAI: GPT-5.4 Image 2 | $470.00 | 53 | 86 | 87 | 70 | 65 | 272K | 81 |
| Z.ai: GLM 4.7 | $32.60 | 64 | 94 | 87 | 85 | 96 | 203K | 91 |
| Mistral: Mistral Medium 3 | $36.00 | 63 | 92 | 87 | 70 | 91 | 131K | 89 |
| OpenAI: GPT-4.1 | $160.00 | 55 | 86 | 87 | 55 | 65 | 1.0M | 81 |
| Qwen: Qwen3.5-9B | $5.50 | 68 | 65 | 87 | 70 | 83 | 262K | 80 |
| Qwen: Qwen3 Max | $70.20 | 61 | 93 | 87 | 70 | 89 | 262K | 89 |
| MoonshotAI: Kimi K2 0711 | $45.80 | 60 | 90 | 87 | 60 | 80 | 131K | 86 |
| xAI: Grok 3 Mini Beta | $17.00 | 63 | 90 | 87 | 95 | 77 | 131K | 85 |
| xAI: Grok 3 Mini | $17.00 | 63 | 88 | 86 | 92 | 76 | 131K | 84 |
| Tencent: Hunyuan A13B Instruct | $11.30 | 64 | 64 | 86 | 55 | 94 | 131K | 83 |
| OpenAI: GPT-5.3 Chat | $210.00 | 53 | 85 | 86 | 70 | 61 | 128K | 79 |
| OpenAI: GPT-5.4 Pro | $3,000.00 | 49 | 67 | 85 | 55 | 72 | 1.1M | 77 |
| AionLabs: Aion-1.0-Mini | $42.00 | 61 | 85 | 85 | 95 | 88 | 131K | 86 |
| OpenAI: GPT-5.2 Pro | $2,520.00 | 52 | 85 | 85 | 55 | 73 | 400K | 82 |
| Mistral Large 2411 | $140.00 | 56 | 87 | 85 | 70 | 72 | 131K | 82 |
| Mistral: Mistral Large 3 2512 | $35.00 | 59 | 87 | 85 | 60 | 72 | 262K | 82 |
| AllenAI: Olmo 3 32B Think | $11.00 | 67 | 90 | 84 | 50 | 88 | 66K | 87 |
| DeepSeek: DeepSeek V3.2 | $13.86 | 61 | 67 | 84 | 55 | 82 | 131K | 79 |
| Anthropic: Claude Opus 4.5 | $450.00 | 57 | 85 | 84 | 55 | 95 | 200K | 87 |
| NVIDIA: Nemotron Nano 9B V2 | $3.20 | 75 | 72 | 84 | 85 | 98 | 131K | 85 |
| Z.ai: GLM 5V Turbo | $88.00 | 58 | 88 | 83 | 70 | 80 | 203K | 84 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $3.84 | 70 | 73 | 83 | 55 | 78 | 262K | 79 |
| OpenAI: GPT-5.2 Chat | $210.00 | 54 | 85 | 83 | 90 | 73 | 128K | 81 |
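
For readers who want to narrow the leaderboard offline, here is a rough sketch of a shortlist filter over a handful of rows copied from the table. The `parse_context` and `shortlist` helpers are hypothetical, and the sketch assumes the Context column uses decimal K and M suffixes (so `131K` is read as 131,000 tokens), which the page does not state.

```python
def parse_context(text):
    """Convert a Context cell such as '131K' or '1.0M' to a token count.

    Assumption: K means 1,000 and M means 1,000,000.
    """
    units = {"K": 1_000, "M": 1_000_000}
    return round(float(text[:-1]) * units[text[-1]])


# (model, est. monthly USD, reasoning score, context) copied from the table.
ROWS = [
    ("OpenAI: gpt-oss-20b", 2.60, 97, "131K"),
    ("Z.ai: GLM 5", 44.80, 93, "203K"),
    ("Grok 3", 270.00, 93, "131K"),
    ("Google: Gemma 4 31B", 9.00, 92, "262K"),
    ("GLM 5 Turbo", 88.00, 91, "203K"),
    ("Qwen: Qwen3.5-27B", 23.40, 91, "262K"),
]


def shortlist(rows, min_reasoning, min_context, max_monthly):
    """Keep rows that meet all three constraints, cheapest first."""
    keep = [
        row for row in rows
        if row[2] >= min_reasoning
        and parse_context(row[3]) >= min_context
        and row[1] <= max_monthly
    ]
    # Sort by monthly cost, breaking ties by higher reasoning score.
    return sorted(keep, key=lambda row: (row[1], -row[2]))


for model, monthly, reasoning, context in shortlist(ROWS, 90, 200_000, 50.00):
    print(f"{model}: ${monthly:.2f}/mo, reasoning {reasoning}, context {context}")
```

The thresholds (reasoning of at least 90, at least 200,000 tokens of context, at most $50 per month) are arbitrary examples; swap in whatever your workload needs.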
Need a shareable artifact?
Download a print-ready PDF from the leaderboard and workload above. No email step—lead capture is off.
PDF Breakdown
Receive a comprehensive native vector PDF of this leaderboard: your workload, filters, top rankings, and a table snapshot (sorted: Reasoning).
Whitelabel Reasoning Leaderboard for your site
Embed the interactive reasoning view on your own domain — whitelabel branding, lead capture, and the same workload sliders your prospects already use on LeadsCalc.