Best Open-Source LLMs 2026: Open-Weight Models vs. Hosted API Cost
Compare open-weight LLMs in 2026 for self-hosting and dedicated deployments, with pricing context for hosted variants. For teams in the US, Canada, and Australia evaluating open vs. proprietary APIs.
Open-source friendly LLM comparison with TCO in mind in 2026
Open-weight models can unlock on-prem and dedicated-cloud strategies for residency-sensitive workloads. This tab filters the open-weight subset so platform teams in the United States, Canada, and Australia can compare capability signals while still grounding decisions in realistic engineering and GPU spend—not list hype alone.
Workload & pricing toggles
Same three scenarios as the main AI API calculator: moderate traffic, large RAG-style context, or per-request max tokens with a lower request count.
Include Vision / Image Processing
Off — no image fees in cost estimates for vision-capable models.
Turn On to include image fees.
Use Cached Pricing
Enable to get 50% off input tokens where cached rates apply
Deep Reasoning / Thinking Mode
Model hidden reasoning / extended thinking charged like output tokens when enabled.
Batch Pricing
Enable for 50% off input & output where batch/async pricing applies
Cached / batch est. monthly values only change after the pipeline sets supports_caching or supports_batch in Supabase. The toggles here narrow the table to models whose catalog or provider typically supports those modes.
Magic quadrant (top 15)
X: est. monthly · Y: Open-weight · Dot: provider color · Hover for rank, model & detailsFull leaderboard
Showing 48 of 74 models (open-weight / self-hostable catalog hints).
| Pick | Model | Est. monthly | ROI score | Coding | Reasoning | Speed | Math | Context | Overall |
|---|---|---|---|---|---|---|---|---|---|
| Google: Gemma 4 31B | $9.00 | 72 | 97 | 92 | 70 | 97 | 262K | 94 | |
| Qwen: Qwen3.5 397B A17B | $39.00 | 62 | 85 | 89 | 60 | 92 | 262K | 89 | |
| Qwen: Qwen3 32B | $5.60 | 73 | 85 | 89 | 60 | 92 | 41K | 89 | |
| Qwen: Qwen3 Max | $70.20 | 61 | 93 | 87 | 70 | 89 | 262K | 89 | |
| Mistral: Mistral Medium 3 | $36.00 | 63 | 92 | 87 | 70 | 91 | 131K | 89 | |
| Qwen: Qwen3.5-27B | $23.40 | 64 | 80 | 91 | 70 | 92 | 262K | 88 | |
| Qwen: Qwen3 VL 32B Instruct | $8.32 | 69 | 88 | 88 | 65 | 87 | 131K | 88 | |
| Qwen: Qwen3.5-35B-A3B | $19.50 | 64 | 76 | 89 | 70 | 95 | 262K | 87 | |
| Qwen: Qwen3.5-122B-A10B | $31.20 | 62 | 81 | 90 | 55 | 85 | 262K | 87 | |
| Meta: Llama 3.3 70B Instruct | $7.20 | 69 | 88 | 89 | 70 | 77 | 131K | 86 | |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | $8.00 | 66 | 74 | 79 | 70 | 97 | 131K | 82 | |
| Mistral Large 2411 | $140.00 | 56 | 87 | 85 | 70 | 72 | 131K | 82 | |
| Mistral: Mistral Small 3.2 24B | $5.00 | 69 | 92 | 83 | 85 | 69 | 128K | 82 | |
| Mistral: Mistral Large 3 2512 | $35.00 | 59 | 87 | 85 | 60 | 72 | 262K | 82 | |
| Qwen: Qwen3 Coder Next | $13.60 | 62 | 93 | 73 | 65 | 85 | 262K | 81 | |
| Mistral: Mistral Small Creative | $7.00 | 66 | 88 | 83 | 90 | 69 | 33K | 81 | |
| DeepSeek: DeepSeek V4 Pro | $104.40 | 56 | 67 | 80 | 55 | 97 | 1.0M | 81 | |
| Mistral: Mistral Small 4 | $12.00 | 63 | 88 | 83 | 55 | 69 | 262K | 81 | |
| Qwen: Qwen3 30B A3B | $6.00 | 67 | 85 | 79 | 60 | 82 | 41K | 81 | |
| Qwen: Qwen3.5-9B | $5.50 | 68 | 65 | 87 | 70 | 83 | 262K | 80 | |
| Qwen: Qwen3 Coder Plus | $58.50 | 56 | 85 | 78 | 60 | 78 | 1.0M | 80 | |
| Qwen: Qwen3.6 Plus | $32.50 | 57 | 79 | 79 | 65 | 78 | 1.0M | 79 | |
| DeepSeek: DeepSeek V3.2 | $13.86 | 61 | 67 | 84 | 55 | 82 | 131K | 79 | |
| Qwen: Qwen3 235B A22B Instruct 2507 | $3.84 | 70 | 73 | 83 | 55 | 78 | 262K | 79 | |
| Mistral: Mistral 7B Instruct v0.1 | $6.30 | 65 | 85 | 79 | 90 | 70 | 3K | 78 | |
| Mistral: Saba | $14.00 | 60 | 88 | 77 | 85 | 69 | 33K | 78 | |
| DeepSeek: DeepSeek V3.1 Terminus | $16.30 | 58 | 68 | 78 | 65 | 75 | 164K | 75 | |
| Qwen: Qwen3 Coder 480B A35B | $18.80 | 57 | 85 | 73 | 60 | 65 | 262K | 74 | |
| Qwen: Qwen3 14B | $4.80 | 59 | 52 | 72 | 70 | 58 | 41K | 63 | |
| Mistral: Mistral Medium 3.1 | $36.00 | 41 | 40 | 62 | 70 | 38 | 131K | 51 | |
| Qwen: Qwen3 Coder 30B A3B Instruct | $5.50 | 47 | 52 | 44 | 55 | 31 | 160K | 43 | |
| Meta: Llama 3.1 70B Instruct | $20.00 | 28 | 0 | 0 | 55 | 95 | 131K | 24 | |
| DeepSeek: DeepSeek V3 0324 | $15.70 | 16 | 0 | 0 | 55 | 0 | 164K | 0 | |
| Qwen: Qwen3.5 Plus 2026-02-15 | $26.00 | 13 | 0 | 0 | 60 | 0 | 1.0M | 0 | |
| Qwen: Qwen Plus 0728 | $18.20 | 15 | 0 | 0 | 70 | 0 | 1.0M | 0 | |
| Qwen: Qwen3 VL 235B A22B Instruct | $16.80 | 15 | 0 | 0 | 0 | 0 | 262K | 0 | |
| Meta: Llama 4 Scout | $6.20 | 4 | 0 | 0 | 0 | 0 | 328K | 0 | |
| Mistral: Mistral Small 3.1 24B | $19.60 | 3 | 0 | 0 | 0 | 0 | 128K | 0 | |
| DeepSeek: DeepSeek V3.2 Speciale | $28.00 | 3 | 0 | 0 | 0 | 0 | 164K | 0 | |
| Qwen: Qwen VL Max | $41.60 | 2 | 0 | 0 | 0 | 0 | 131K | 0 | |
| Meta: Llama 4 Maverick | $12.00 | 3 | 0 | 0 | 0 | 0 | 1.0M | 0 | |
| Qwen: Qwen3 VL 30B A3B Instruct | $10.40 | 18 | 0 | 0 | 70 | 0 | 131K | 0 | |
| Mistral: Mistral Nemo | $0.70 | 9 | 0 | 0 | 0 | 0 | 131K | 0 | |
| Qwen: Qwen VL Plus | $9.56 | 18 | 0 | 0 | 55 | 0 | 131K | 0 | |
| Qwen2.5 72B Instruct | $8.70 | 4 | 0 | 0 | 0 | 0 | 33K | 0 | |
| AlfredPros: CodeLLaMa 7B Instruct Solidity | $44.00 | 2 | 0 | 0 | 0 | 0 | 4K | 0 | |
| Qwen: Qwen-Plus | $18.20 | 3 | 0 | 0 | 0 | 0 | 1.0M | 0 | |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct | $60.00 | 2 | 0 | 0 | 0 | 0 | 131K | 0 |
Need a shareable artifact?
Download a print-ready PDF from the leaderboard and workload above. No email step—lead capture is off.
PDF Breakdown
Receive a comprehensive native vector PDF of this leaderboard: your workload, filters, top rankings, and a table snapshot (sorted: Open-weight).
By submitting, you agree to our Privacy Policy and Terms.
Whitelabel Open-weight Leaderboard
for your site
Embed the interactive open-weight view on your own domain — whitelabel branding, lead capture, and the same workload sliders your prospects already use on LeadsCalc.