Can I run these models commercially?

Licenses differ by base model. Review each vendor’s license and consult your counsel before shipping production workloads. This leaderboard does not provide legal advice.

Why are some open models missing?

We index models present in our autonomous benchmark pipeline and catalog. Request additions via support if a major open-weight SKU is absent.

Interactive leaderboard

Best Open-Source LLMs 2026: Open-Weight Models vs. Hosted API Cost

Compare open-weight LLMs in 2026 for self-hosting and dedicated deployments, with pricing context for hosted variants. For teams in the US, Canada, and Australia evaluating open vs. proprietary APIs.

Open-source friendly LLM comparison with TCO in mind in 2026

Open-weight models can unlock on-prem and dedicated-cloud strategies for residency-sensitive workloads. This tab filters the open-weight subset so platform teams in the United States, Canada, and Australia can compare capability signals while still grounding decisions in realistic engineering and GPU spend—not list hype alone.

Sort by: Est. monthly · ROI score · Coding · Reasoning · Speed · Math · Context · Overall · Open-weight

Workload & pricing toggles

Workload presets

Same three scenarios as the main AI API calculator: moderate traffic, large RAG-style context, or per-request max tokens with a lower request count.

Include Vision / Image Processing

Off: no image fees in cost estimates for vision-capable models. Turn on to include image fees.

Use Cached Pricing

Enable to apply a 50% discount on input tokens where cached rates apply.

Deep Reasoning / Thinking Mode

When enabled, the model's hidden reasoning (extended thinking) tokens are charged like output tokens.

Batch Pricing

Enable to apply a 50% discount on input and output tokens where batch/async pricing applies.

Input Tokens: ≈ $100.00/mo (slider range 1K to 1.0M)

Output Tokens: ≈ $100.00/mo (slider range 100 to 500K)

Monthly API Requests: ≈ $200.00 total (slider range 10 to 100K)

Estimated monthly values for cached and batch pricing change only after the pipeline sets supports_caching or supports_batch in Supabase. The toggles here narrow the table to models whose catalog entry or provider typically supports those modes.
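As a rough illustration of how the toggles above could combine into a monthly estimate, here is a minimal sketch. The names, the example prices, the reading of the token sliders as per-request values, and the way discounts stack are all assumptions for illustration, not the leaderboard's actual formula. Only the 50% cached-input discount, the 50% batch discount, and billing reasoning tokens as output come from the toggle descriptions.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    input_tokens: int          # per request (assumed reading of the slider)
    output_tokens: int         # per request (assumed reading of the slider)
    requests_per_month: int
    reasoning_tokens: int = 0  # hidden thinking tokens per request (hypothetical)


def estimate_monthly_cost(
    w: Workload,
    input_price_per_m: float,   # USD per 1M input tokens (hypothetical price)
    output_price_per_m: float,  # USD per 1M output tokens (hypothetical price)
    use_cache: bool = False,
    use_batch: bool = False,
    thinking: bool = False,
) -> float:
    """Return an estimated monthly spend in USD, rounded to cents."""
    input_rate = input_price_per_m
    output_rate = output_price_per_m
    if use_cache:
        input_rate *= 0.5       # 50% off input tokens where cached rates apply
    if use_batch:
        input_rate *= 0.5       # 50% off input and output for batch/async
        output_rate *= 0.5
    # Reasoning tokens are billed like output tokens when thinking mode is on.
    billed_output = w.output_tokens + (w.reasoning_tokens if thinking else 0)
    per_request = (w.input_tokens * input_rate + billed_output * output_rate) / 1_000_000
    return round(per_request * w.requests_per_month, 2)


workload = Workload(input_tokens=10_000, output_tokens=1_000, requests_per_month=1_000)
baseline = estimate_monthly_cost(workload, 1.0, 2.0)
cached = estimate_monthly_cost(workload, 1.0, 2.0, use_cache=True)
batched = estimate_monthly_cost(workload, 1.0, 2.0, use_batch=True)
```

Whether cached and batch discounts can be combined on a given provider varies, so treat the stacking here as a simplification.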

Magic quadrant (top 15)

X: est. monthly · Y: Open-weight · Dot: provider color · Hover for rank, model & details

Full leaderboard

Showing 48 of 82 models (open-weight / self-hostable catalog hints).

Model	Est. monthly	ROI score	Coding	Reasoning	Speed	Math	Context	Overall
DeepSeek: DeepSeek V3.2 Speciale	$15.79	70	95	97	45	96	164K	96
Qwen: Qwen3.6 Max Preview	$104.00	64	96	95	55	97	262K	96
Qwen: Qwen3.5-122B-A10B	$31.20	67	95	94	45	95	262K	95
DeepSeek: DeepSeek V3.1 Terminus	$20.80	67	95	93	70	92	164K	93
Qwen: Qwen3 Max	$70.20	63	96	91	55	95	262K	93
DeepSeek: DeepSeek V4 Pro	$26.10	66	98	91	70	88	1.0M	92
DeepSeek: DeepSeek V3.1	$19.50	67	90	93	75	92	164K	92
Qwen: Qwen3 235B A22B Instruct 2507	$9.10	71	92	93	70	92	262K	92
Nex AGI: DeepSeek V3.1 Nex N1	$10.40	70	96	90	70	90	131K	92
Qwen: Qwen3.6 Plus	$32.50	65	95	91	70	92	1.0M	92
DeepSeek V3.2	$14.76	67	92	89	45	95	164K	91
Qwen3.6 35B A3B	$15.60	67	92	90	85	90	262K	91
Qwen: Qwen3.5-35B-A3B	$15.60	67	85	90	90	95	262K	90
Mistral Large (2512)	$35.00	63	90	89	65	92	262K	90
Qwen: Qwen3.5-27B	$23.40	65	88	92	85	90	262K	90
Qwen: Qwen3 Coder Plus	$58.50	61	92	88	60	88	1.0M	89
Qwen: Qwen3.7 Max	$103.25	60	88	89	60	90	1.0M	89
Gemma 4 31B	$9.60	68	78	89	45	95	262K	88
Qwen: Qwen3 VL 235B A22B Instruct	$27.40	63	85	88	65	85	262K	87
Meta: Llama 3.3 70B Instruct	$9.20	67	85	86	75	85	131K	86
Qwen: Qwen3.6 27B	$35.56	61	82	90	80	80	262K	86
Mistral Large 2411	$140.00	57	85	85	65	85	131K	85
Qwen: Qwen3 Next 80B A3B Instruct	$15.00	64	88	84	65	82	262K	85
Nous: Hermes 3 405B Instruct	$50.00	60	87	83	60	85	131K	85
Mistral Large 2407	$140.00	57	82	85	65	88	131K	85
Mistral: Mistral Medium 3	$36.00	60	80	83	75	90	131K	84
Qwen: Qwen3 32B	$6.00	69	78	85	70	88	131K	84
DeepSeek: DeepSeek V3 0324	$22.00	61	84	87	65	75	164K	83
Qwen: Qwen3.5 397B A17B	$39.00	59	82	84	65	82	262K	83
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5	$20.00	62	75	80	70	95	131K	83
Mistral: Mistral Medium 3.5	$135.00	56	90	80	70	80	262K	83
Qwen: Qwen3.5-9B	$5.50	69	70	87	85	83	262K	82
Llama 4 Maverick	$16.00	62	80	87	75	75	1.0M	82
Gemma 4 26B A4B	$8.30	66	77	81	90	88	262K	82
Qwen: Qwen3 Coder 480B A35B	$22.00	61	92	85	70	65	262K	82
Qwen: Qwen3.7 Plus	$25.60	59	80	81	80	82	1.0M	81
Qwen: Qwen3.5 Plus 2026-04-20	$30.00	59	80	83	70	80	1.0M	81
Qwen2.5 72B Instruct	$18.40	61	82	77	65	88	33K	81
Qwen3 235B A22B	$36.40	58	65	83	50	88	131K	80
Qwen: Qwen2.5 VL 72B Instruct	$42.00	57	82	75	65	88	128K	80
Qwen: Qwen3 VL 30B A3B Instruct	$12.00	62	75	86	90	75	262K	80
Qwen: Qwen VL Max	$41.60	57	75	80	60	80	131K	79
Qwen: Qwen-Plus	$18.20	60	74	83	80	75	1.0M	79
Meta: Llama 3.1 70B Instruct	$20.00	59	80	84	70	68	131K	79
Qwen: Qwen3 Coder Next	$12.40	61	85	72	95	85	262K	78
Qwen: Qwen3.5 Plus 2026-02-15	$26.00	57	75	79	80	75	1.0M	77
Qwen: Qwen3 VL 32B Instruct	$8.32	62	75	78	70	75	131K	76
Mistral: Mistral Small 3	$2.80	71	75	75	90	75	33K	75

Need a shareable artifact?

Get a print-ready PDF of your results and a CSV spreadsheet. Tap the button, then enter your work email. We use it to build your files and start the download—and to email you a copy if the site owner enabled that.

AI ROI Leaderboard & Discovery by LeadsCalc

Detailed analysis

PDF Breakdown

Receive a comprehensive native vector PDF of this leaderboard: your workload, filters, top rankings, and a table snapshot (sorted: Open-weight).


Agency accelerator

Whitelabel Open-weight Leaderboard for your site

Embed the interactive open-weight view on your own domain — whitelabel branding, lead capture, and the same workload sliders your prospects already use on LeadsCalc.


How it works

Methodology: How we rank Open-Weight LLMs

Transparent, benchmark-driven rankings—same craft as our single-model deep dives.

What “open-weight” means on this leaderboard

Open-weight rankings filter to models tagged as open-weight / self-hostable in our qualitative catalog, then score within that subset. This helps teams avoiding proprietary API lock-in—common among startups and enterprises in the US, Canada, and Australia evaluating on-prem or dedicated cloud deployments.
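The filter-then-rank idea can be sketched as follows. The `open_weight` field, the `Model` class, and the choice to order by Overall with estimated cost as a tie-break are assumptions for illustration; the actual scoring is not described on this page. The three open-weight rows are copied from the leaderboard table, and the fourth row is a made-up placeholder.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    est_monthly_usd: float
    overall: int
    open_weight: bool  # hypothetical catalog tag


CATALOG = [
    Model("Mistral: Mistral Small 3", 2.80, 75, True),
    Model("DeepSeek: DeepSeek V3.2 Speciale", 15.79, 96, True),
    Model("Qwen: Qwen3 235B A22B Instruct 2507", 9.10, 92, True),
    # Placeholder row with made-up values, only to show the filter at work.
    Model("example-proprietary-model", 50.00, 99, False),
]


def rank_open_weight(catalog):
    """Keep open-weight models, then sort by Overall (high first), cheaper first on ties."""
    subset = [m for m in catalog if m.open_weight]
    return sorted(subset, key=lambda m: (-m.overall, m.est_monthly_usd))


ranked_names = [m.name for m in rank_open_weight(CATALOG)]
```

The placeholder never appears in the result even though its Overall value is highest, because ranking happens only inside the filtered subset.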

Battle Arena

Compare up to four LLMs side by side

Tick up to four models in the leaderboard table, then open Battle Arena for API pricing, benchmarks, and workload math in one view—perfect when you are shortlisting vendors for a pilot in the US, Canada, or Australia.

Prefer a head start? Jump into high-intent comparisons people search for every day—same interactive calculator, zero signup.


Signals & spend

Value analysis

Benchmarks vs. estimated API cost—read the story your CFO cares about.

Hosted API vs. self-host: how to read the trade-offs

Self-hosting shifts cost from tokens to hardware, ops, and reliability. Use hosted estimates here as a directional anchor, then build your TCO model. Australian and Canadian buyers often start with sovereignty requirements; US buyers may weigh velocity of managed APIs against control.

Production deployment

Data Sovereignty & Custom Fine-Tuning

How teams in the US, Canada, and Australia deploy these models in production.

Air-gapped enterprise, HIPAA/SOC2 compliance, and domain adaptation

Open-weight models are often non-negotiable for organizations with strict data residency requirements. Healthcare providers in the US, government agencies in Canada, and financial institutions in Australia deploy these models in air-gapped VPCs to ensure zero data retention by third parties. They also serve as the foundation for LoRA fine-tuning, which lets teams bake proprietary domain knowledge directly into the model weights.
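One reason LoRA is a common route to domain adaptation is that it trains two small low-rank matrices per adapted layer instead of the full weight matrix. The arithmetic below is a minimal sketch of that parameter saving; the 4096 x 4096 layer size and rank 8 are hypothetical example values, not figures from any model on this leaderboard.

```python
def full_finetune_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when updating a full d_out x d_in weight matrix."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for a LoRA update B @ A, with B: d_out x rank and A: rank x d_in."""
    return rank * (d_out + d_in)


# Hypothetical single projection layer and adapter rank.
full = full_finetune_params(4096, 4096)
lora = lora_params(4096, 4096, rank=8)
fraction = lora / full
```

For this example layer the adapter trains well under 1% of the parameters a full update would touch, which is why the GPU budget for adaptation can be much smaller than for full fine-tuning.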

Architecture

Self-Hosting vs Managed Endpoints

Strategies to reduce monthly API spend without sacrificing capability.

Total Cost of Ownership (TCO) and GPU provisioning

While the weights are free, GPU compute is not. Teams must weigh the Total Cost of Ownership (TCO) of provisioning AWS or Azure instances against using managed serverless endpoints. This leaderboard displays the API cost of hosted open-weight models, providing a baseline to determine if your token volume justifies the engineering overhead of managing your own vLLM or Ollama infrastructure.
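A simple way to frame that decision is a break-even calculation: the monthly token volume at which a fixed self-hosting bill equals the hosted API bill. The sketch below assumes a flat blended API price per million tokens, a fixed GPU hourly rate, and a lump-sum monthly ops cost. All names and numbers are hypothetical, and it ignores whether the GPUs can actually serve that volume.

```python
def self_host_monthly_usd(
    gpu_hourly_usd: float,
    gpu_count: int,
    ops_monthly_usd: float,
    hours_per_month: float = 730.0,  # average hours in a month
) -> float:
    """Fixed monthly cost of running your own inference fleet."""
    return gpu_hourly_usd * gpu_count * hours_per_month + ops_monthly_usd


def breakeven_tokens_per_month(self_host_usd: float, api_price_per_m_tokens: float) -> float:
    """Token volume at which hosted API spend equals the self-host bill."""
    return self_host_usd / api_price_per_m_tokens * 1_000_000


def self_host_is_cheaper(tokens_per_month: float, self_host_usd: float,
                         api_price_per_m_tokens: float) -> bool:
    """True when the hosted API bill would exceed the fixed self-host cost."""
    api_usd = tokens_per_month / 1_000_000 * api_price_per_m_tokens
    return api_usd > self_host_usd


# Hypothetical inputs: 4 GPUs at $2.00/hour, $2,000/month ops, $1.00 per 1M tokens.
fleet_cost = self_host_monthly_usd(2.0, 4, 2_000.0)
breakeven = breakeven_tokens_per_month(fleet_cost, 1.0)
```

Below the break-even volume the hosted estimate on this leaderboard is the cheaper path. Above it, self-hosting starts to pay off only if throughput, reliability, and staffing assumptions hold.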

Embed-ready

Need this live Open-Weight data on your website?

Join 500+ agencies in the US and Australia using LeadsCalc to capture high-intent leads. Embed this interactive Open-Weight leaderboard on your site in about a minute—Canadian teams use the same flows for CAD-priced proposals and compliance-friendly landing pages.


Live preview

Your visitors compare Open-Weight models without leaving your domain.

Support & clarity

Frequently Asked Questions

Focused on teams in the United States, Canada, and Australia.

Is open-weight always cheaper than OpenAI?

Not automatically. You must add GPU, ops, and engineering time. Compare hosted API estimates here against your self-host TCO; Australian and Canadian teams sometimes prefer open-weight for data sovereignty.


1. Is open-weight always cheaper than OpenAI?

2. Can I run these models commercially?

3. Why are some open models missing?
