Model Benchmark

Every LLM,
compared.

309 models across 50 providers. Pricing, context windows, and capabilities, updated daily.

Last synced Jun 24, 2026, 08:16 PM

Compare Models

Compare

Popular comparisons

GPT 5.4vsClaude Opus 4.6 Claude Sonnet 4.6vsGPT 5.1 Claude Sonnet 4vsClaude Opus 4.6 Deepseek V3.2vsKimi K2.5 Deepseek V3.2vsQwen3 235b A22b Qwen3 235b A22bvsClaude Sonnet 4

All Models

⌘K

309 models

Provider	Model	Input $/1M	Output $/1M	Context	Max Output	Quality
Inclusionai	Ling 2.6 Flash	$0.0100	$0.0300	262.1K	32.8K	N/A
Ibm-granite	Granite 4.0 H Micro	$0.0170	$0.1120	131K	131K	36.3Good
Meta	Llama 3.1 8b Instruct	$0.0200	$0.0300	131.1K	16.4K	N/A
Mistral	Mistral Nemo	$0.0200	$0.0300	131.1K	—	N/A
Meta	Llama 3.2 1b Instruct	$0.0270	$0.2010	131.1K	60K	N/A
OpenAI	GPT Oss 20b	$0.0290	$0.1400	131.1K	—	N/A
Liquid	Lfm 2 24b A2b	$0.0300	$0.1200	128K	—	N/A
Amazon	Nova Micro V1	$0.0350	$0.1400	128K	5.1K	N/A
Cohere	Command R7b 12 2024	$0.0375	$0.1500	128K	4K	N/A
OpenAI	GPT Oss 120b	$0.0390	$0.1800	131.1K	—	47.7Good
Qwen	Qwen 2.5 7b Instruct	$0.0400	$0.1000	131.1K	32.8K	N/A
Sao10k	L3 Lunaris 8b	$0.0400	$0.0500	8.2K	16.4K	N/A
Arcee-ai	Trinity Mini	$0.0450	$0.1500	131.1K	131.1K	N/A
Qwen	Qwen3 30b A3b Instruct 2507	$0.0482	$0.1931	131.1K	32K	38.2Good
Ibm-granite	Granite 4.1 8b	$0.0500	$0.1000	131.1K	131.1K	N/A
NVIDIA	Nemotron 3 Nano 30b A3b	$0.0500	$0.2000	262.1K	228K	N/A
OpenAI	GPT 5 Nano	$0.0500	$0.4000	400K	—	49.9Good
Qwen	Qwen3 8b	$0.0500	$0.4000	131.1K	8.2K	N/A
Google	Gemma 3 4b It	$0.0500	$0.1000	131.1K	16.4K	N/A
Google	Gemma 3 12b It	$0.0500	$0.1500	131.1K	16.4K	N/A
Mistral	Mistral Small 24b Instruct 2501	$0.0500	$0.0800	32.8K	16.4K	N/A
Meta	Llama 3.2 3b Instruct	$0.0509	$0.3350	131.1K	80K	N/A
Google	Gemma 4 26b A4b It	$0.0600	$0.3300	262.1K	—	N/A
xAI	Glm 4.7 Flash	$0.0600	$0.4000	202.8K	16.4K	N/A
Google	Gemma 3n E4b It	$0.0600	$0.1200	32.8K	—	N/A
Amazon	Nova Lite V1	$0.0600	$0.2400	300K	5.1K	N/A
Gryphe	Mythomax L2 13b	$0.0600	$0.0600	4.1K	4.1K	N/A
Tencent	Hy3 Preview	$0.0630	$0.2100	262.1K	—	N/A
Qwen	Qwen3.5 Flash 02 23	$0.0650	$0.2600	1M	65.5K	N/A
Qwen	Qwen3 Coder 30b A3b Instruct	$0.0700	$0.2700	160K	32.8K	N/A
Microsoft	Phi 4	$0.0700	$0.1400	16.4K	16.4K	N/A
Inclusionai	Ring 2.6 1t	$0.0750	$0.6250	262.1K	65.5K	N/A
Inclusionai	Ling 2.6 1t	$0.0750	$0.6250	262.1K	32.8K	N/A
Bytedance-seed	Seed 1.6 Flash	$0.0750	$0.3000	262.1K	32.8K	N/A
OpenAI	GPT Oss Safeguard 20b	$0.0750	$0.3000	131.1K	65.5K	N/A
Mistral	Mistral Small 3.2 24b Instruct	$0.0750	$0.2000	128K	16.4K	N/A
Microsoft	Phi 4 Mini Instruct	$0.0800	$0.3500	131.1K	128K	N/A
Qwen	Qwen3 Vl 8b Instruct	$0.0800	$0.5000	256K	32.8K	N/A
Qwen	Qwen3 30b A3b Thinking 2507	$0.0800	$0.4000	131.1K	131.1K	38.2Good
Qwen	Qwen3 32b	$0.0800	$0.2800	131.1K	16.4K	42.2Good
Google	Gemma 3 27b It	$0.0800	$0.1600	131.1K	16.4K	N/A
DeepSeek	Deepseek V4 Flash	$0.0900	$0.1800	1.0M	65.5K	68.9Frontier
NVIDIA	Nemotron 3 Super 120b A12b	$0.0900	$0.4500	1M	—	N/A
Stepfun	Step 3.5 Flash	$0.0900	$0.3000	262.1K	16.4K	N/A
Qwen	Qwen3 Next 80b A3b Instruct	$0.0900	$1.10	262.1K	16.4K	50.7Strong
Qwen	Qwen3 235b A22b 2507	$0.0900	$0.1000	262.1K	16.4K	52.2Strong
Qwen	Qwen3 Next 80b A3b Thinking	$0.0975	$0.7800	262.1K	32.8K	50.7Strong
Poolside	Laguna Xs.2	$0.1000	$0.2000	262.1K	32.8K	N/A
Rekaai	Reka Edge	$0.1000	$0.1000	16.4K	16.4K	N/A
Qwen	Qwen3.5 9b	$0.1000	$0.1500	262.1K	262.1K	N/A

Page 1 of 7

Cost per 1M Tokens / Flagship Models

Input

Output

Models by Use Case

230 models

Coding Models

Top models for code generation, debugging, and software engineering — ranked by LiveBench coding scores and API price.

88 models

Writing Models

Models with the highest language and writing scores for content creation, copywriting, marketing, and creative tasks.

4 models

Reasoning Models

Models with extended thinking for complex math, logic, and multi-step problem solving — ranked by benchmarks.

227 models

Models for AI Agents

Models built for autonomous AI agents — with function calling, large context windows, and competitive pricing for agentic workflows.

204 models

Budget Models

The most affordable LLMs under $1/1M input tokens for high-volume apps, prototyping, and cost-sensitive production.

121 models

Affordable Models

Mid-range models that balance quality and cost — under $5/1M input tokens with strong benchmark scores.

30 models

Premium Models

Top-tier frontier models from leading providers — maximum capability for demanding tasks.

82 models

Best Value Models

Models with the best quality-to-price ratio — high benchmark scores at affordable prices.

158 models

Vision Models

Models that process images alongside text for visual analysis, OCR, and multimodal tasks.

272 models

Long Context Models

Models with 100K+ token context windows for processing long documents, codebases, and conversations.

6 models

Models with Caching

Models that support prompt caching for reduced costs on repeated or similar requests.

158 models

Multimodal Models

Models that handle multiple input types including text, images, and audio.

Top Providers

62 models

All Providers

Deploy your AI agent for $33/mo flat.

Managed Telegram bot hosting. We handle the infrastructure.

Any model. Any channel. Zero infrastructure.

Deploy Your Agent

Every LLM,compared.