Model Benchmark

Every LLM, compared.

335 models across 52 providers. Pricing, context windows, and capabilities, updated daily.

Last synced May 8, 2026, 08:06 PM

All Models

335 models
Provider       | Model                           | Input $/1M | Output $/1M | Context | Quality
---------------+---------------------------------+------------+-------------+---------+--------------
Ibm-granite    | Granite 4.0 H Micro             | $0.0170    | $0.1100     | 131K    | 36.3 Good
Meta           | Llama 3.1 8b Instruct           | $0.0200    | $0.0500     | 16.4K   | 46.6 Good
Mistral        | Mistral Nemo                    | $0.0200    | $0.0300     | 131.1K  | N/A
Meta           | Llama 3.2 1b Instruct           | $0.0270    | $0.2000     | 60K     | 28.6 Economy
Liquid         | Lfm 2 24b A2b                   | $0.0300    | $0.1200     | 32.8K   | N/A
OpenAI         | GPT Oss 20b                     | $0.0300    | $0.1400     | 131.1K  | 65.6 Frontier
Qwen           | Qwen Turbo                      | $0.0325    | $0.1300     | 131.1K  | N/A
Amazon         | Nova Micro V1                   | $0.0350    | $0.1400     | 128K    | N/A
Cohere         | Command R7b 12 2024             | $0.0375    | $0.1500     | 128K    | N/A
OpenAI         | GPT Oss 120b                    | $0.0390    | $0.1800     | 131.1K  | 47.7 Good
Qwen           | Qwen3.5 9b                      | $0.0400    | $0.1500     | 262.1K  | N/A
NVIDIA         | Nemotron Nano 9b V2             | $0.0400    | $0.1600     | 131.1K  | N/A
Google         | Gemma 3 4b It                   | $0.0400    | $0.0800     | 131.1K  | 63.1 Strong
Google         | Gemma 3 12b It                  | $0.0400    | $0.1300     | 131.1K  | 70.0 Frontier
Qwen           | Qwen 2.5 7b Instruct            | $0.0400    | $0.1000     | 32.8K   | N/A
Sao10k         | L3 Lunaris 8b                   | $0.0400    | $0.0500     | 8.2K    | N/A
Meta           | Llama 3 8b Instruct             | $0.0400    | $0.0400     | 8.2K    | 48.7 Good
Arcee-ai       | Trinity Mini                    | $0.0450    | $0.1500     | 131.1K  | N/A
Ibm-granite    | Granite 4.1 8b                  | $0.0500    | $0.1000     | 131.1K  | N/A
NVIDIA         | Nemotron 3 Nano 30b A3b         | $0.0500    | $0.2000     | 262.1K  | 65.5 Frontier
OpenAI         | GPT 5 Nano                      | $0.0500    | $0.4000     | 400K    | 49.9 Good
Qwen           | Qwen3 8b                        | $0.0500    | $0.4000     | 41.0K   | N/A
Mistral        | Mistral Small 24b Instruct 2501 | $0.0500    | $0.0800     | 32.8K   | 57.8 Strong
Meta           | Llama 3.2 3b Instruct           | $0.0510    | $0.3400     | 80K     | 38.5 Good
Google         | Gemma 4 26b A4b It              | $0.0600    | $0.3300     | 262.1K  | 87.1 Frontier
xAI            | Glm 4.7 Flash                   | $0.0600    | $0.4000     | 202.8K  | 74.6 Frontier
Google         | Gemma 3n E4b It                 | $0.0600    | $0.1200     | 32.8K   | 65.7 Frontier
Qwen           | Qwen3 14b                       | $0.0600    | $0.2400     | 41.0K   | N/A
Amazon         | Nova Lite V1                    | $0.0600    | $0.2400     | 300K    | N/A
Gryphe         | Mythomax L2 13b                 | $0.0600    | $0.0600     | 4.1K    | N/A
Qwen           | Qwen3.5 Flash 02 23             | $0.0650    | $0.2600     | 1M      | 80.0 Frontier
Microsoft      | Phi 4                           | $0.0650    | $0.1400     | 16.4K   | 54.6 Strong
Baidu          | Ernie 4.5 21b A3b Thinking      | $0.0700    | $0.2800     | 131.1K  | N/A
Baidu          | Ernie 4.5 21b A3b               | $0.0700    | $0.2800     | 120K    | N/A
Qwen           | Qwen3 Coder 30b A3b Instruct    | $0.0700    | $0.2700     | 160K    | N/A
Qwen           | Qwen3 235b A22b 2507            | $0.0710    | $0.1000     | 262.1K  | 52.2 Strong
Bytedance-seed | Seed 1.6 Flash                  | $0.0750    | $0.3000     | 262.1K  | N/A
OpenAI         | GPT Oss Safeguard 20b           | $0.0750    | $0.3000     | 131.1K  | N/A
Mistral        | Mistral Small 3.2 24b Instruct  | $0.0750    | $0.2000     | 128K    | N/A
Google         | Gemini 2.0 Flash Lite 001       | $0.0750    | $0.3000     | 1.0M    | 72.0 Frontier
Inclusionai    | Ling 2.6 Flash                  | $0.0800    | $0.2400     | 262.1K  | N/A
Microsoft      | Phi 4 Mini Instruct             | $0.0800    | $0.3500     | 128K    | N/A
Qwen           | Qwen3 Vl 8b Instruct            | $0.0800    | $0.5000     | 131.1K  | N/A
Qwen           | Qwen3 30b A3b Thinking 2507     | $0.0800    | $0.4000     | 131.1K  | 38.2 Good
Qwen           | Qwen3 32b                       | $0.0800    | $0.2800     | 41.0K   | 42.2 Good
Meta           | Llama 4 Scout                   | $0.0800    | $0.3000     | 327.7K  | N/A
Google         | Gemma 3 27b It                  | $0.0800    | $0.1600     | 131.1K  | 74.2 Frontier
NVIDIA         | Nemotron 3 Super 120b A12b      | $0.0900    | $0.4500     | 262.1K  | 73.5 Frontier
Alibaba        | Tongyi Deepresearch 30b A3b     | $0.0900    | $0.4500     | 131.1K  | N/A
Qwen           | Qwen3 Next 80b A3b Instruct     | $0.0900    | $1.10       | 262.1K  | 50.7 Strong

Page 1 of 7
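
Prices in the table are quoted per 1M tokens, so the cost of a single request is the token count scaled by the listed rate. The sketch below is one way to do that arithmetic; the function name `request_cost` and the example token counts are illustrative assumptions, not part of this page or any provider API.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are listed per 1M tokens


def request_cost(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Return the USD cost of one request given per-1M-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_per_1m)
    output_cost = Decimal(output_tokens) * Decimal(output_per_1m)
    return (input_cost + output_cost) / TOKENS_PER_UNIT


# GPT 5 Nano as listed above: $0.0500 input, $0.4000 output per 1M tokens.
# The 2,000 / 500 token split is a made-up example workload.
cost = request_cost(2_000, 500, "0.0500", "0.4000")
print(f"${cost} per request")
```

Decimal is used instead of float because the listed prices are small decimal fractions, and rounding noise adds up when the result is multiplied across many requests.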

[Chart: Cost per 1M Tokens, Flagship Models. Series: Input, Output]

Models by Use Case

242 models

Coding Models

Top models for code generation, debugging, and software engineering — ranked by LiveBench coding scores and API price.

182 models

Writing Models

Models with the highest language and writing scores for content creation, copywriting, marketing, and creative tasks.

4 models

Reasoning Models

Models with extended thinking for complex math, logic, and multi-step problem solving — ranked by benchmarks.

229 models

Models for AI Agents

Models built for autonomous AI agents — with function calling, large context windows, and competitive pricing for agentic workflows.

225 models

Budget Models

The most affordable LLMs under $1/1M input tokens for high-volume apps, prototyping, and cost-sensitive production.

128 models

Affordable Models

Mid-range models that balance quality and cost — under $5/1M input tokens with strong benchmark scores.
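
The two price thresholds above (under $1 and under $5 per 1M input tokens) can be read as simple bands on the input price. The sketch below assumes the bands are mutually exclusive and ignores the benchmark-score condition, which this page does not quantify; `price_tier` and the label `"above_affordable"` are hypothetical names.

```python
def price_tier(input_per_1m: float) -> str:
    """Classify a model by its input price in USD per 1M tokens.

    Thresholds come from the category descriptions; treating the
    categories as non-overlapping bands is an assumption.
    """
    if input_per_1m < 0:
        raise ValueError("price must be non-negative")
    if input_per_1m < 1.0:
        return "budget"
    if input_per_1m < 5.0:
        return "affordable"
    return "above_affordable"  # hypothetical label for everything else


# Input prices taken from the All Models table.
print(price_tier(0.0170))  # Granite 4.0 H Micro
print(price_tier(0.0900))  # Qwen3 Next 80b A3b Instruct
```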

26 models

Premium Models

Top-tier frontier models from leading providers — maximum capability for demanding tasks.

161 models

Best Value Models

Models with the best quality-to-price ratio — high benchmark scores at affordable prices.
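
This page does not say how the quality-to-price ratio is computed. One plausible reading, shown below as a sketch, divides the quality score by a blended per-1M price. The 75/25 input-to-output weighting, the `Model` type, and `value_score` are all assumptions made for illustration; models listed with quality N/A are skipped.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    name: str
    input_per_1m: float
    output_per_1m: float
    quality: Optional[float]  # None when the table shows N/A


def value_score(model: Model, input_share: float = 0.75) -> float:
    """Quality points per blended dollar (assumed formula, not the site's)."""
    blended = (input_share * model.input_per_1m
               + (1.0 - input_share) * model.output_per_1m)
    return model.quality / blended


# Rows copied from the All Models table.
models = [
    Model("Gemma 4 26b A4b It", 0.06, 0.33, 87.1),
    Model("Gemma 3 12b It", 0.04, 0.13, 70.0),
    Model("GPT 5 Nano", 0.05, 0.40, 49.9),
    Model("Qwen3 8b", 0.05, 0.40, None),
]

scored = [m for m in models if m.quality is not None]
ranked = sorted(scored, key=value_score, reverse=True)
for m in ranked:
    print(f"{m.name}: {value_score(m):.1f}")
```

Under this weighting a cheaper model with a lower score can outrank a higher-scoring one, so the ordering depends heavily on the chosen input share.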

156 models

Vision Models

Models that process images alongside text for visual analysis, OCR, and multimodal tasks.

274 models

Long Context Models

Models with 100K+ token context windows for processing long documents, codebases, and conversations.
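
To apply the 100K+ cutoff to the Context column, the abbreviated labels (for example 131.1K or 1M) first have to be turned into token counts. The sketch below assumes K means 1,000 and M means 1,000,000; the labels appear to be rounded (131.1K is consistent with 131,072), so the parsed values are approximate. `parse_context` and `is_long_context` are hypothetical helper names.

```python
import re

_UNITS = {"K": 1_000, "M": 1_000_000}  # assumed decimal units


def parse_context(label: str) -> int:
    """Convert a label such as '131.1K' or '1M' to an approximate token count."""
    match = re.fullmatch(r"(\d+(?:\.\d+)?)([KM])", label.strip())
    if match is None:
        raise ValueError(f"unrecognized context label: {label!r}")
    number, unit = match.groups()
    return round(float(number) * _UNITS[unit])


def is_long_context(label: str, threshold: int = 100_000) -> bool:
    """True when the context window meets the 100K-token cutoff."""
    return parse_context(label) >= threshold


# Context labels copied from the All Models table.
rows = {
    "Llama 3.1 8b Instruct": "16.4K",
    "GPT 5 Nano": "400K",
    "Qwen3.5 Flash 02 23": "1M",
    "Mistral Nemo": "131.1K",
}
long_context = [name for name, ctx in rows.items() if is_long_context(ctx)]
print(long_context)
```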

3 models

Models with Caching

Models that support prompt caching for reduced costs on repeated or similar requests.
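
The saving from prompt caching depends on how many input tokens are served from cache and on the cached-token rate. This page lists neither, so the sketch below takes both as parameters; `input_cost_with_cache` and the $0.0050 cached rate in the example are hypothetical and not any provider's published price.

```python
def input_cost_with_cache(input_tokens: int, cached_tokens: int,
                          input_per_1m: float, cached_per_1m: float) -> float:
    """USD input cost when part of the prompt is billed at a cached rate."""
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = fresh_tokens * input_per_1m + cached_tokens * cached_per_1m
    return total / 1_000_000  # prices are per 1M tokens


# 10,000-token prompt, 8,000 tokens reused from cache.
# $0.0500 is GPT 5 Nano's listed input price; $0.0050 is a made-up cached rate.
with_cache = input_cost_with_cache(10_000, 8_000, 0.0500, 0.0050)
without_cache = input_cost_with_cache(10_000, 0, 0.0500, 0.0050)
print(f"with cache: ${with_cache:.6f}, without: ${without_cache:.6f}")
```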

156 models

Multimodal Models

Models that handle multiple input types including text, images, and audio.

