Models with Caching

LLMs with Prompt Caching: Save on Repeated API Calls

Compare LLMs that offer prompt caching. Cache your system prompt, few-shot examples, or conversation history to dramatically cut costs on subsequent requests. Essential for AI agents and chatbots that make many calls with shared context.

6 models·2 providers·$0.0900 to $0.7000/1M input

Top Picks

Compare Models

vs
Compare

Related Categories

Cost per 1M Tokens / Flagship Models

Input
Output

All Models with Caching

6 models
ProviderModelInput $/1MOutput $/1MContextQuality
DeepSeekDeepseek V4 Flash$0.0900$0.18001.0M
68.9Frontier
DeepSeekDeepseek Chat$0.2002$0.8001131.1KN/A
DeepSeekDeepseek V3.2$0.2288$0.3432131.1KN/A
InceptionMercury 2$0.2500$0.7500128KN/A
DeepSeekDeepseek V4 Pro$0.4350$0.87001.0M
76.4Frontier
DeepSeekDeepseek R1$0.7000$2.50163.8K
64.6Strong

Deploy your AI agent for $33/mo flat.

Managed Telegram bot hosting. We handle the infrastructure.

Any model. Any channel. Zero infrastructure.