Multimodal LLMs: Text, Image, Audio, and Beyond

Compare multimodal LLMs that go beyond text. These models process images, audio, and other input types alongside text, enabling rich applications like visual Q&A, document analysis, and creative content generation.

157 models · 20 providers · $0.0500 to $150.00 per 1M input tokens

[Chart: Cost per 1M Tokens, Flagship Models (series: Input, Output)]

All Multimodal Models

157 models
Provider | Model | Input $/1M | Output $/1M | Context | Quality
OpenAI | GPT 5 Nano | $0.0500 | $0.4000 | 400K | 49.9 (Good)
Google | Gemma 3 4b It | $0.0500 | $0.1000 | 131.1K | N/A
Google | Gemma 3 12b It | $0.0500 | $0.1500 | 131.1K | N/A
Google | Gemma 4 26b A4b It | $0.0600 | $0.3300 | 262.1K | N/A
Amazon | Nova Lite V1 | $0.0600 | $0.2400 | 300K | N/A
Qwen | Qwen3.5 Flash 02 23 | $0.0650 | $0.2600 | 1M | N/A
Bytedance-seed | Seed 1.6 Flash | $0.0750 | $0.3000 | 262.1K | N/A
Mistral | Mistral Small 3.2 24b Instruct | $0.0750 | $0.2000 | 128K | N/A
Qwen | Qwen3 Vl 8b Instruct | $0.0800 | $0.5000 | 256K | N/A
Google | Gemma 3 27b It | $0.0800 | $0.1600 | 131.1K | N/A
Rekaai | Reka Edge | $0.1000 | $0.1000 | 16.4K | N/A
Qwen | Qwen3.5 9b | $0.1000 | $0.1500 | 262.1K | N/A
Bytedance-seed | Seed 2.0 Mini | $0.1000 | $0.4000 | 262.1K | N/A
Mistral | Ministral 3b 2512 | $0.1000 | $0.1000 | 131.1K | N/A
Google | Gemini 2.5 Flash Lite Preview 09 2025 | $0.1000 | $0.4000 | 1.0M | 40.8 (Good)
Bytedance | Ui Tars 1.5 7b | $0.1000 | $0.2000 | 128K | N/A
Google | Gemini 2.5 Flash Lite | $0.1000 | $0.4000 | 1.0M | 40.9 (Good)
OpenAI | GPT 4.1 Nano | $0.1000 | $0.4000 | 1.0M | N/A
Meta | Llama 4 Scout | $0.1000 | $0.3000 | 10M | N/A
Qwen | Qwen3 Vl 32b Instruct | $0.1040 | $0.4160 | 262.1K | N/A
Qwen | Qwen3 Vl 8b Thinking | $0.1170 | $1.36 | 256K | N/A
Google | Gemma 4 31b It | $0.1200 | $0.3500 | 262.1K | 62.4 (Strong)
Qwen | Qwen3 Vl 30b A3b Thinking | $0.1300 | $1.56 | 131.1K | N/A
Qwen | Qwen3 Vl 30b A3b Instruct | $0.1300 | $0.5200 | 262.1K | N/A
Qwen | Qwen3.6 35b A3b | $0.1400 | $1.00 | 262.1K | N/A
Xiaomi | Mimo V2.5 | $0.1400 | $0.2800 | 1.0M | N/A
Qwen | Qwen3.5 35b A3b | $0.1400 | $1.00 | 262.1K | N/A
Perceptron | Perceptron Mk1 | $0.1500 | $1.50 | 32.8K | N/A
Mistral | Mistral Small 2603 | $0.1500 | $0.6000 | 262.1K | N/A
Mistral | Ministral 8b 2512 | $0.1500 | $0.1500 | 262.1K | N/A
Meta | Llama 4 Maverick | $0.1500 | $0.6000 | 1.0M | N/A
OpenAI | GPT 4o Mini 2024 07 18 | $0.1500 | $0.6000 | 128K | N/A
OpenAI | GPT 4o Mini | $0.1500 | $0.6000 | 128K | N/A
Meta | Llama Guard 4 12b | $0.1800 | $0.1800 | 163.8K | N/A
Qwen | Qwen3.6 Flash | $0.1875 | $1.13 | 1M | 62.0 (Strong)
Qwen | Qwen3.5 27b | $0.1950 | $1.56 | 262.1K | N/A
Stepfun | Step 3.7 Flash | $0.2000 | $1.15 | 256K | N/A
OpenAI | GPT 5.4 Nano | $0.2000 | $1.25 | 400K | 73.8 (Frontier)
Mistral | Ministral 14b 2512 | $0.2000 | $0.2000 | 262.1K | N/A
Qwen | Qwen3 Vl 235b A22b Instruct | $0.2000 | $0.8800 | 262.1K | N/A
Minimax | Minimax 01 | $0.2000 | $1.10 | 1.0M | N/A
Nex-agi | Nex N2 Pro | $0.2500 | $1.00 | 262.1K | N/A
Google | Gemini 3.1 Flash Lite | $0.2500 | $1.50 | 1.0M | 62.2 (Strong)
Bytedance-seed | Seed 2.0 Lite | $0.2500 | $2.00 | 262.1K | N/A
Google | Gemini 3.1 Flash Lite Preview | $0.2500 | $1.50 | 1.0M | 62.2 (Strong)
Bytedance-seed | Seed 1.6 | $0.2500 | $2.00 | 262.1K | N/A
OpenAI | GPT 5.1 Codex Mini | $0.2500 | $2.00 | 400K | 46.1 (Good)
OpenAI | GPT 5 Mini | $0.2500 | $2.00 | 400K | 68.0 (Frontier)
Anthropic | Claude 3 Haiku | $0.2500 | $1.25 | 200K | N/A
Qwen | Qwen3.5 122b A10b | $0.2600 | $2.08 | 262.1K | N/A
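
Because the listing quotes prices per 1M tokens, the cost of a single request is a simple proportion. The sketch below is illustrative only: the `Pricing` class and `request_cost` function are hypothetical names, the example uses the GPT 5 Nano prices from the listing, and it assumes every input (including images or audio) has already been converted to a token count, which is provider-specific and not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-1M-token prices in USD, as quoted in the listing."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Listed prices for GPT 5 Nano: $0.0500 input, $0.4000 output per 1M tokens.
gpt5_nano = Pricing(input_per_million=0.05, output_per_million=0.40)

# Hypothetical request: 10,000 input tokens and 1,000 output tokens.
cost = request_cost(gpt5_nano, input_tokens=10_000, output_tokens=1_000)
print(f"Estimated cost: ${cost:.6f}")
```

For output-heavy workloads the output price tends to dominate, so it can be worth comparing models on both columns rather than on input price alone.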

Page 1 of 4
