NVIDIA
Llama 3.1 Nemotron 70b Instruct
nvidia/llama-3.1-nemotron-70b-instruct
131.1K context · 16.4K max output · text -> text
Pricing
Input: $1.20 per 1M tokens
Output: $1.20 per 1M tokens
Cache Read: not available
Cache Write: not available
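As a rough illustration of how the per-1M-token rates translate into the cost of a single request, here is a minimal sketch. The helper name `estimate_cost_usd` and the token counts in the usage note are hypothetical; only the $1.20 input and output rates come from this listing.

```python
from decimal import Decimal

# Rates taken from this listing, in USD per 1M tokens.
INPUT_RATE_PER_1M = Decimal("1.20")
OUTPUT_RATE_PER_1M = Decimal("1.20")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request.

    Hypothetical helper for illustration; it is not part of any provider API.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed as tokens * rate / 1,000,000.
    input_cost = Decimal(input_tokens) * INPUT_RATE_PER_1M / TOKENS_PER_UNIT
    output_cost = Decimal(output_tokens) * OUTPUT_RATE_PER_1M / TOKENS_PER_UNIT
    return input_cost + output_cost
```

For example, a request with 10,000 input tokens and 2,000 output tokens comes to 12,000 x $1.20 / 1,000,000 = $0.0144. Because no cache pricing is listed, the sketch bills every input token at the full input rate.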
Quality Benchmarks
Strong · Value: 57.6
Coding: code generation, completion, debugging
Math: competition math, calculus, olympiad
Reasoning: logic, spatial, puzzle solving
Language: summarization, paraphrase, writing
Overall
Source: LiveBench (livebench.ai) / Scores 0-100
Capabilities
Vision · Tool Use · Reasoning · Prompt Caching