deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
Deepinfra textdeepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
Input
$0.4000
per 1M input tokens
Output
$0.4000
per 1M output tokens
Cache read
n/a
per 1M cached read tokens
Cache write
n/a
per 1M cache write tokens
Context window
131,072
Max output
131,072
Effective date
Jun 15, 2026
Estimate a workload
Enter token counts to see the cost at this model's rates.
Estimated cost:
$0.00
Price history
blended $ per 1M tokens (3:1)One price on record so far (Jun 15, 2026). This chart fills in as the price changes over time. We check daily.
| Effective | Input | Output | Blended |
|---|---|---|---|
| Jun 15, 2026 | $0.40 | $0.40 | $0.40 |
Other Deepinfra models
deepinfra/meta-llama/Llama-3.2-3B-Instruct
· $0.0200/Mtok out
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
· $0.0300/Mtok out
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
· $0.0400/Mtok out
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
· $0.0600/Mtok out
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
· $0.0500/Mtok out
deepinfra/Qwen/Qwen2.5-7B-Instruct
· $0.1000/Mtok out
Source: litellm. Confirm against the provider's official pricing before relying on these figures.