Llama 3.3 70B Pricing — $0.88/M input, $0.88/M output
Meta (via Together.ai) / April 2026
Input $0.88/M tokens
Output $0.88/M tokens
Context Window 128K tokens
Open-weight model hosted via Together.ai
Typical use cases
General-purpose inference, chat, extraction, structured output
Estimated monthly cost at scale
Assumes 50/50 input/output token split at stated daily volume.
| Daily Volume | Monthly Tokens | Estimated Monthly Cost |
|---|---|---|
| 1M tokens/day | 30M tokens | $26.40 |
| 5M tokens/day | 150M tokens | $132.00 |
| 10M tokens/day | 300M tokens | $264.00 |
vs. other Meta (via Together.ai) models
| Model | Input ($/M) | Output ($/M) | Context |
|---|---|---|---|
| Llama 3.3 70B (Fireworks) | $0.90 | $0.90 | 128K |
| Llama 3.1 70B | $0.88 | $0.88 | 128K |
| Llama 3.2 90B Vision | $0.88 | $0.88 | 128K |
| Llama 3.2 11B Vision | $0.18 | $0.18 | 128K |
Not sure if Llama 3.3 70B is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.
Test if a cheaper model matches Llama 3.3 70B quality →