Cerebras Pricing → Llama 3.3 70B Price unverified

Llama 3.3 70B Pricing — $0.85/M input, $1.20/M output

Cerebras / April 2026
Input $0.85/M tokens
Output $1.20/M tokens
Context Window 8K tokens

Extremely fast inference — sub-second TTFT for most requests

Typical use cases

General-purpose inference, chat, extraction, structured output

Estimated monthly cost at scale

Assumes 50/50 input/output token split at stated daily volume.

Daily Volume Monthly Tokens Estimated Monthly Cost
1M tokens/day 30M tokens $30.75
5M tokens/day 150M tokens $153.75
10M tokens/day 300M tokens $307.50

vs. other Cerebras models

Model Input ($/M) Output ($/M) Context
Llama 3.1 8B $0.10 $0.10 8K

Not sure if Llama 3.3 70B is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.

Test if a cheaper model matches Llama 3.3 70B quality →