Cerebras Pricing → Llama 3.1 8B Price unverified

Llama 3.1 8B Pricing — $0.10/M input, $0.10/M output

Cerebras / April 2026
Input $0.10/M tokens
Output $0.10/M tokens
Context Window 8K tokens

Fastest 8B inference in the market

Typical use cases

General-purpose inference, chat, extraction, structured output

Estimated monthly cost at scale

Assumes 50/50 input/output token split at stated daily volume.

Daily Volume Monthly Tokens Estimated Monthly Cost
1M tokens/day 30M tokens $3.00
5M tokens/day 150M tokens $15.00
10M tokens/day 300M tokens $30.00

vs. other Cerebras models

Model Input ($/M) Output ($/M) Context
Llama 3.3 70B $0.85 $1.20 8K

Not sure if Llama 3.1 8B is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.

Test if a cheaper model matches Llama 3.1 8B quality →