Cerebras Pricing → Llama 3.1 8B Price unverified

Llama 3.1 8B Pricing — $0.10/M input, $0.10/M output

Cerebras / April 2026

Input $0.10/M tokens

Output $0.10/M tokens

Context Window 8K tokens

Fastest 8B inference in the market

Typical use cases

General-purpose inference, chat, extraction, structured output

Estimated monthly cost at scale

Assumes 50/50 input/output token split at stated daily volume.

Daily Volume	Monthly Tokens	Estimated Monthly Cost
1M tokens/day	30M tokens	$3.00
5M tokens/day	150M tokens	$15.00
10M tokens/day	300M tokens	$30.00

vs. other Cerebras models

Model	Input ($/M)	Output ($/M)	Context
Llama 3.3 70B	$0.85	$1.20	8K

Not sure if Llama 3.1 8B is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.

Test if a cheaper model matches Llama 3.1 8B quality →