Fireworks.ai Pricing → Llama 3.1 8B Instruct Verified Apr 2026

Llama 3.1 8B Instruct Pricing — $0.10/M input, $0.10/M output

Fireworks.ai / April 2026

Input $0.10/M tokens

Output $0.10/M tokens

Context Window 128K tokens

Lightweight model for high-throughput workloads

Typical use cases

General-purpose inference, chat, extraction, structured output

Estimated monthly cost at scale

Assumes 50/50 input/output token split at stated daily volume.

Daily Volume	Monthly Tokens	Estimated Monthly Cost
1M tokens/day	30M tokens	$3.00
5M tokens/day	150M tokens	$15.00
10M tokens/day	300M tokens	$30.00

vs. other Fireworks.ai models

Model	Input ($/M)	Output ($/M)	Context
Llama 3.3 70B Instruct	$0.90	$0.90	128K
DeepSeek R1	$3.00	$8.00	128K
Qwen 2.5 72B Instruct	$0.90	$0.90	32K
Mixtral 8x22B Instruct	$0.90	$0.90	65K

Not sure if Llama 3.1 8B Instruct is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.

Test if a cheaper model matches Llama 3.1 8B Instruct quality →