# Mixtral 8x7B Pricing: $0.24/M input, $0.24/M output

Groq / April 2026

- **Input:** $0.24/M tokens
- **Output:** $0.24/M tokens
- **Context window:** 32K tokens

Fastest MoE inference available.
## Typical use cases

General-purpose inference, chat, extraction, structured output.
## Estimated monthly cost at scale

Assumes a 50/50 input/output token split at the stated daily volume, over a 30-day month.
| Daily Volume | Monthly Tokens | Estimated Monthly Cost |
|---|---|---|
| 1M tokens/day | 30M tokens | $7.20 |
| 5M tokens/day | 150M tokens | $36.00 |
| 10M tokens/day | 300M tokens | $72.00 |
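The table above can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming the stated 50/50 input/output split and a 30-day month; the function name is ours, not part of any Groq SDK.

```python
# Estimate monthly Mixtral 8x7B cost on Groq from daily token volume.
INPUT_PRICE = 0.24   # $ per million input tokens
OUTPUT_PRICE = 0.24  # $ per million output tokens

def monthly_cost(daily_tokens_m: float, days: int = 30) -> float:
    """Dollar cost for `daily_tokens_m` million tokens per day,
    split 50/50 between input and output."""
    monthly_m = daily_tokens_m * days
    half = monthly_m / 2  # million tokens each of input and output
    return half * INPUT_PRICE + half * OUTPUT_PRICE

for daily in (1, 5, 10):
    print(f"{daily}M tokens/day -> ${monthly_cost(daily):.2f}/month")
```

Because Mixtral prices input and output identically, the split assumption doesn't change the result here; it matters for models with asymmetric pricing.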
## vs. other Groq models
| Model | Input ($/M) | Output ($/M) | Context |
|---|---|---|---|
| Llama 3.3 70B Versatile | $0.59 | $0.79 | 128K |
| Gemma 2 9B | $0.20 | $0.20 | 8K |
| Llama 3.1 8B Instant | $0.05 | $0.08 | 128K |
| Llama 3.2 1B Preview | $0.04 | $0.04 | 128K |
| Llama 3.2 3B Preview | $0.06 | $0.06 | 128K |
| DeepSeek R1 Distill Llama 70B | $0.75 | $0.99 | 128K |
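To see what these per-token differences mean in dollars, here is a sketch that compares monthly cost across the models above at a fixed volume. It assumes the same 50/50 split and 30-day month as the earlier table; the price dictionary simply mirrors the rows listed here.

```python
# Monthly cost comparison across Groq models at 10M tokens/day.
# Prices are ($/M input, $/M output), taken from the table above.
MODELS = {
    "Mixtral 8x7B": (0.24, 0.24),
    "Llama 3.3 70B Versatile": (0.59, 0.79),
    "Gemma 2 9B": (0.20, 0.20),
    "Llama 3.1 8B Instant": (0.05, 0.08),
}

def monthly_cost(input_price: float, output_price: float,
                 daily_tokens_m: float = 10, days: int = 30) -> float:
    half = daily_tokens_m * days / 2  # million tokens each way
    return half * input_price + half * output_price

for name, (inp, outp) in sorted(MODELS.items(),
                                key=lambda kv: monthly_cost(*kv[1])):
    print(f"{name}: ${monthly_cost(inp, outp):.2f}/month")
```

At this volume, Llama 3.1 8B Instant comes in well under Mixtral, while Llama 3.3 70B Versatile costs roughly three times as much; whether the cheaper models hold up on quality is the question the next section addresses.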
Not sure if Mixtral 8x7B is the right fit for your workload? Clawback tests cheaper alternatives against your actual prompts and tells you exactly where you're overpaying.
Test if a cheaper model matches Mixtral 8x7B quality →