Llama 3.1 8B Pricing

Groq • Model ID: llama-3.1-8b-instant

Token Pricing (per 1M tokens)

Input tokens: $0.0500

Output tokens: $0.0800

Monthly Cost Estimates

Usage Level	Calls/Month	Monthly Cost
Hobby	1,000	$0.07
Startup	10,000	$0.72
Growth	100,000	$7.20
Scale	500,000	$36.00
Enterprise	2,000,000	$144.00

Assumes 800 input + 400 output tokens per call.
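The estimates above follow directly from the per-1M-token prices. As a minimal sketch of that arithmetic (the function and constant names here are illustrative, not part of any Groq or Tokonomics API), the monthly figure is calls × (input tokens × input price + output tokens × output price) / 1,000,000:

```python
from decimal import Decimal

# Per-1M-token prices quoted on this page for llama-3.1-8b-instant.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.08")
TOKENS_PER_M = Decimal(1_000_000)


def cost_per_call(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one call with the given token counts."""
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M


def monthly_cost(calls_per_month: int,
                 input_tokens: int = 800,
                 output_tokens: int = 400) -> Decimal:
    """Monthly cost in USD; defaults match the page's 800 + 400 assumption."""
    return calls_per_month * cost_per_call(input_tokens, output_tokens)


# Recompute the usage tiers from the table above.
for tier, calls in [("Hobby", 1_000), ("Startup", 10_000),
                    ("Growth", 100_000), ("Scale", 500_000),
                    ("Enterprise", 2_000_000)]:
    print(f"{tier}: ${monthly_cost(calls):.2f}")
```

Under these assumptions a single call costs $0.000072, so the Hobby tier works out to $0.072, which the table rounds to $0.07. Decimal is used instead of float only to keep the small per-call amounts exact.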

Compare with Alternatives

Model	Provider	Input/1M	Output/1M	vs Llama 3.1 8B (input + output)
Llama 3.1 8B	Groq	$0.0500	$0.0800	baseline
Mistral Small	Mistral AI	$0.0600	$0.1800	85% more
Gemini 1.5 Flash 8B	Google Gemini	$0.0375	$0.1500	44% more
Gemini 1.5 Flash	Google Gemini	$0.0750	$0.3000	188% more
Gemini 2.0 Flash	Google Gemini	$0.1000	$0.4000	285% more
Command R	Cohere	$0.1500	$0.6000	477% more
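The page does not state how the last column is computed. The figures are consistent with comparing the sum of each model's input and output per-1M prices against Llama 3.1 8B's combined $0.13, so the sketch below assumes that method; it is an inference from the numbers, and the helper name is hypothetical.

```python
# Baseline: Llama 3.1 8B on Groq, per-1M-token prices from the table.
BASE_INPUT, BASE_OUTPUT = 0.05, 0.08

# (model, input price per 1M, output price per 1M), copied from the table.
ALTERNATIVES = [
    ("Mistral Small", 0.06, 0.18),
    ("Gemini 1.5 Flash 8B", 0.0375, 0.15),
    ("Gemini 1.5 Flash", 0.075, 0.30),
    ("Gemini 2.0 Flash", 0.10, 0.40),
    ("Command R", 0.15, 0.60),
]


def percent_more(input_price: float, output_price: float) -> float:
    """Percent by which (input + output) exceeds the baseline's sum.

    Assumption: both prices are weighted equally, which matches the
    table's figures but ignores the real input/output token mix.
    """
    combined = input_price + output_price
    baseline = BASE_INPUT + BASE_OUTPUT
    return (combined / baseline - 1) * 100


for name, inp, out in ALTERNATIVES:
    print(f"{name}: {round(percent_more(inp, out))}% more")
```

Because this weighting ignores the actual token mix, the gap for a specific workload can differ. At 800 input + 400 output tokens per call, for example, Mistral Small comes out roughly 67% more expensive rather than 85%.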

Other Groq Models

Llama 3.3 70B — $0.59/M

Frequently Asked Questions

How much does Llama 3.1 8B cost?

Llama 3.1 8B on Groq costs $0.05 per 1M input tokens and $0.08 per 1M output tokens as of June 2026.

How much do 1,000 API calls to Llama 3.1 8B cost?

At 800 input + 400 output tokens per call, 1,000 calls to Llama 3.1 8B cost approximately $0.07.

Is Llama 3.1 8B cheaper than GPT-4o?

Llama 3.1 8B input costs $0.05/M vs GPT-4o at $5.00/M. Llama 3.1 8B is 99% cheaper for input tokens.
