| Usage Level | Calls/Month | Monthly Cost |
|---|---|---|
| Hobby | 1,000 | $0.07 |
| Startup | 10,000 | $0.72 |
| Growth | 100,000 | $7.20 |
| Scale | 500,000 | $36.00 |
| Enterprise | 2,000,000 | $144.00 |
Assumes 800 input + 400 output tokens per call.
| Model | Provider | Input/1M | Output/1M | vs Llama 3.1 8B |
|---|---|---|---|---|
| Llama 3.1 8B | Groq | $0.0500 | $0.0800 | — |
| Mistral Small | Mistral AI | $0.0600 | $0.1800 | +85% more |
| Gemini 1.5 Flash 8B | Google Gemini | $0.0375 | $0.1500 | +44% more |
| Gemini 1.5 Flash | Google Gemini | $0.0750 | $0.3000 | +188% more |
| text-embedding-3-small | unknown | $0.0200 | $0.0000 | -85% cheaper |
| embed-english-v3.0 | unknown | $0.1000 | $0.0000 | -23% cheaper |
Llama 3.1 8B by Groq costs $0.05/M input tokens and $0.08/M output tokens as of June 2026.
At 800 input + 400 output tokens per call, 1,000 calls to Llama 3.1 8B costs approximately $0.07.
Llama 3.1 8B input costs $0.05/M vs GPT-4o at $5.00/M. Llama 3.1 8B is 99% cheaper for input tokens.