| Usage Level | Calls/Month | Monthly Cost |
|---|---|---|
| Hobby | 1,000 | $0.79 |
| Startup | 10,000 | $7.88 |
| Growth | 100,000 | $78.80 |
| Scale | 500,000 | $394.00 |
| Enterprise | 2,000,000 | $1,576.00 |
Assumes 800 input + 400 output tokens per call.
| Model | Provider | Input/1M | Output/1M | vs Llama 3.3 70B |
|---|---|---|---|---|
| Llama 3.3 70B | Groq | $0.5900 | $0.7900 | — |
| Gemini 1.0 Pro | Google Gemini | $0.5000 | $1.5000 | +45% more |
| Mistral Large | Mistral AI | $0.5000 | $1.5000 | +45% more |
| GPT-3.5 Turbo | OpenAI | $0.5000 | $1.5000 | +45% more |
| Mixtral 8x7B | Mistral AI | $0.7000 | $0.7000 | +1% more |
| Mistral Medium | Mistral AI | $0.4000 | $2.0000 | +74% more |
Llama 3.3 70B by Groq costs $0.59/M input tokens and $0.79/M output tokens as of June 2026.
At 800 input + 400 output tokens per call, 1,000 calls to Llama 3.3 70B costs approximately $0.79.
Llama 3.3 70B input costs $0.59/M vs GPT-4o at $5.00/M. Llama 3.3 70B is 88% cheaper for input tokens.