| Usage Level | Calls/Month | Monthly Cost |
|---|---|---|
| Hobby | 1,000 | $0.09 |
| Startup | 10,000 | $0.90 |
| Growth | 100,000 | $9.00 |
| Scale | 500,000 | $45.00 |
| Enterprise | 2,000,000 | $180.00 |
Assumes 800 input + 400 output tokens per call.
| Model | Provider | Input/1M | Output/1M | vs Gemini 1.5 Flash 8B |
|---|---|---|---|---|
| Gemini 1.5 Flash 8B | Google Gemini | $0.0375 | $0.1500 | — |
| Llama 3.1 8B | Groq | $0.0500 | $0.0800 | -31% cheaper |
| text-embedding-3-small | unknown | $0.0200 | $0.0000 | -89% cheaper |
| Mistral Small | Mistral AI | $0.0600 | $0.1800 | +28% more |
| embed-english-v3.0 | unknown | $0.1000 | $0.0000 | -47% cheaper |
| embed-multilingual-v3.0 | unknown | $0.1000 | $0.0000 | -47% cheaper |
Gemini 1.5 Flash 8B by Google Gemini costs $0.0375/M input tokens and $0.15/M output tokens as of June 2026.
At 800 input + 400 output tokens per call, 1,000 calls to Gemini 1.5 Flash 8B costs approximately $0.09.
Gemini 1.5 Flash 8B input costs $0.0375/M vs GPT-4o at $5.00/M. Gemini 1.5 Flash 8B is 99% cheaper for input tokens.