AI Cost Analytics
Real-time token pricing, API economics, GPU serving costs, inference economics
Cheapest Input
Free
Llama 3.1 405B
Most Expensive Input
$15.00/1M
Claude 3 Opus
Cheapest GPU Cloud
$0.15/hr
P100 on Vast.ai
Enterprise AI Spend
$6.9M/mo
Tracked enterprises
API Cost Comparison / 1M tokens
Sorted by input cost
| Model | Provider | Input/1M | Output/1M | Context | Speed | Efficiency |
|---|---|---|---|---|---|---|
| Llama 3.1 70B | Meta | Free | Free | 128K | 112ms | |
| Llama 3.1 405B | Meta | Free | Free | 128K | 412ms | |
| DeepSeek V3 | DeepSeek | $0.27/1M | $1.10/1M | 128K | 156ms | |
| Gemini 2.0 Flash | $0.35/1M | $0.70/1M | 1M | 89ms | ||
| DeepSeek R1 | DeepSeek | $0.55/1M | $2.19/1M | 128K | 298ms | |
| Mistral Large v2 | Mistral | $2.00/1M | $6.00/1M | 128K | 98ms | |
| GPT-4o | OpenAI | $2.50/1M | $10.00/1M | 128K | 142ms | |
| Claude 3.5 Sonnet | Anthropic | $3.00/1M | $15.00/1M | 200K | 187ms | |
| Gemini 1.5 Pro | $3.50/1M | $10.50/1M | 2M | 165ms | ||
| Grok 2 | xAI | $5.00/1M | $15.00/1M | 131K | 134ms | |
| GPT-4-turbo | OpenAI | $10.00/1M | $30.00/1M | 128K | 256ms | |
| Claude 3 Opus | Anthropic | $15.00/1M | $75.00/1M | 200K | 412ms |