AI Cost Analytics

Real-time token pricing, API economics, GPU serving costs, inference economics

Cheapest Input: Free (Llama 3.1 405B)

Most Expensive Input: $15.00/1M (Claude 3 Opus)

Cheapest GPU Cloud: $0.15/hr (P100 on Vast.ai)

Enterprise AI Spend: $6.9M/mo (Tracked enterprises)

Sorted by input cost

Model	Provider	Input/1M tokens	Output/1M tokens	Context (tokens)	Speed (latency)
Llama 3.1 70B	Meta	Free	Free	128K	112ms
Llama 3.1 405B	Meta	Free	Free	128K	412ms
DeepSeek V3	DeepSeek	$0.27/1M	$1.10/1M	128K	156ms
Gemini 2.0 Flash	Google	$0.35/1M	$0.70/1M	1M	89ms
DeepSeek R1	DeepSeek	$0.55/1M	$2.19/1M	128K	298ms
Mistral Large v2	Mistral	$2.00/1M	$6.00/1M	128K	98ms
GPT-4o	OpenAI	$2.50/1M	$10.00/1M	128K	142ms
Claude 3.5 Sonnet	Anthropic	$3.00/1M	$15.00/1M	200K	187ms
Gemini 1.5 Pro	Google	$3.50/1M	$10.50/1M	2M	165ms
Grok 2	xAI	$5.00/1M	$15.00/1M	131K	134ms
GPT-4-turbo	OpenAI	$10.00/1M	$30.00/1M	128K	256ms
Claude 3 Opus	Anthropic	$15.00/1M	$75.00/1M	200K	412ms
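
As a rough illustration of how the per-1M-token prices in the table translate into a per-request cost, the sketch below multiplies token counts by the listed input and output rates. The function name `request_cost` and the workload of 2,000 input tokens and 500 output tokens are hypothetical, chosen only for the example; the prices are copied from three rows of the table.

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "DeepSeek V3": (0.27, 1.10),
    "GPT-4o": (2.50, 10.00),
    "Claude 3 Opus": (15.00, 75.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed per-1M-token rates."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 2_000, 500):.6f} per request")
```

For that assumed workload, GPT-4o works out to $0.01 per request and Claude 3 Opus to $0.0675, so the gap between models grows with output-heavy traffic because output tokens are priced several times higher than input tokens.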