Inference Volume$52.1B+5.1%
Model Rankings165updated
AI Traffic624.8K+38.2%
Global Latency11msoptimal
API Calls102.4M+12.4%
Live Agents582,391+42.1%
AI Market Cap$58.2B+5.2%
GPU Nodes6,247+10.2%
Inference Volume$52.1B+5.1%
Model Rankings165updated
AI Traffic624.8K+38.2%
Global Latency11msoptimal
API Calls102.4M+12.4%
Live Agents582,391+42.1%
AI Market Cap$58.2B+5.2%
GPU Nodes6,247+10.2%

AI Model Compare Engine

Benchmark and compare leading AI models across reasoning, coding, vision, speed, and cost.

Select Models to Compare (max 6)

Benchmark Comparison

Sort by:
ModelProviderOverallReasoningCodingVisionMathSpeedEfficiencyContextInput CostOutput CostLatency
GPT-4oOpenAI8894BEST9396BEST917872128K$2.50/1M$10.00/1M142ms
Claude 3.5 SonnetAnthropic879395BEST91908278200K$3.00/1M$15.00/1M187ms
DeepSeek V3DeepSeek92BEST898878928588128K$0.27/1M$1.10/1M156ms
Gemini 2.0 FlashGoogle8890929494BEST95BEST921M$0.35/1M$0.70/1M89ms
Llama 3.1 70BMeta86868475859095BEST128KFree/1MFree/1M112ms

GPT-4o

OpenAI · ~1.8T

May 2024

88
Overall
MultimodalEnterpriseVision
94
Reasoning
93
Coding
96
Vision
91
Math
78
Speed
72
Efficiency

Claude 3.5 Sonnet

Anthropic · ~175B

Jun 2024

87
Overall
CodingAnalysisLong-context
93
Reasoning
95
Coding
91
Vision
90
Math
82
Speed
78
Efficiency

DeepSeek V3

DeepSeek · ~671B

Dec 2024

92
Overall
MathCost-efficientCoding
89
Reasoning
88
Coding
78
Vision
92
Math
85
Speed
88
Efficiency

Gemini 2.0 Flash

Google · ~100B

Dec 2024

88
Overall
SpeedLong-contextCost-efficient
90
Reasoning
92
Coding
94
Vision
94
Math
95
Speed
92
Efficiency

Llama 3.1 70B

Meta · 70B

Jul 2024

86
Overall
Open-sourceLocalFine-tuning
86
Reasoning
84
Coding
75
Vision
85
Math
90
Speed
95
Efficiency