AI Models
Explore the leading LLMs on performance benchmarks, latency, and pricing.
Model
Score (%)
Input Price
Output Price
Feb/4/2026
Claude Opus 4.6
AIMultiple Score94%
Context Window1M
Input Price($/M)$5.00
Output Price($/M)$25.00
Max Output Tokens128k
Benchmark Performance
AIMultiple
Rank
Category
Benchmarks
Score
Cost Analysis
Blended Price (USD/1M Tokens)
$10.00
From Anthropic
Min. Latency
1.91s
From Anthropic
FAQ
Consider your primary needs:
Content creation: Focus on AI reasoning and memory scores
Software development: Prioritize AI code performance
Data analysis: Look at text-to-sqlL and AI finance scores
Business automation: Consider Agentic RAG and AI Agents Performance
Factual accuracy: Emphasize low hallucination rates
Content creation: Focus on AI reasoning and memory scores
Software development: Prioritize AI code performance
Data analysis: Look at text-to-sqlL and AI finance scores
Business automation: Consider Agentic RAG and AI Agents Performance
Factual accuracy: Emphasize low hallucination rates
These represent different tiers of OpenAI's GPT-5 family:
GPT-5: Full-featured flagship model
GPT-5 Mini: Optimized for speed and cost while maintaining strong performance
GPT-5 Nano: Ultra-fast, lightweight version for high-volume applications
GPT-5: Full-featured flagship model
GPT-5 Mini: Optimized for speed and cost while maintaining strong performance
GPT-5 Nano: Ultra-fast, lightweight version for high-volume applications
Date suffixes indicate specific training cutoffs or release versions. For example, "claude-3-7-sonnet-20250219" was released on February 19, 2025, helping users track which exact version they're evaluating.
The "32b" in models like "exaone-4.0-32b" refers to 32 billion parameters. Generally, more parameters allow for better performance, but also require more computational resources and cost more to run.
Mini variants: Optimized for speed and cost, typically 65-80% the performance of full models
High variants: Maximum performance configurations, often with increased computational requirements
High variants: Maximum performance configurations, often with increased computational requirements