AI Models

Compare the leading LLMs by benchmark performance, latency, and pricing.

Claude Fable 5
Anthropic, Jun 9, 2026

Overall Score: 92%
Context Window: 1M tokens
Input Price ($/M tokens): $10.00
Output Price ($/M tokens): $50.00
Max Output Tokens: 128k

Benchmark Performance

AIMultiple / Holdout / Rank

Cost Analysis

Lowest Input Cost ($/M tokens): $10.00, from Microsoft Azure
Lowest Output Cost ($/M tokens): $50.00, from Microsoft Azure
Min. Latency: 5.20 s, from Microsoft Azure
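
The per-million-token prices listed for Claude Fable 5 translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming both prices are billed linearly per token with no caching or batch discounts; the function name `request_cost` and the example token counts are hypothetical and not part of the listing.

```python
from decimal import Decimal

# Prices from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("10.00")
OUTPUT_PRICE_PER_M = Decimal("50.00")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed prices.

    Assumes linear per-token billing (an assumption, not stated in the listing).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Hypothetical request: 20,000 input tokens and 2,000 output tokens.
cost = request_cost(20_000, 2_000)
print(f"{cost:.4f}")  # 0.3000
```

Under the same assumption, a response that uses the full 128k output limit would cost about $6.40 in output tokens alone (128,000 × $50.00 / 1,000,000).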

FAQ

Which model should I choose for my use case?

Consider your primary needs:
Content creation: Focus on AI reasoning and memory scores
Software development: Prioritize AI code performance
Data analysis: Look at text-to-SQL and AI finance scores
Business automation: Consider Agentic RAG and AI Agents performance
Factual accuracy: Emphasize low hallucination rates

What's the difference between GPT-5, GPT-5 Mini, and GPT-5 Nano?

These represent different tiers of OpenAI's GPT-5 family:
GPT-5: Full-featured flagship model
GPT-5 Mini: Optimized for speed and cost while maintaining strong performance
GPT-5 Nano: Ultra-fast, lightweight version for high-volume applications

Why do some models have specific date suffixes?

Date suffixes identify a specific dated version of a model. For example, "claude-3-7-sonnet-20250219" carries the date February 19, 2025, which helps users track exactly which version they are evaluating.

What's the significance of model size indicators like "32b"?

The "32b" in models like "exaone-4.0-32b" refers to 32 billion parameters. Generally, more parameters allow for better performance, but they also require more computational resources and cost more to run.

How do "mini" and "high" variants compare?

Mini variants: Optimized for speed and cost, typically 65-80% of the performance of full models
High variants: Maximum performance configurations, often with increased computational requirements
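
The naming conventions described in the FAQ (an eight-digit date suffix such as "20250219" and a size indicator such as "32b") can be read mechanically from a model identifier. The sketch below is illustrative only: the names `parse_model_id`, `ModelIdInfo`, and `approx_weight_gb` are hypothetical, the regular expressions cover only the two patterns shown in the FAQ, and the memory estimate assumes 2 bytes per parameter (16-bit weights), which the page does not state.

```python
import re
from datetime import date
from typing import NamedTuple, Optional


class ModelIdInfo(NamedTuple):
    snapshot_date: Optional[date]  # date encoded in a trailing YYYYMMDD suffix
    parameters: Optional[int]      # parameter count from a token like "32b"


# Trailing "-YYYYMMDD", as in "claude-3-7-sonnet-20250219".
_DATE_SUFFIX = re.compile(r"-(\d{4})(\d{2})(\d{2})$")
# A dash-delimited token such as "32b" or "7.5b" (billions of parameters).
_SIZE_TOKEN = re.compile(r"(?:^|-)(\d+(?:\.\d+)?)b(?:-|$)", re.IGNORECASE)


def parse_model_id(model_id: str) -> ModelIdInfo:
    """Extract the date suffix and size indicator from an identifier, if present."""
    snapshot = None
    match = _DATE_SUFFIX.search(model_id)
    if match:
        try:
            snapshot = date(int(match[1]), int(match[2]), int(match[3]))
        except ValueError:
            snapshot = None  # eight digits that do not form a valid calendar date

    parameters = None
    match = _SIZE_TOKEN.search(model_id)
    if match:
        parameters = round(float(match[1]) * 1_000_000_000)

    return ModelIdInfo(snapshot, parameters)


def approx_weight_gb(parameters: int, bytes_per_parameter: int = 2) -> float:
    """Rough size of the weights in GB, assuming 16-bit weights by default."""
    return parameters * bytes_per_parameter / 1e9


print(parse_model_id("claude-3-7-sonnet-20250219").snapshot_date)  # 2025-02-19
size = parse_model_id("exaone-4.0-32b").parameters
print(size, approx_weight_gb(size))  # 32000000000 64.0
```

The weight estimate gives a sense of why larger models need more resources: under the 16-bit assumption, 32 billion parameters occupy roughly 64 GB before any runtime overhead.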