🧠 AI Reasoning Models Compared (2025)

Compare 1,306 reasoning models across 95 providers. Find the best chain-of-thought model for math, science, coding, and complex analysis.

1,306 Reasoning Models
95 Providers
81 Free
527 Open Weights
💡 What is a reasoning model? Reasoning models (like OpenAI o1/o3, DeepSeek R1, Claude with extended thinking) use chain-of-thought to break complex problems into steps. They excel at math, science, coding, and multi-step logic, but often cost more and run slower than standard models.

🏆 Flagship Reasoning Models: Head to Head

The top reasoning models compared side by side.

| Model | Provider | Input $/1M | Output $/1M | Context | Tool Call |
| --- | --- | --- | --- | --- | --- |
| o3 | openai | $10 | $40 | 200K | ✅ |
| o3-mini | openai | $1.1 | $4.4 | 200K | ✅ |
| o4-mini | openai | $1.1 | $4.4 | 200K | ✅ |
| o1 | openai | $15 | $60 | 200K | ✅ |
| o1-mini | openai | $1.5 | $6 | 128K | ✅ |
| o1-pro | openai | $150 | $600 | 200K | ✅ |
| deepseek-r1-distill-llama-70b | cerebras | Free | Free | 131K | |
| gemini-2.5-pro | deepinfra | $1.25 | $10 | 1M | |
| gemini-2.5-flash | deepinfra | $0.3 | $2.5 | 1M | |
| qwen3-235b-a22b | alibaba | $2 | $8 | ? | ✅ |
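Per-million-token prices are easier to compare once they are turned into a per-request figure. The sketch below is illustrative only: the `Pricing` class and `request_cost` function are hypothetical names, the prices are copied from the table above, and it assumes hidden reasoning tokens are billed at the output rate, which should be checked against each provider's billing documentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Prices copied from the flagship table; extend as needed.
PRICES = {
    "o3": Pricing(10.0, 40.0),
    "o3-mini": Pricing(1.1, 4.4),
    "o1-pro": Pricing(150.0, 600.0),
}


def request_cost(model, input_tokens, output_tokens, reasoning_tokens=0):
    """Estimate the USD cost of one request.

    Assumption: reasoning tokens are billed like output tokens.
    """
    price = PRICES[model]
    billed_output = output_tokens + reasoning_tokens
    total = (input_tokens * price.input_per_million
             + billed_output * price.output_per_million)
    return total / 1_000_000


# 2,000 prompt tokens, 500 visible answer tokens, 3,000 reasoning tokens on o3.
print(f"${request_cost('o3', 2_000, 500, 3_000):.4f}")  # prints $0.1600
```

In this example the reasoning tokens account for most of the bill, which is why a reasoning model can cost noticeably more per request than its list price suggests.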

💰 Cheapest Reasoning Models

Reasoning on a budget: the most affordable models with reasoning capability.

| Model | Provider | Input $/1M | Output $/1M | Context | Tool Call |
| --- | --- | --- | --- | --- | --- |
| qwen3.5-0.8b | deepinfra | $0.01 | $0.05 | 262K | |
| qwen3.5-2b | deepinfra | $0.02 | $0.1 | 262K | |
| gpt-oss-20b | deepinfra | $0.03 | $0.14 | 131K | |
| qwen3.5-4b | deepinfra | $0.03 | $0.15 | 262K | |
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? | ✅ |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K | ✅ |
| gpt-oss-120b | deepinfra | $0.039 | $0.19 | 131K | |
| nvidia-nemotron-nano-9b-v2 | deepinfra | $0.04 | $0.16 | 131K | |
| openai--gpt-oss-20b | novitaai | $0.04 | $0.15 | 131K | |
| nemotron-3-nano-30b-a3b | deepinfra | $0.05 | $0.2 | 262K | |
| gpt-oss-120b | inferencenet | $0.05 | $0.45 | 131K | ✅ |
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? | ✅ |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K | ✅ |
| qwen3-30b-a3b-fp8 | cloudflare | $0.051 | $0.335 | 40K | ✅ |
| glm-4.7-flash | cloudflare | $0.06 | $0.4 | 131K | ✅ |

🆓 Free Reasoning Models

33 reasoning models at zero cost, perfect for learning and prototyping.

| Model | Provider | Context | Tool Call |
| --- | --- | --- | --- |
| deepseek--deepseek-v4-flash--free | openrouter | 1M | ✅ |
| nvidia--nemotron-3-super-120b-a12b--free | openrouter | 1M | ✅ |
| gemma-4-26b-a4b-it | auriko | 262K | ✅ |
| gemma-4-31b-it | auriko | 262K | ✅ |
| arcee-ai--trinity-large-thinking--free | openrouter | 262K | ✅ |
| google--gemma-4-26b-a4b-it--free | openrouter | 262K | ✅ |
| google--gemma-4-31b-it--free | openrouter | 262K | ✅ |
| nvidia--nemotron-3-nano-omni-30b-a3b-reasoning--free | openrouter | 256K | ✅ |
| minimax--minimax-m2.5--free | openrouter | 204K | ✅ |
| z-ai--glm-5.1 | openrouter | 202K | ✅ |

🔓 Open-Weight Reasoning Models

120 reasoning models you can run locally for full privacy and zero API costs.

| Model | Provider | Context | Tool Call |
| --- | --- | --- | --- |
| xiaomi--mimo-v2.5-pro | hpc-ai | 1M | ✅ |
| xiaomi--mimo-v2.5 | hpc-ai | 1M | ✅ |
| deepseek--deepseek-v4-flash | hpc-ai | 1M | ✅ |
| deepseek--deepseek-v4-pro | hpc-ai | 1M | ✅ |
| DeepSeek-V4-Pro | nebius | 1M | ✅ |
| trinity-large-thinking | arcee | 262K | ✅ |
| qwen3-next-80b-a3b-thinking | clarifai | 262K | ✅ |
| gemma-4-26b-a4b-it | cloudflare | 262K | ✅ |
| kimi-k2.5 | cloudflare | 262K | ✅ |
| kimi-k2.6 | cloudflare | 262K | ✅ |

🔧 Reasoning + Tool Calling

Models with both reasoning and tool calling: the most capable for agentic workflows that need complex planning.

| Model | Provider | Input $/1M | Output $/1M | Context |
| --- | --- | --- | --- | --- |
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K |
| gpt-oss-120b | inferencenet | $0.05 | $0.45 | 131K |
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K |
| qwen3-30b-a3b-fp8 | cloudflare | $0.051 | $0.335 | 40K |
| glm-4.7-flash | cloudflare | $0.06 | $0.4 | 131K |
| Nemotron-3-Nano-Omni | nebius | $0.06 | $0.24 | 128K |
| hermes-4-llama-3.1-8b | nousresearch | $0.06 | $0.12 | 131K |
| seed-1.6-flash | bytedance | $0.07 | $0.3 | 262K |
| ring-2.6-1t | inclusionai | $0.07 | $0.62 | 262K |
| zai-org--glm-4.7-flash | novitaai | $0.07 | $0.4 | 200K |
| microsoft-phi-4-mini-reasoning | microsoft | $0.075 | $0.3 | 128K |
| Qwen--Qwen3-32B-TEE | chutes | $0.08 | $0.24 | 40K |
| gpt-oss-120b | clarifai | $0.09 | $0.36 | 131K |
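A shortlist like the one above can be reproduced from any catalog export by keeping only rows that carry both capability flags and sorting by price. This is a rough sketch, not the catalog's actual code: the `Model` record and `agentic_candidates` function are hypothetical, the sample rows are taken from the tables on this page, and a model with no tool-call mark is treated as not supporting tool calls.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    name: str
    provider: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    reasoning: bool
    tool_call: bool      # assumption: unmarked in the table means False


CATALOG = [
    Model("qwen3.5-0.8b", "deepinfra", 0.01, 0.05, True, False),
    Model("glm-4.7-flash", "cloudflare", 0.06, 0.4, True, True),
    Model("gpt-oss-120b", "inferencenet", 0.05, 0.45, True, True),
    Model("qwen--qwen3-4b-fp8", "novitaai", 0.03, 0.03, True, True),
]


def agentic_candidates(catalog, max_output_price: Optional[float] = None):
    """Return reasoning + tool-calling models, cheapest input price first."""
    picks = [m for m in catalog if m.reasoning and m.tool_call]
    if max_output_price is not None:
        picks = [m for m in picks if m.output_price <= max_output_price]
    # Ties on input price are broken by output price.
    return sorted(picks, key=lambda m: (m.input_price, m.output_price))


for m in agentic_candidates(CATALOG, max_output_price=0.4):
    print(m.name, m.provider, m.input_price, m.output_price)
```

Sorting on input price mirrors the table's ordering; for output-heavy reasoning workloads it may make more sense to sort on output price instead.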

📏 Large Context Reasoning Models

Reasoning models with 128K+ context, for analyzing long documents, large codebases, and complex multi-step problems.

| Model | Provider | Context | Input $/1M | Tool Call |
| --- | --- | --- | --- | --- |
| qwen3.5-0.8b | deepinfra | 262K | $0.01 | |
| qwen3.5-2b | deepinfra | 262K | $0.02 | |
| gpt-oss-20b | deepinfra | 131K | $0.03 | |
| qwen3.5-4b | deepinfra | 262K | $0.03 | |
| qwen--qwen3-4b-fp8 | novitaai | 128K | $0.03 | ✅ |
| gpt-oss-120b | deepinfra | 131K | $0.039 | |
| nvidia-nemotron-nano-9b-v2 | deepinfra | 131K | $0.04 | |
| openai--gpt-oss-20b | novitaai | 131K | $0.04 | |
| nemotron-3-nano-30b-a3b | deepinfra | 262K | $0.05 | |
| gpt-oss-120b | inferencenet | 131K | $0.05 | ✅ |
| openai--gpt-oss-120b | novitaai | 131K | $0.05 | ✅ |
| glm-4.7-flash | cloudflare | 131K | $0.06 | ✅ |
| glm-4.7-flash | deepinfra | 202K | $0.06 | |
| Nemotron-3-Nano-Omni | nebius | 128K | $0.06 | ✅ |
| hermes-4-llama-3.1-8b | nousresearch | 131K | $0.06 | ✅ |
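Context sizes on this page are shown as labels such as `131K`, `1M`, or `?`, so applying the 128K+ cutoff programmatically needs a small parser first. The sketch below is one possible approach under stated assumptions: `parse_context` and `has_large_context` are hypothetical helpers, `K` is read as 1,000 tokens and `M` as 1,000,000 (the underlying values may be powers of two), and `?` is treated as unknown and therefore excluded.

```python
from typing import Optional

# Assumption: labels are rounded decimal thousands/millions of tokens.
MULTIPLIERS = {"K": 1_000, "M": 1_000_000}


def parse_context(label: str) -> Optional[int]:
    """Convert a label like '262K' or '1M' to a token count; None if unknown."""
    label = label.strip().upper()
    if not label or label == "?":
        return None
    suffix = label[-1]
    if suffix in MULTIPLIERS:
        return int(float(label[:-1]) * MULTIPLIERS[suffix])
    return int(label)


def has_large_context(label: str, minimum: int = 128_000) -> bool:
    """True when the context is known and at least `minimum` tokens."""
    tokens = parse_context(label)
    return tokens is not None and tokens >= minimum


for label in ["262K", "128K", "40K", "1M", "?"]:
    print(label, parse_context(label), has_large_context(label))
```

Treating unknown contexts as failing the filter is a conservative choice; a catalog could equally list them separately rather than drop them.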

📊 Methodology

All data is sourced from first-party APIs. Reasoning capability is defined by the provider's own classification: models that use chain-of-thought, extended thinking, or similar techniques. Aggregator providers are excluded from ranking tables to avoid duplicate models.
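The two selection rules described above (provider-declared reasoning support, aggregators left out of rankings) amount to a simple filter. The following is a minimal sketch under assumptions, not the catalog's real pipeline: the row fields, the `ranking_rows` function, and the provider names in `AGGREGATORS` and `rows` are all hypothetical placeholders.

```python
# Hypothetical set of aggregator provider ids; the real list is not given here.
AGGREGATORS = {"example-aggregator"}


def ranking_rows(rows, aggregators=AGGREGATORS):
    """Keep provider-classified reasoning models from non-aggregator providers."""
    return [
        row for row in rows
        if row["reasoning"] and row["provider"] not in aggregators
    ]


# Illustrative rows only; names and flags are made up for the example.
rows = [
    {"model": "model-a", "provider": "first-party-host", "reasoning": True},
    {"model": "model-a", "provider": "example-aggregator", "reasoning": True},
    {"model": "model-b", "provider": "first-party-host", "reasoning": False},
]

for row in ranking_rows(rows):
    print(row["model"], row["provider"])  # prints: model-a first-party-host
```

Filtering by provider rather than by model name keeps the same model listed once per direct host while dropping resold copies.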

🔗 More Resources

Small Language Models

🎯 AI Model Picker

⚡ GitHub Action