527 open-weight LLMs compared โ pricing, context windows, tool calling, reasoning, and vision capabilities
๐ Interactive Catalog โญ Star on GitHubThe most capable open-weight models available today, from leading AI labs:
| Model | Provider | Context | Tool Call | Reasoning | Price (in/out per 1M) |
|---|---|---|---|---|---|
llama-4-maverick |
Meta | 1M | โ | โ | Varies |
llama-4-scout |
Meta | 10M | โ | โ | Varies |
deepseek-r1 |
DeepSeek | 128K | โ | โ | Varies |
deepseek-v3 |
DeepSeek | 128K | โ | โ | Varies |
qwen3-235b-a22b |
Alibaba | 128K | โ | โ | Varies |
qwen3-32b |
Alibaba | 128K | โ | โ | Varies |
llama-3.3-70b-instruct |
Meta | 128K | โ | โ | Varies |
gemma-3-27b-it |
128K | โ | โ | Free | |
phi-4 |
Microsoft | 16K | โ | โ | Varies |
command-a |
Cohere | 256K | โ | โ | Varies |
mistral-large-2411 |
Mistral | 128K | โ | โ | Varies |
81 open-weight models you can use for free through their provider APIs. These are ideal for prototyping, testing, and learning:
| Model | Provider | Context | Tool Call | Reasoning |
|---|---|---|---|---|
gemma-3-27b-it |
128K | โ | โ | |
gemma-3-12b-it |
128K | โ | โ | |
gemma-3-4b-it |
128K | โ | โ | |
gemma-3-1b-it |
128K | โ | โ | |
qwen3-235b-a22b |
Alibaba | 128K | โ | โ |
qwen3-30b-a3b |
Alibaba | 128K | โ | โ |
qwen3-32b |
Alibaba | 128K | โ | โ |
qwen3-14b |
Alibaba | 128K | โ | โ |
qwen3-8b |
Alibaba | 128K | โ | โ |
qwen3-4b |
Alibaba | 128K | โ | โ |
qwen3-1.7b |
Alibaba | 128K | โ | โ |
qwen3-0.6b |
Alibaba | 128K | โ | โ |
llama-4-maverick |
Meta | 1M | โ | โ |
llama-4-scout |
Meta | 10M | โ | โ |
llama-3.3-70b-instruct |
Meta | 128K | โ | โ |
โ See all 81 free AI models (including non-open-weight)
375 open-weight models support tool/function calling โ essential for AI agents and agentic workflows:
โ See all 2,350 tool-calling models
231 open-weight models with reasoning capabilities โ these can "think step by step" for complex tasks:
| Model | Provider | Context | Tool Call | Key Strength |
|---|---|---|---|---|
deepseek-r1 |
DeepSeek | 128K | โ | Best open-weight reasoning, rivals o1 |
qwen3-235b-a22b |
Alibaba | 128K | โ | MoE architecture, thinking mode |
qwen3-32b |
Alibaba | 128K | โ | Dense reasoning, strong benchmarks |
qwen3-30b-a3b |
Alibaba | 128K | โ | Lightweight MoE reasoning |
qwen3-14b |
Alibaba | 128K | โ | Mid-size reasoning model |
qwen3-8b |
Alibaba | 128K | โ | Small but capable reasoning |
โ See all 1,306 reasoning models
269 open-weight models can process images alongside text โ useful for document analysis, visual Q&A, and multimodal applications:
โ See all 1,487 vision models
Open-weight models with the largest context windows โ essential for processing long documents, codebases, and multi-turn conversations:
| Model | Provider | Context Window | Tool Call | Reasoning |
|---|---|---|---|---|
llama-4-scout |
Meta | 10M | โ | โ |
llama-4-maverick |
Meta | 1M | โ | โ |
command-a |
Cohere | 256K | โ | โ |
deepseek-r1 |
DeepSeek | 128K | โ | โ |
deepseek-v3 |
DeepSeek | 128K | โ | โ |
qwen3-235b-a22b |
Alibaba | 128K | โ | โ |
llama-3.3-70b-instruct |
Meta | 128K | โ | โ |
gemma-3-27b-it |
128K | โ | โ | |
mistral-large-2411 |
Mistral | 128K | โ | โ |
โ See all models with context window comparison
gemma-3-27b-it (Google,
free) or qwen3-32b (Alibaba, free)
llama-4-maverick (1M context + tool
calling) or deepseek-r1 (reasoning + tools)
llama-4-scout (10M context)
or llama-4-maverick (1M context)
deepseek-r1 (best open-weight
reasoning) or qwen3-235b-a22b
llama-4-maverick or
gemma-3-27b-it
qwen3-0.6b or
gemma-3-1b-it (smallest open-weight)
command-a (256K context, optimized
for RAG + tools)
| Aspect | Open Weights | Proprietary |
|---|---|---|
| Self-hosting | โ Run on your own hardware | โ Cloud API only |
| Data privacy | โ Full control over data | โ Data sent to provider |
| Customization | โ Fine-tune on your data | โ Limited (prompt-based) |
| Cost at scale | โ Fixed infra cost | โ Per-token pricing |
| Latest capabilities | ~3โ6 months behind | โ Cutting-edge |
| Convenience | Requires infra setup | โ Instant API access |