🏒 AI Models by Provider β€” All 95 Providers Listed

Browse 4,587 AI models across 95 providers. First-party data with real pricing, context windows, and capabilities.

95Providers
4,587Models
81Free Models
527Open Weights
πŸ” Interactive Catalog ⭐ Star on GitHub

πŸ“Š Provider Overview

All 95 providers sorted by number of models. Click a provider to see their models.

Provider Models Cheapest Input $/1M Max Context Tool Call Free
nanogpt (aggregator) 547 Aggregator ? 0
aihubmix (aggregator) 476 Aggregator ? 132
openrouter (aggregator) 356 Aggregator 10M 263 βœ…
martian (aggregator) 304 Aggregator ? 0
requesty (aggregator) 277 Aggregator 1M 251
302ai (aggregator) 268 Aggregator 2M 190
auriko (aggregator) 181 Aggregator 1M 154 βœ…
llmgateway (aggregator) 163 Aggregator ? 158 βœ…
aimlapi 147 $0.007 2M 21 βœ…
fastrouter (aggregator) 120 Aggregator 2M 94 βœ…
orcarouter (aggregator) 120 Aggregator 1M 102
cortecs (aggregator) 105 Aggregator ? 97
novitaai 104 $0.02 1M 72 βœ…
vultr 98 $0.55 1M 11
deepinfra 88 $0.01 1M 0
venice (aggregator) 75 Aggregator 2M 64
jiekou (aggregator) 73 Aggregator 2M 73
meganova (aggregator) 63 Aggregator 1M 60 βœ…
alibaba 62 $0.15 1M 62
ppio 60 $0.2145 1M 46 βœ…
amazon-bedrock 57 $0.035 1M 37
google-vertex 38 $0.07 1M 32
siliconflow-cn 37 $0.5 262K 2
stepfun 31 $0.7 256K 0 βœ…
cloudflare 30 $0.017 327K 15
databricks 29 $0.05 200K 4
gmicloud 29 $0.07 1M 11
openai 28 $0.02 1M 18
siliconflow 27 $0.04 1M 24
togetherai 24 $0.03 262K 22
nebius 23 $0.02 1M 21
google 21 $0.075 2M 8 βœ…
minimax 21 $2.1 204K 0
voyage 21 $0.02 ? 0 βœ…
digitalocean 20 $0.05 1M 14
inferencenet 20 $0.01 131K 15
zhipuai 20 $0.1 1M 20 βœ…
tencent-tokenhub 19 $1 1M 16
mistral 16 $0.04 256K 12 βœ…
moonshotai 16 $2 262K 0
neuralwatt 14 $0.03 ? 14
tencent 14 $0.5 250K 3 βœ…
scaleway 13 $0.15 131K 6
chutes 12 $0.08 262K 12
clarifai 12 $0.09 1M 9
cloudferro-sherlock 12 $0.26 1M 5
groq 12 $0.05 131K 8
klusterai 12 $0.008 1M 4
meta 12 $0.1 10M 9
microsoft 12 $0.075 128K 6
ovhcloud 12 $0.05 262K 0
anthropic 11 $1 1M 11
baichuan 11 $0.98 131K 0 βœ…
cerebras 11 $0.1 131K 9 βœ…
hpc-ai 11 $0.14 1M 11
hyperbolic 11 $0.1 163K 0
fireworks 10 $0.07 1M 10
baseten 9 $0.1 1M 9
baidu 8 $0.126 1M 7 βœ…
evroc 8 $0.1 131K 3
friendli 8 $0.1 262K 8
upstage 8 $0.1 128K 3
amazon 7 $0.035 1M 7
arcee 7 $0.04 262K 6 βœ…
berget 7 $0.2 ? 7
morph 7 $0.2 1M 5
nousresearch 7 $0.06 131K 7
sambanova 7 $0.22 196K 0
dinference 6 $0.07 204K 3
iflytek 6 $0.8 262K 0 βœ…
submodel 6 $0.1 262K 0
textsynth 6 $0.2 131K 0
writer 6 $0.6 1M 3
xai 6 $0.2 131K 6
01ai 5 $1 32K 4
aion 5 $0.7 131K 0
bytedance 5 $0.07 262K 4
inception 5 $0.25 128K 3
mixlayer 5 $0.1 131K 5 βœ…
privatemode 5 $0.43 131K 3
xiaomi 5 $0.1 1M 5
deepseek 4 $0.14 1M 4
perplexity 4 $1 200K 4
inclusionai 3 $0.01 262K 3
ai21 2 $0.2 256K 0
reka 2 $0.03 131K 1
wafer 2 $0.6 262K 2

🏒 OpenAI

GPT-4, GPT-4o, o1, o3 β€” the industry standard for LLMs. 28 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
text-embedding-3-small $0.02 $0 8K
gpt-4.1-nano $0.1 $0.4 1M βœ…
text-embedding-ada-002 $0.1 $0 8K
text-embedding-3-large $0.13 $0 8K
gpt-4o-mini $0.15 $0.6 128K βœ…
gpt-4.1-mini $0.4 $1.6 1M βœ…
gpt-3.5-turbo $0.5 $1.5 16K βœ…
o3-mini $1.1 $4.4 200K βœ… βœ…
o4-mini $1.1 $4.4 200K βœ… βœ…
codex-mini $1.5 $6 192K βœ…
o1-mini $1.5 $6 128K βœ… βœ…
gpt-4.1 $2 $8 1M βœ…
gpt-4o-audio $2.5 $10 128K βœ…
gpt-4o $2.5 $10 128K βœ…
gpt-3.5-turbo-16k $3 $4 16K βœ…
gpt-4o-realtime $5 $20 128K βœ…
gpt-4-turbo $10 $30 128K βœ…
o3 $10 $40 200K βœ… βœ…
o1-realtime $15 $60 200K βœ… βœ…
o1 $15 $60 200K βœ… βœ…
gpt-4 $30 $60 8K βœ…
gpt-4-32k $60 $120 32K
o1-pro $150 $600 200K βœ… βœ…
dall-e-2 $? $? ?
dall-e-3 $? $? ?
tts-1-hd $? $? ?
tts-1 $? $? ?
whisper-1 $? $? ?

🏒 Anthropic

Claude β€” known for safety, reasoning, and long context. 11 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
claude-haiku-4-5 $1 $5 200K βœ… βœ…
claude-sonnet-4-0 $3 $15 1M βœ… βœ…
claude-sonnet-4-5 $3 $15 1M βœ… βœ…
claude-sonnet-4-6 $3 $15 1M βœ… βœ…
claude-opus-4-5 $5 $25 200K βœ… βœ…
claude-opus-4-6 $5 $25 1M βœ… βœ…
claude-opus-4-7 $5 $25 1M βœ… βœ…
claude-opus-4-0 $15 $75 200K βœ… βœ…
claude-opus-4-1 $15 $75 200K βœ… βœ…
claude-opus-4-6-fast $30 $150 1M βœ… βœ…
claude-opus-4-7-fast $30 $150 1M βœ… βœ…

🏒 Google

Gemini β€” multimodal models with massive context windows. 21 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
gemini-1.5-flash-8b $0.075 $0.3 1M βœ…
gemini-1.5-flash $0.075 $0.3 1M βœ…
gemini-2.0-flash-lite $0.075 $0.3 1M βœ…
gemini-2.0-flash $0.1 $0.4 1M βœ…
gemini-2.5-flash-lite $0.1 $0.4 1M βœ…
gemini-2.5-flash $0.15 $3.5 1M βœ… βœ…
gemini-1.5-pro $1.25 $5 2M βœ…
gemini-2.5-pro $1.25 $10 1M βœ… βœ…
chirp-3.0-HD $? $? ?
gemma-3-12b-it Free 131K
gemma-3-1b-it Free 131K
gemma-3-27b-it Free 131K
gemma-3-4b-it Free 131K
gemma-3n-E2B-it Free 131K
gemma-3n-E4B-it Free 131K
imagen-3.0-fast-generate $? $? ?
imagen-3.0-generate $? $? ?
imagen-4.0-fast-generate $? $? ?
imagen-4.0-generate $? $? ?
lyria-2.0 $? $? ?
veo-2.0-generate $? $? ?

🏒 Meta

Llama β€” open-weight models you can run anywhere. 12 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
meta-llama-3.2-1b $0.1 $0.1 128K
meta-llama-3.2-3b $0.15 $0.15 128K
meta-llama-3.2-11b-vision $0.16 $0.16 128K βœ…
meta-llama-4-scout $0.17 $0.66 10M βœ…
meta-llama-3.1-8b $0.22 $0.22 128K βœ…
meta-llama-4-maverick $0.24 $0.97 1M βœ…
meta-llama-3-8b $0.3 $0.6 8K
meta-llama-3.1-70b $0.72 $0.72 128K βœ…
meta-llama-3.2-90b-vision $0.72 $0.72 128K βœ…
meta-llama-3.3-70b $0.72 $0.72 128K βœ…
meta-llama-3.1-405b $2.4 $2.4 128K βœ…
meta-llama-3-70b $2.65 $3.5 8K βœ…

🏒 DeepSeek

High-performance reasoning at competitive prices. 4 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
deepseek-chat $0.14 $0.28 1M βœ…
deepseek-reasoner $0.14 $0.28 1M βœ… βœ…
deepseek-v4-flash $0.14 $0.28 1M βœ… βœ…
deepseek-v4-pro $0.435 $0.87 1M βœ… βœ…

🏒 Mistral

European AI with open and commercial models. 16 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
ministral-3b $0.04 $0.04 128K βœ…
voxtral-mini $0.04 $0.04 128K
ministral-8b $0.1 $0.1 128K βœ…
voxtral-small $0.1 $0.3 128K
mistral-7b $0.15 $0.2 32K
mistral-nemo $0.15 $0.15 128K βœ…
mistral-small $0.2 $0.6 128K βœ…
mistral-medium $0.4 $2 128K βœ…
mixtral-8x7b $0.45 $0.7 32K βœ…
magistral-small $0.5 $1.5 128K βœ… βœ…
mixtral-8x22b $0.8 $1.2 64K βœ…
mistral-large $2 $6 128K βœ…
pixtral-large $2 $6 128K βœ…
mistral-large-2407 $4 $12 128K βœ…
codestral Free 256K
devstral Free 128K βœ…

🏒 xAI

Grok β€” models with real-time knowledge. 6 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
xai-grok-4-fast $0.2 $0.5 131K βœ…
xai-grok-4.1 $0.2 $0.5 131K βœ… βœ…
xai-grok-3-mini $0.25 $1.27 131K βœ… βœ…
xai-grok-4.2 $2 $6 131K βœ… βœ…
xai-grok-3 $3 $15 131K βœ… βœ…
xai-grok-4 $3 $15 131K βœ… βœ…

🏒 AWS Bedrock

Managed access to multiple foundation models. 57 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
amazon-nova-micro $0.035 $0.14 128K βœ…
google-gemma-3-4b $0.04 $0.08 131K
mistral-voxtral-mini $0.04 $0.04 128K
amazon-nova-lite $0.06 $0.24 300K βœ…
nvidia-nemotron-nano-2 $0.06 $0.23 4K
nvidia-nemotron-nano-3-30b $0.06 $0.24 4K
openai-gpt-oss-20b $0.07 $0.3 131K βœ…
openai-gpt-oss-safeguard-20b $0.07 $0.2 131K βœ…
zai-glm-4-7-flash $0.07 $0.4 131K βœ…
google-gemma-3-12b $0.09 $0.29 131K
meta-llama-3-2-1b $0.1 $0.1 128K
mistral-ministral-3b $0.1 $0.1 128K
mistral-voxtral-small $0.1 $0.3 128K
meta-llama-3-2-3b $0.15 $0.15 128K
mistral-ministral-8b $0.15 $0.15 128K
mistral-mistral-7b $0.15 $0.2 32K
nvidia-nemotron-3-super-120b $0.15 $0.65 4K
openai-gpt-oss-120b $0.15 $0.6 131K βœ…
openai-gpt-oss-safeguard-120b $0.15 $0.6 131K βœ…
qwen-qwen3-32b $0.15 $0.6 131K βœ…
qwen-qwen3-coder-30b-a3b $0.15 $0.6 131K βœ…
writer-palmyra-vision-7b $0.15 $0.6 8K
meta-llama-3-2-11b $0.16 $0.16 128K βœ…
meta-llama-4-scout-17b $0.17 $0.66 1M βœ…
mistral-ministral-14b $0.2 $0.2 128K
nvidia-nemotron-nano-2-vl $0.2 $0.6 4K
meta-llama-3-1-8b $0.22 $0.22 128K βœ…
google-gemma-3-27b $0.23 $0.38 131K
meta-llama-4-maverick-17b $0.24 $0.97 1M βœ…
meta-llama-3-8b $0.3 $0.6 8K
minimax-m2-1 $0.3 $1.2 1M βœ…
minimax-m2-5 $0.3 $1.2 1M βœ…
minimax-m2 $0.3 $1.2 1M βœ…
amazon-nova-2-lite $0.33 $2.75 64K βœ…
mistral-devstral $0.4 $2 128K βœ…
mistral-mixtral-8x7b $0.45 $0.7 32K
mistral-magistral-small $0.5 $1.5 128K βœ…
mistral-mistral-large-3 $0.5 $1.5 128K βœ…
qwen-qwen3-coder-next $0.5 $1.2 131K βœ…
qwen-qwen3-vl-235b-a22b $0.53 $2.66 131K βœ…
kimi-k2-thinking $0.6 $2.5 131K βœ…
moonshot-kimi-k2-5 $0.6 $3 131K βœ…
zai-glm-4-7 $0.6 $2.2 131K βœ…
deepseek-v3-2 $0.62 $1.85 65K βœ…
meta-llama-3-1-70b $0.72 $0.72 128K βœ…
meta-llama-3-2-90b $0.72 $0.72 128K βœ…
meta-llama-3-3-70b $0.72 $0.72 128K βœ…
amazon-nova-pro $0.8 $3.2 300K βœ…
meta-llama-3-1-70b-latency-optimized $0.9 $0.9 128K βœ…
amazon-nova-pro-latency-optimized $1 $4 300K βœ…
mistral-mistral-small $1 $3 128K βœ…
zai-glm-5 $1 $3.2 131K βœ…
deepseek-r1 $1.35 $5.4 65K
mistral-pixtral-large $2 $6 128K βœ…
amazon-nova-premier $2.5 $12.5 1M βœ…
meta-llama-3-70b $2.65 $3.5 8K
mistral-mistral-large $4 $12 128K βœ…

🏒 Groq

Ultra-fast inference with LPU hardware. 12 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
llama-3.1-8b-instant $0.05 $0.08 131K βœ…
gpt-oss-20b $0.075 $0.3 131K βœ…
gpt-oss-safeguard-20b $0.075 $0.3 131K βœ…
llama-4-scout-17b-16e-instruct $0.11 $0.34 131K βœ…
gpt-oss-120b $0.15 $0.6 131K βœ…
qwen3-32b $0.29 $0.59 131K βœ…
llama-3.3-70b-versatile $0.59 $0.79 131K βœ…
kimi-k2-instruct-0905 $1 $3 131K βœ…
orpheus-ar-sa $? $? ?
orpheus-en $? $? ?
whisper-large-v3-turbo $? $? ?
whisper-large-v3 $? $? ?

🏒 Together AI

Open-weight model hosting platform. 24 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
liquid-ai--LFM2-24B-A2B $0.03 $0.12 131K βœ…
openai--gpt-oss-20b $0.05 $0.2 131K βœ…
google--gemma-3n-E4B-it $0.06 $0.12 131K
Qwen--Qwen3.5-9B $0.1 $0.15 131K βœ…
meta-llama--Meta-Llama-3.1-8B-Instruct-Lite $0.1 $0.1 131K βœ…
essential-ai--Rnj-1-Instruct $0.15 $0.15 131K
openai--gpt-oss-120b $0.15 $0.6 131K βœ…
Qwen--Qwen3-235B-A22B-FP8-Throughput $0.2 $0.6 131K βœ…
MiniMaxAI--MiniMax-M2.5 $0.3 $1.2 131K βœ…
MiniMaxAI--MiniMax-M2.7 $0.3 $1.2 131K βœ…
Qwen--Qwen2.5-7B-Instruct-Turbo $0.3 $0.3 131K βœ…
google--gemma-4-31B-it $0.39 $0.97 131K βœ…
Qwen--Qwen3-Coder-Next $0.5 $1.2 131K βœ…
Qwen--Qwen3.6-Plus $0.5 $3 131K βœ…
moonshotai--Kimi-K2.5 $0.5 $2.8 131K βœ…
Qwen--Qwen3.5-397B-A17B $0.6 $3.6 131K βœ…
deepseek-ai--DeepSeek-V3.1 $0.6 $1.7 131K βœ…
meta-llama--Llama-3.3-70B-Instruct-Turbo $0.88 $0.88 131K βœ…
zai-org--GLM-5 $1 $3.2 131K βœ…
moonshotai--Kimi-K2.6 $1.2 $4.5 262K βœ…
cogito-ai--Cogito-v2.1-671B $1.25 $1.25 131K βœ… βœ…
zai-org--GLM-5.1 $1.4 $4.4 131K βœ…
Qwen--Qwen3-Coder-480B-A35B-Instruct $2 $2 131K βœ…
deepseek-ai--DeepSeek-V4-Pro $2.1 $4.4 131K βœ… βœ…

🏒 Fireworks

Fast inference for open-source models. 10 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
gpt-oss-20b $0.07 $0.3 131K βœ…
gpt-oss-120b $0.15 $0.6 131K βœ…
llama4-scout-17b-16e-instruct $0.18 $0.59 131K βœ…
minimax-m2.5 $0.3 $1.2 196K βœ…
minimax-m2.7 $0.3 $1.2 196K βœ…
qwen3.6-plus $0.5 $3 131K βœ…
kimi-k2.5 $0.6 $3 262K βœ…
kimi-k2.6 $0.95 $4 262K βœ…
glm-5.1 $1.4 $4.4 202K βœ…
deepseek-v4-pro $1.74 $3.48 1M βœ… βœ…

🏒 Cerebras

Wafer-scale inference at extreme speed. 11 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
llama3.1-8b $0.1 $0.1 131K βœ…
gpt-oss-120b $0.35 $0.75 131K βœ…
qwen3-235b-instruct $0.6 $1.2 131K βœ…
zai-glm-4.7 $2.25 $2.75 131K βœ…
deepseek-r1-distill-llama-70b Free 131K βœ…
deepseek-r1-distill-llama-8b Free 131K βœ…
llama-3.3-70b Free 131K βœ…
llama-4-scout-17b-16e-instruct Free 131K βœ…
qwen-2.5-32b Free 131K βœ…
qwen-2.5-coder-32b Free 131K βœ…
qwen3-32b Free 131K βœ…

🏒 Databricks

DBRX and enterprise AI models. 29 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
databricks-gpt-5-nano $0.05 $0.4 200K
databricks-gpt-oss-20b $0.07 $0.3 131K
databricks-gemma-3-12b $0.15 $0.5 131K
databricks-gpt-oss-120b $0.15 $0.6 131K
databricks-meta-llama-3-1-8b-instruct $0.15 $0.45 131K βœ…
databricks-qwen3-next-80b-a3b-instruct $0.15 $1.2 131K βœ…
databricks-gpt-5-4-nano $0.2 $1.25 128K
databricks-gemini-3-1-flash-lite $0.25 $1.5 128K
databricks-gpt-5-1-codex-mini $0.25 $2 200K
databricks-gpt-5-mini $0.25 $2 200K
databricks-gemini-2-5-flash $0.3 $2.5 128K
databricks-llama-4-maverick $0.5 $1.5 131K βœ…
databricks-meta-llama-3-3-70b-instruct $0.5 $1.5 131K βœ…
databricks-gemini-3-flash $0.63 $3.75 128K
databricks-gpt-5-4-mini $0.75 $4.5 128K
databricks-claude-haiku-4-5 $1 $5 200K
databricks-gemini-2-5-pro $1.25 $10 128K
databricks-gpt-5-1-codex-max $1.25 $10 200K
databricks-gpt-5-1 $1.25 $10 200K
databricks-gpt-5 $1.25 $10 200K
databricks-gpt-5-2-codex $1.75 $14 200K
databricks-gpt-5-2 $1.75 $14 200K
databricks-gemini-3-1-pro $2.5 $15 128K
databricks-gpt-5-4 $2.5 $15 128K
databricks-claude-sonnet-4-5 $3 $15 200K
databricks-claude-sonnet-4 $3 $15 200K
databricks-claude-opus-4-5 $5 $25 200K
databricks-gpt-5-5 $5 $30 128K
databricks-claude-opus-4-1 $15 $75 200K

🏒 Alibaba (Qwen)

Qwen β€” multilingual models from Alibaba Cloud. 62 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
qwen-flash $0.15 $1.5 ? βœ… βœ…
qwen3.5-flash-2026-02-23 $0.2 $2 1M βœ…
qwen3.5-flash $0.2 $2 1M βœ…
qwen-flash-character $0.25 $1.5 ? βœ… βœ…
qwen-turbo $0.3 $0.6 ? βœ… βœ…
qwen3-0.6b $0.3 $1.2 ? βœ… βœ…
qwen3-1.7b $0.3 $1.2 ? βœ… βœ…
qwen3-4b $0.3 $1.2 ? βœ… βœ…
qwen-omni-turbo $0.4 $25 ? βœ… βœ…
qwen3.5-35b-a3b $0.4 $3.2 256K βœ…
qwen-long-2025-01-25 $0.5 $2 ? βœ… βœ…
qwen-long-latest $0.5 $2 ? βœ… βœ…
qwen-long $0.5 $2 ? βœ… βœ…
qwen2.5-7b-instruct-1m $0.5 $1 ? βœ… βœ…
qwen2.5-7b-instruct $0.5 $1 ? βœ… βœ…
qwen3-8b $0.5 $2 ? βœ… βœ…
qwen-mt-lite $0.6 $1.6 ? βœ… βœ…
qwen2.5-omni-7b $0.6 $38 ? βœ… βœ…
qwen3.5-27b $0.6 $4.8 256K βœ…
qwen-mt-flash $0.7 $1.95 ? βœ… βœ…
qwen-mt-turbo $0.7 $1.95 ? βœ… βœ…
qwen3-30b-a3b-instruct-2507 $0.75 $3 ? βœ… βœ…
qwen3-30b-a3b $0.75 $3 ? βœ… βœ…
qwen-plus-character $0.8 $2 ? βœ… βœ…
qwen-plus $0.8 $2 ? βœ… βœ…
qwen3.5-122b-a10b $0.8 $6.4 256K βœ…
qwen3.5-plus-2026-02-15 $0.8 $4.8 1M βœ…
qwen3.5-plus $0.8 $4.8 1M βœ…
qwen2.5-14b-instruct-1m $1 $3 ? βœ… βœ…
qwen2.5-14b-instruct $1 $3 ? βœ… βœ…
qwen3-14b $1 $4 ? βœ… βœ…
qwen3-coder-flash-2025-07-28 $1 $4 ? βœ… βœ…
qwen3-coder-flash $1 $4 ? βœ… βœ…
qwen3-coder-next $1 $4 ? βœ… βœ…
qwen3-next-80b-a3b-instruct $1 $4 ? βœ… βœ…
qwen2.5-vl-3b-instruct $1.2 $3.6 ? βœ… βœ…
qwen3.5-397b-a17b $1.2 $7.2 256K βœ…
qwen3.6-flash-2026-04-16 $1.2 $7.2 1M βœ…
qwen3.6-flash $1.2 $7.2 1M βœ… βœ…
qwen3-coder-30b-a3b-instruct $1.5 $6 ? βœ… βœ…
qwen-mt-plus $1.8 $5.4 ? βœ… βœ…
qwen2.5-32b-instruct $2 $6 ? βœ… βœ…
qwen2.5-vl-7b-instruct $2 $5 ? βœ… βœ…
qwen3-235b-a22b-instruct-2507 $2 $8 ? βœ… βœ…
qwen3-235b-a22b $2 $8 ? βœ… βœ…
qwen3-32b $2 $8 ? βœ… βœ…
qwen3.6-plus-2026-04-02 $2 $12 1M βœ…
qwen3.6-plus $2 $12 1M βœ… βœ…
qwen-max $2.4 $9.6 ? βœ… βœ…
qwen3-max-2026-01-23 $2.5 $10 ? βœ… βœ…
qwen3-max $2.5 $10 ? βœ… βœ…
qwen-plus-character-ja $3.67 $10.275 ? βœ… βœ…
qwen2.5-72b-instruct $4 $12 ? βœ… βœ…
qwen3-coder-plus-2025-07-22 $4 $16 ? βœ… βœ…
qwen3-coder-plus-2025-09-23 $4 $16 ? βœ… βœ…
qwen3-coder-plus $4 $16 ? βœ… βœ…
qwen3-coder-480b-a35b-instruct $6 $24 ? βœ… βœ…
qwen3-max-2025-09-23 $6 $24 ? βœ… βœ…
qwen3-max-preview $6 $24 ? βœ… βœ…
qwen2.5-vl-32b-instruct $8 $24 ? βœ… βœ…
qwen3.6-max-preview $9 $54 256K βœ… βœ…
qwen2.5-vl-72b-instruct $16 $48 ? βœ… βœ…

🏒 ByteDance

Doubao β€” models from the TikTok parent company. 5 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
seed-1.6-flash $0.07 $0.3 262K βœ… βœ…
seed-2.0-mini $0.1 $0.4 262K βœ… βœ…
ui-tars-1.5-7b $0.1 $0.2 128K
seed-1.6 $0.25 $2 262K βœ… βœ…
seed-2.0-lite $0.25 $2 262K βœ… βœ…

🏒 MiniMax

Chinese AI startup with competitive models. 21 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
M2-her $2.1 $8.4 64K
MiniMax-M2.1 $2.1 $8.4 204K
MiniMax-M2.5 $2.1 $8.4 204K
MiniMax-M2.7 $2.1 $8.4 204K
MiniMax-M2 $2.1 $8.4 204K
MiniMax-M2.1-highspeed $4.2 $16.8 204K
MiniMax-M2.5-highspeed $4.2 $16.8 204K
MiniMax-M2.7-highspeed $4.2 $16.8 204K
MiniMax-Hailuo-02 $? $? ?
MiniMax-Hailuo-2.3-Fast $? $? ?
MiniMax-Hailuo-2.3 $? $? ?
image-01-live $? $? ?
image-01 $? $? ?
music-2.6 $? $? ?
music-cover $? $? ?
speech-02-hd $? $? ?
speech-02-turbo $? $? ?
speech-2.6-hd $? $? ?
speech-2.6-turbo $? $? ?
speech-2.8-hd $? $? ?
speech-2.8-turbo $? $? ?

🏒 Moonshot AI

Kimi β€” long-context Chinese models. 16 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
moonshot-v1-8k-vision-preview $2 $10 8K
moonshot-v1-8k $2 $10 8K
kimi-k2-0711-preview $4 $16 131K
kimi-k2-0905-preview $4 $16 262K
kimi-k2-thinking $4 $16 262K βœ…
kimi-k2.5 $4 $21 262K βœ…
kimi-vl-a3b-thinking $4 $21 131K βœ…
kimi-vl-a3b $4 $21 131K
moonshot-v1-32k-vision-preview $5 $20 32K
moonshot-v1-32k $5 $20 32K
kimi-k2.6-long $6.5 $27 262K βœ…
kimi-k2.6 $6.5 $27 262K βœ…
kimi-k2-thinking-turbo $8 $58 262K βœ…
kimi-k2-turbo-preview $8 $58 262K
moonshot-v1-128k-vision-preview $10 $30 131K
moonshot-v1-128k $10 $30 131K

🏒 StepFun

Step β€” Chinese AI models with strong capabilities. 31 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
step-3.5-flash-2603 $0.7 $2.1 256K
step-3.5-flash $0.7 $2.1 256K
step-2-mini $1 $2 32K
step-3 $1.5 $4 64K
step-1o-turbo-vision $2.5 $8 32K
step-r1-v-mini $2.5 $8 100K
step-1-8k $5 $20 8K
step-1v-8k $5 $20 8K
step-audio-2 $10 $70 ?
stepaudio-2.5-chat $10 $25 ?
stepaudio-2.5-realtime $10 $70 ?
step-1-32k $15 $70 32K
step-1o-vision-32k $15 $70 32K
step-1v-32k $15 $70 32K
step-1o-audio $25 $60 ?
step-2-16k-exp $38 $120 16K
step-2-16k $38 $120 16K
step-1x-edit Free ?
step-1x-medium $? $? ?
step-2x-large Free ?
step-asr-1.1-stream $? $? ?
step-asr-1.1 $? $? ?
step-asr $? $? ?
step-audio-r1.1 Free ?
step-gui Free ?
step-image-edit-2 $? $? ?
step-tts-2 $? $? ?
step-tts-mini $? $? ?
stepaudio-2-asr-pro $? $? ?
stepaudio-2.5-asr $? $? ?
stepaudio-2.5-tts $? $? ?

🏒 Baidu

ERNIE β€” models from China's search giant. 8 models available.

Model Input $/1M Output $/1M Context Tool Call Reasoning
deepseek-v4-flash $0.126 $0.252 1M βœ… βœ…
deepseek-v3.2 $0.252 $0.378 131K βœ… βœ…
minimax-m2.5 $0.27 $1.08 196K βœ… βœ…
qianfan-ocr-fast $0.6799999999999999 $2.81 65K
glm-5 $0.7 $2.24 202K βœ… βœ…
glm-5.1 $0.98 $3.08 202K βœ… βœ…
deepseek-v4-pro $1.521 $3.042 716K βœ… βœ…
cobuddy Free 131K βœ…

πŸ“Š Methodology

All data is sourced from first-party APIs β€” not third-party aggregators. Pricing, context windows, and capabilities are verified against official provider documentation. Aggregator providers (OpenRouter, Requesty, etc.) are labeled as such β€” they provide access to other providers' models.

πŸ”— More Resources

Small Language Models

🎯 AI Model Picker

⚑ GitHub Action