🏢 AI Models by Provider — All 95 Providers Listed

Browse 4,587 AI models across 95 providers. First-party data with real pricing, context windows, and capabilities.

95Providers

4,587Models

81Free Models

527Open Weights

🔍 Interactive Catalog ⭐ Star on GitHub

OpenAI Anthropic Google Meta DeepSeek Mistral xAI AWS Bedrock Groq Together AI Fireworks Cerebras

📊 Provider Overview

All 95 providers sorted by number of models. Click a provider to see their models.

Provider	Models	Cheapest Input $/1M	Max Context	Tool Call	Free
nanogpt (aggregator)	547	Aggregator	?	0
aihubmix (aggregator)	476	Aggregator	?	132
openrouter (aggregator)	356	Aggregator	10M	263	✅
martian (aggregator)	304	Aggregator	?	0
requesty (aggregator)	277	Aggregator	1M	251
302ai (aggregator)	268	Aggregator	2M	190
auriko (aggregator)	181	Aggregator	1M	154	✅
llmgateway (aggregator)	163	Aggregator	?	158	✅
aimlapi	147	$0.007	2M	21	✅
fastrouter (aggregator)	120	Aggregator	2M	94	✅
orcarouter (aggregator)	120	Aggregator	1M	102
cortecs (aggregator)	105	Aggregator	?	97
novitaai	104	$0.02	1M	72	✅
vultr	98	$0.55	1M	11
deepinfra	88	$0.01	1M	0
venice (aggregator)	75	Aggregator	2M	64
jiekou (aggregator)	73	Aggregator	2M	73
meganova (aggregator)	63	Aggregator	1M	60	✅
alibaba	62	$0.15	1M	62
ppio	60	$0.2145	1M	46	✅
amazon-bedrock	57	$0.035	1M	37
google-vertex	38	$0.07	1M	32
siliconflow-cn	37	$0.5	262K	2
stepfun	31	$0.7	256K	0	✅
cloudflare	30	$0.017	327K	15
databricks	29	$0.05	200K	4
gmicloud	29	$0.07	1M	11
openai	28	$0.02	1M	18
siliconflow	27	$0.04	1M	24
togetherai	24	$0.03	262K	22
nebius	23	$0.02	1M	21
google	21	$0.075	2M	8	✅
minimax	21	$2.1	204K	0
voyage	21	$0.02	?	0	✅
digitalocean	20	$0.05	1M	14
inferencenet	20	$0.01	131K	15
zhipuai	20	$0.1	1M	20	✅
tencent-tokenhub	19	$1	1M	16
mistral	16	$0.04	256K	12	✅
moonshotai	16	$2	262K	0
neuralwatt	14	$0.03	?	14
tencent	14	$0.5	250K	3	✅
scaleway	13	$0.15	131K	6
chutes	12	$0.08	262K	12
clarifai	12	$0.09	1M	9
cloudferro-sherlock	12	$0.26	1M	5
groq	12	$0.05	131K	8
klusterai	12	$0.008	1M	4
meta	12	$0.1	10M	9
microsoft	12	$0.075	128K	6
ovhcloud	12	$0.05	262K	0
anthropic	11	$1	1M	11
baichuan	11	$0.98	131K	0	✅
cerebras	11	$0.1	131K	9	✅
hpc-ai	11	$0.14	1M	11
hyperbolic	11	$0.1	163K	0
fireworks	10	$0.07	1M	10
baseten	9	$0.1	1M	9
baidu	8	$0.126	1M	7	✅
evroc	8	$0.1	131K	3
friendli	8	$0.1	262K	8
upstage	8	$0.1	128K	3
amazon	7	$0.035	1M	7
arcee	7	$0.04	262K	6	✅
berget	7	$0.2	?	7
morph	7	$0.2	1M	5
nousresearch	7	$0.06	131K	7
sambanova	7	$0.22	196K	0
dinference	6	$0.07	204K	3
iflytek	6	$0.8	262K	0	✅
submodel	6	$0.1	262K	0
textsynth	6	$0.2	131K	0
writer	6	$0.6	1M	3
xai	6	$0.2	131K	6
01ai	5	$1	32K	4
aion	5	$0.7	131K	0
bytedance	5	$0.07	262K	4
inception	5	$0.25	128K	3
mixlayer	5	$0.1	131K	5	✅
privatemode	5	$0.43	131K	3
xiaomi	5	$0.1	1M	5
deepseek	4	$0.14	1M	4
perplexity	4	$1	200K	4
inclusionai	3	$0.01	262K	3
ai21	2	$0.2	256K	0
reka	2	$0.03	131K	1
wafer	2	$0.6	262K	2

🏢 OpenAI

GPT-4, GPT-4o, o1, o3 — the industry standard for LLMs. 28 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
text-embedding-3-small	$0.02	$0	8K
gpt-4.1-nano	$0.1	$0.4	1M	✅
text-embedding-ada-002	$0.1	$0	8K
text-embedding-3-large	$0.13	$0	8K
gpt-4o-mini	$0.15	$0.6	128K	✅
gpt-4.1-mini	$0.4	$1.6	1M	✅
gpt-3.5-turbo	$0.5	$1.5	16K	✅
o3-mini	$1.1	$4.4	200K	✅	✅
o4-mini	$1.1	$4.4	200K	✅	✅
codex-mini	$1.5	$6	192K		✅
o1-mini	$1.5	$6	128K	✅	✅
gpt-4.1	$2	$8	1M	✅
gpt-4o-audio	$2.5	$10	128K	✅
gpt-4o	$2.5	$10	128K	✅
gpt-3.5-turbo-16k	$3	$4	16K	✅
gpt-4o-realtime	$5	$20	128K	✅
gpt-4-turbo	$10	$30	128K	✅
o3	$10	$40	200K	✅	✅
o1-realtime	$15	$60	200K	✅	✅
o1	$15	$60	200K	✅	✅
gpt-4	$30	$60	8K	✅
gpt-4-32k	$60	$120	32K
o1-pro	$150	$600	200K	✅	✅
dall-e-2	$?	$?	?
dall-e-3	$?	$?	?
tts-1-hd	$?	$?	?
tts-1	$?	$?	?
whisper-1	$?	$?	?

🏢 Anthropic

Claude — known for safety, reasoning, and long context. 11 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
claude-haiku-4-5	$1	$5	200K	✅	✅
claude-sonnet-4-0	$3	$15	1M	✅	✅
claude-sonnet-4-5	$3	$15	1M	✅	✅
claude-sonnet-4-6	$3	$15	1M	✅	✅
claude-opus-4-5	$5	$25	200K	✅	✅
claude-opus-4-6	$5	$25	1M	✅	✅
claude-opus-4-7	$5	$25	1M	✅	✅
claude-opus-4-0	$15	$75	200K	✅	✅
claude-opus-4-1	$15	$75	200K	✅	✅
claude-opus-4-6-fast	$30	$150	1M	✅	✅
claude-opus-4-7-fast	$30	$150	1M	✅	✅

🏢 Google

Gemini — multimodal models with massive context windows. 21 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
gemini-1.5-flash-8b	$0.075	$0.3	1M	✅
gemini-1.5-flash	$0.075	$0.3	1M	✅
gemini-2.0-flash-lite	$0.075	$0.3	1M	✅
gemini-2.0-flash	$0.1	$0.4	1M	✅
gemini-2.5-flash-lite	$0.1	$0.4	1M	✅
gemini-2.5-flash	$0.15	$3.5	1M	✅	✅
gemini-1.5-pro	$1.25	$5	2M	✅
gemini-2.5-pro	$1.25	$10	1M	✅	✅
chirp-3.0-HD	$?	$?	?
gemma-3-12b-it	Free		131K
gemma-3-1b-it	Free		131K
gemma-3-27b-it	Free		131K
gemma-3-4b-it	Free		131K
gemma-3n-E2B-it	Free		131K
gemma-3n-E4B-it	Free		131K
imagen-3.0-fast-generate	$?	$?	?
imagen-3.0-generate	$?	$?	?
imagen-4.0-fast-generate	$?	$?	?
imagen-4.0-generate	$?	$?	?
lyria-2.0	$?	$?	?
veo-2.0-generate	$?	$?	?

🏢 Meta

Llama — open-weight models you can run anywhere. 12 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call
meta-llama-3.2-1b	$0.1	$0.1	128K
meta-llama-3.2-3b	$0.15	$0.15	128K
meta-llama-3.2-11b-vision	$0.16	$0.16	128K	✅
meta-llama-4-scout	$0.17	$0.66	10M	✅
meta-llama-3.1-8b	$0.22	$0.22	128K	✅
meta-llama-4-maverick	$0.24	$0.97	1M	✅
meta-llama-3-8b	$0.3	$0.6	8K
meta-llama-3.1-70b	$0.72	$0.72	128K	✅
meta-llama-3.2-90b-vision	$0.72	$0.72	128K	✅
meta-llama-3.3-70b	$0.72	$0.72	128K	✅
meta-llama-3.1-405b	$2.4	$2.4	128K	✅
meta-llama-3-70b	$2.65	$3.5	8K	✅

🏢 DeepSeek

High-performance reasoning at competitive prices. 4 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
deepseek-chat	$0.14	$0.28	1M	✅
deepseek-reasoner	$0.14	$0.28	1M	✅	✅
deepseek-v4-flash	$0.14	$0.28	1M	✅	✅
deepseek-v4-pro	$0.435	$0.87	1M	✅	✅

🏢 Mistral

European AI with open and commercial models. 16 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
ministral-3b	$0.04	$0.04	128K	✅
voxtral-mini	$0.04	$0.04	128K
ministral-8b	$0.1	$0.1	128K	✅
voxtral-small	$0.1	$0.3	128K
mistral-7b	$0.15	$0.2	32K
mistral-nemo	$0.15	$0.15	128K	✅
mistral-small	$0.2	$0.6	128K	✅
mistral-medium	$0.4	$2	128K	✅
mixtral-8x7b	$0.45	$0.7	32K	✅
magistral-small	$0.5	$1.5	128K	✅	✅
mixtral-8x22b	$0.8	$1.2	64K	✅
mistral-large	$2	$6	128K	✅
pixtral-large	$2	$6	128K	✅
mistral-large-2407	$4	$12	128K	✅
codestral	Free		256K
devstral	Free		128K	✅

🏢 xAI

Grok — models with real-time knowledge. 6 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
xai-grok-4-fast	$0.2	$0.5	131K	✅
xai-grok-4.1	$0.2	$0.5	131K	✅	✅
xai-grok-3-mini	$0.25	$1.27	131K	✅	✅
xai-grok-4.2	$2	$6	131K	✅	✅
xai-grok-3	$3	$15	131K	✅	✅
xai-grok-4	$3	$15	131K	✅	✅

🏢 AWS Bedrock

Managed access to multiple foundation models. 57 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call
amazon-nova-micro	$0.035	$0.14	128K	✅
google-gemma-3-4b	$0.04	$0.08	131K
mistral-voxtral-mini	$0.04	$0.04	128K
amazon-nova-lite	$0.06	$0.24	300K	✅
nvidia-nemotron-nano-2	$0.06	$0.23	4K
nvidia-nemotron-nano-3-30b	$0.06	$0.24	4K
openai-gpt-oss-20b	$0.07	$0.3	131K	✅
openai-gpt-oss-safeguard-20b	$0.07	$0.2	131K	✅
zai-glm-4-7-flash	$0.07	$0.4	131K	✅
google-gemma-3-12b	$0.09	$0.29	131K
meta-llama-3-2-1b	$0.1	$0.1	128K
mistral-ministral-3b	$0.1	$0.1	128K
mistral-voxtral-small	$0.1	$0.3	128K
meta-llama-3-2-3b	$0.15	$0.15	128K
mistral-ministral-8b	$0.15	$0.15	128K
mistral-mistral-7b	$0.15	$0.2	32K
nvidia-nemotron-3-super-120b	$0.15	$0.65	4K
openai-gpt-oss-120b	$0.15	$0.6	131K	✅
openai-gpt-oss-safeguard-120b	$0.15	$0.6	131K	✅
qwen-qwen3-32b	$0.15	$0.6	131K	✅
qwen-qwen3-coder-30b-a3b	$0.15	$0.6	131K	✅
writer-palmyra-vision-7b	$0.15	$0.6	8K
meta-llama-3-2-11b	$0.16	$0.16	128K	✅
meta-llama-4-scout-17b	$0.17	$0.66	1M	✅
mistral-ministral-14b	$0.2	$0.2	128K
nvidia-nemotron-nano-2-vl	$0.2	$0.6	4K
meta-llama-3-1-8b	$0.22	$0.22	128K	✅
google-gemma-3-27b	$0.23	$0.38	131K
meta-llama-4-maverick-17b	$0.24	$0.97	1M	✅
meta-llama-3-8b	$0.3	$0.6	8K
minimax-m2-1	$0.3	$1.2	1M	✅
minimax-m2-5	$0.3	$1.2	1M	✅
minimax-m2	$0.3	$1.2	1M	✅
amazon-nova-2-lite	$0.33	$2.75	64K	✅
mistral-devstral	$0.4	$2	128K	✅
mistral-mixtral-8x7b	$0.45	$0.7	32K
mistral-magistral-small	$0.5	$1.5	128K	✅
mistral-mistral-large-3	$0.5	$1.5	128K	✅
qwen-qwen3-coder-next	$0.5	$1.2	131K	✅
qwen-qwen3-vl-235b-a22b	$0.53	$2.66	131K	✅
kimi-k2-thinking	$0.6	$2.5	131K	✅
moonshot-kimi-k2-5	$0.6	$3	131K	✅
zai-glm-4-7	$0.6	$2.2	131K	✅
deepseek-v3-2	$0.62	$1.85	65K	✅
meta-llama-3-1-70b	$0.72	$0.72	128K	✅
meta-llama-3-2-90b	$0.72	$0.72	128K	✅
meta-llama-3-3-70b	$0.72	$0.72	128K	✅
amazon-nova-pro	$0.8	$3.2	300K	✅
meta-llama-3-1-70b-latency-optimized	$0.9	$0.9	128K	✅
amazon-nova-pro-latency-optimized	$1	$4	300K	✅
mistral-mistral-small	$1	$3	128K	✅
zai-glm-5	$1	$3.2	131K	✅
deepseek-r1	$1.35	$5.4	65K
mistral-pixtral-large	$2	$6	128K	✅
amazon-nova-premier	$2.5	$12.5	1M	✅
meta-llama-3-70b	$2.65	$3.5	8K
mistral-mistral-large	$4	$12	128K	✅

🏢 Groq

Ultra-fast inference with LPU hardware. 12 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call
llama-3.1-8b-instant	$0.05	$0.08	131K	✅
gpt-oss-20b	$0.075	$0.3	131K	✅
gpt-oss-safeguard-20b	$0.075	$0.3	131K	✅
llama-4-scout-17b-16e-instruct	$0.11	$0.34	131K	✅
gpt-oss-120b	$0.15	$0.6	131K	✅
qwen3-32b	$0.29	$0.59	131K	✅
llama-3.3-70b-versatile	$0.59	$0.79	131K	✅
kimi-k2-instruct-0905	$1	$3	131K	✅
orpheus-ar-sa	$?	$?	?
orpheus-en	$?	$?	?
whisper-large-v3-turbo	$?	$?	?
whisper-large-v3	$?	$?	?

🏢 Together AI

Open-weight model hosting platform. 24 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
liquid-ai--LFM2-24B-A2B	$0.03	$0.12	131K	✅
openai--gpt-oss-20b	$0.05	$0.2	131K	✅
google--gemma-3n-E4B-it	$0.06	$0.12	131K
Qwen--Qwen3.5-9B	$0.1	$0.15	131K	✅
meta-llama--Meta-Llama-3.1-8B-Instruct-Lite	$0.1	$0.1	131K	✅
essential-ai--Rnj-1-Instruct	$0.15	$0.15	131K
openai--gpt-oss-120b	$0.15	$0.6	131K	✅
Qwen--Qwen3-235B-A22B-FP8-Throughput	$0.2	$0.6	131K	✅
MiniMaxAI--MiniMax-M2.5	$0.3	$1.2	131K	✅
MiniMaxAI--MiniMax-M2.7	$0.3	$1.2	131K	✅
Qwen--Qwen2.5-7B-Instruct-Turbo	$0.3	$0.3	131K	✅
google--gemma-4-31B-it	$0.39	$0.97	131K	✅
Qwen--Qwen3-Coder-Next	$0.5	$1.2	131K	✅
Qwen--Qwen3.6-Plus	$0.5	$3	131K	✅
moonshotai--Kimi-K2.5	$0.5	$2.8	131K	✅
Qwen--Qwen3.5-397B-A17B	$0.6	$3.6	131K	✅
deepseek-ai--DeepSeek-V3.1	$0.6	$1.7	131K	✅
meta-llama--Llama-3.3-70B-Instruct-Turbo	$0.88	$0.88	131K	✅
zai-org--GLM-5	$1	$3.2	131K	✅
moonshotai--Kimi-K2.6	$1.2	$4.5	262K	✅
cogito-ai--Cogito-v2.1-671B	$1.25	$1.25	131K	✅	✅
zai-org--GLM-5.1	$1.4	$4.4	131K	✅
Qwen--Qwen3-Coder-480B-A35B-Instruct	$2	$2	131K	✅
deepseek-ai--DeepSeek-V4-Pro	$2.1	$4.4	131K	✅	✅

🏢 Fireworks

Fast inference for open-source models. 10 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
gpt-oss-20b	$0.07	$0.3	131K	✅
gpt-oss-120b	$0.15	$0.6	131K	✅
llama4-scout-17b-16e-instruct	$0.18	$0.59	131K	✅
minimax-m2.5	$0.3	$1.2	196K	✅
minimax-m2.7	$0.3	$1.2	196K	✅
qwen3.6-plus	$0.5	$3	131K	✅
kimi-k2.5	$0.6	$3	262K	✅
kimi-k2.6	$0.95	$4	262K	✅
glm-5.1	$1.4	$4.4	202K	✅
deepseek-v4-pro	$1.74	$3.48	1M	✅	✅

🏢 Cerebras

Wafer-scale inference at extreme speed. 11 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
llama3.1-8b	$0.1	$0.1	131K	✅
gpt-oss-120b	$0.35	$0.75	131K	✅
qwen3-235b-instruct	$0.6	$1.2	131K	✅
zai-glm-4.7	$2.25	$2.75	131K	✅
deepseek-r1-distill-llama-70b	Free		131K		✅
deepseek-r1-distill-llama-8b	Free		131K		✅
llama-3.3-70b	Free		131K	✅
llama-4-scout-17b-16e-instruct	Free		131K	✅
qwen-2.5-32b	Free		131K	✅
qwen-2.5-coder-32b	Free		131K	✅
qwen3-32b	Free		131K	✅

🏢 Databricks

DBRX and enterprise AI models. 29 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call
databricks-gpt-5-nano	$0.05	$0.4	200K
databricks-gpt-oss-20b	$0.07	$0.3	131K
databricks-gemma-3-12b	$0.15	$0.5	131K
databricks-gpt-oss-120b	$0.15	$0.6	131K
databricks-meta-llama-3-1-8b-instruct	$0.15	$0.45	131K	✅
databricks-qwen3-next-80b-a3b-instruct	$0.15	$1.2	131K	✅
databricks-gpt-5-4-nano	$0.2	$1.25	128K
databricks-gemini-3-1-flash-lite	$0.25	$1.5	128K
databricks-gpt-5-1-codex-mini	$0.25	$2	200K
databricks-gpt-5-mini	$0.25	$2	200K
databricks-gemini-2-5-flash	$0.3	$2.5	128K
databricks-llama-4-maverick	$0.5	$1.5	131K	✅
databricks-meta-llama-3-3-70b-instruct	$0.5	$1.5	131K	✅
databricks-gemini-3-flash	$0.63	$3.75	128K
databricks-gpt-5-4-mini	$0.75	$4.5	128K
databricks-claude-haiku-4-5	$1	$5	200K
databricks-gemini-2-5-pro	$1.25	$10	128K
databricks-gpt-5-1-codex-max	$1.25	$10	200K
databricks-gpt-5-1	$1.25	$10	200K
databricks-gpt-5	$1.25	$10	200K
databricks-gpt-5-2-codex	$1.75	$14	200K
databricks-gpt-5-2	$1.75	$14	200K
databricks-gemini-3-1-pro	$2.5	$15	128K
databricks-gpt-5-4	$2.5	$15	128K
databricks-claude-sonnet-4-5	$3	$15	200K
databricks-claude-sonnet-4	$3	$15	200K
databricks-claude-opus-4-5	$5	$25	200K
databricks-gpt-5-5	$5	$30	128K
databricks-claude-opus-4-1	$15	$75	200K

🏢 Alibaba (Qwen)

Qwen — multilingual models from Alibaba Cloud. 62 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
qwen-flash	$0.15	$1.5	?	✅	✅
qwen3.5-flash-2026-02-23	$0.2	$2	1M	✅
qwen3.5-flash	$0.2	$2	1M	✅
qwen-flash-character	$0.25	$1.5	?	✅	✅
qwen-turbo	$0.3	$0.6	?	✅	✅
qwen3-0.6b	$0.3	$1.2	?	✅	✅
qwen3-1.7b	$0.3	$1.2	?	✅	✅
qwen3-4b	$0.3	$1.2	?	✅	✅
qwen-omni-turbo	$0.4	$25	?	✅	✅
qwen3.5-35b-a3b	$0.4	$3.2	256K	✅
qwen-long-2025-01-25	$0.5	$2	?	✅	✅
qwen-long-latest	$0.5	$2	?	✅	✅
qwen-long	$0.5	$2	?	✅	✅
qwen2.5-7b-instruct-1m	$0.5	$1	?	✅	✅
qwen2.5-7b-instruct	$0.5	$1	?	✅	✅
qwen3-8b	$0.5	$2	?	✅	✅
qwen-mt-lite	$0.6	$1.6	?	✅	✅
qwen2.5-omni-7b	$0.6	$38	?	✅	✅
qwen3.5-27b	$0.6	$4.8	256K	✅
qwen-mt-flash	$0.7	$1.95	?	✅	✅
qwen-mt-turbo	$0.7	$1.95	?	✅	✅
qwen3-30b-a3b-instruct-2507	$0.75	$3	?	✅	✅
qwen3-30b-a3b	$0.75	$3	?	✅	✅
qwen-plus-character	$0.8	$2	?	✅	✅
qwen-plus	$0.8	$2	?	✅	✅
qwen3.5-122b-a10b	$0.8	$6.4	256K	✅
qwen3.5-plus-2026-02-15	$0.8	$4.8	1M	✅
qwen3.5-plus	$0.8	$4.8	1M	✅
qwen2.5-14b-instruct-1m	$1	$3	?	✅	✅
qwen2.5-14b-instruct	$1	$3	?	✅	✅
qwen3-14b	$1	$4	?	✅	✅
qwen3-coder-flash-2025-07-28	$1	$4	?	✅	✅
qwen3-coder-flash	$1	$4	?	✅	✅
qwen3-coder-next	$1	$4	?	✅	✅
qwen3-next-80b-a3b-instruct	$1	$4	?	✅	✅
qwen2.5-vl-3b-instruct	$1.2	$3.6	?	✅	✅
qwen3.5-397b-a17b	$1.2	$7.2	256K	✅
qwen3.6-flash-2026-04-16	$1.2	$7.2	1M	✅
qwen3.6-flash	$1.2	$7.2	1M	✅	✅
qwen3-coder-30b-a3b-instruct	$1.5	$6	?	✅	✅
qwen-mt-plus	$1.8	$5.4	?	✅	✅
qwen2.5-32b-instruct	$2	$6	?	✅	✅
qwen2.5-vl-7b-instruct	$2	$5	?	✅	✅
qwen3-235b-a22b-instruct-2507	$2	$8	?	✅	✅
qwen3-235b-a22b	$2	$8	?	✅	✅
qwen3-32b	$2	$8	?	✅	✅
qwen3.6-plus-2026-04-02	$2	$12	1M	✅
qwen3.6-plus	$2	$12	1M	✅	✅
qwen-max	$2.4	$9.6	?	✅	✅
qwen3-max-2026-01-23	$2.5	$10	?	✅	✅
qwen3-max	$2.5	$10	?	✅	✅
qwen-plus-character-ja	$3.67	$10.275	?	✅	✅
qwen2.5-72b-instruct	$4	$12	?	✅	✅
qwen3-coder-plus-2025-07-22	$4	$16	?	✅	✅
qwen3-coder-plus-2025-09-23	$4	$16	?	✅	✅
qwen3-coder-plus	$4	$16	?	✅	✅
qwen3-coder-480b-a35b-instruct	$6	$24	?	✅	✅
qwen3-max-2025-09-23	$6	$24	?	✅	✅
qwen3-max-preview	$6	$24	?	✅	✅
qwen2.5-vl-32b-instruct	$8	$24	?	✅	✅
qwen3.6-max-preview	$9	$54	256K	✅	✅
qwen2.5-vl-72b-instruct	$16	$48	?	✅	✅

🏢 ByteDance

Doubao — models from the TikTok parent company. 5 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
seed-1.6-flash	$0.07	$0.3	262K	✅	✅
seed-2.0-mini	$0.1	$0.4	262K	✅	✅
ui-tars-1.5-7b	$0.1	$0.2	128K
seed-1.6	$0.25	$2	262K	✅	✅
seed-2.0-lite	$0.25	$2	262K	✅	✅

🏢 MiniMax

Chinese AI startup with competitive models. 21 models available.

Model	Input $/1M	Output $/1M	Context
M2-her	$2.1	$8.4	64K
MiniMax-M2.1	$2.1	$8.4	204K
MiniMax-M2.5	$2.1	$8.4	204K
MiniMax-M2.7	$2.1	$8.4	204K
MiniMax-M2	$2.1	$8.4	204K
MiniMax-M2.1-highspeed	$4.2	$16.8	204K
MiniMax-M2.5-highspeed	$4.2	$16.8	204K
MiniMax-M2.7-highspeed	$4.2	$16.8	204K
MiniMax-Hailuo-02	$?	$?	?
MiniMax-Hailuo-2.3-Fast	$?	$?	?
MiniMax-Hailuo-2.3	$?	$?	?
image-01-live	$?	$?	?
image-01	$?	$?	?
music-2.6	$?	$?	?
music-cover	$?	$?	?
speech-02-hd	$?	$?	?
speech-02-turbo	$?	$?	?
speech-2.6-hd	$?	$?	?
speech-2.6-turbo	$?	$?	?
speech-2.8-hd	$?	$?	?
speech-2.8-turbo	$?	$?	?

🏢 Moonshot AI

Kimi — long-context Chinese models. 16 models available.

Model	Input $/1M	Output $/1M	Context	Reasoning
moonshot-v1-8k-vision-preview	$2	$10	8K
moonshot-v1-8k	$2	$10	8K
kimi-k2-0711-preview	$4	$16	131K
kimi-k2-0905-preview	$4	$16	262K
kimi-k2-thinking	$4	$16	262K	✅
kimi-k2.5	$4	$21	262K	✅
kimi-vl-a3b-thinking	$4	$21	131K	✅
kimi-vl-a3b	$4	$21	131K
moonshot-v1-32k-vision-preview	$5	$20	32K
moonshot-v1-32k	$5	$20	32K
kimi-k2.6-long	$6.5	$27	262K	✅
kimi-k2.6	$6.5	$27	262K	✅
kimi-k2-thinking-turbo	$8	$58	262K	✅
kimi-k2-turbo-preview	$8	$58	262K
moonshot-v1-128k-vision-preview	$10	$30	131K
moonshot-v1-128k	$10	$30	131K

🏢 StepFun

Step — Chinese AI models with strong capabilities. 31 models available.

Model	Input $/1M	Output $/1M	Context
step-3.5-flash-2603	$0.7	$2.1	256K
step-3.5-flash	$0.7	$2.1	256K
step-2-mini	$1	$2	32K
step-3	$1.5	$4	64K
step-1o-turbo-vision	$2.5	$8	32K
step-r1-v-mini	$2.5	$8	100K
step-1-8k	$5	$20	8K
step-1v-8k	$5	$20	8K
step-audio-2	$10	$70	?
stepaudio-2.5-chat	$10	$25	?
stepaudio-2.5-realtime	$10	$70	?
step-1-32k	$15	$70	32K
step-1o-vision-32k	$15	$70	32K
step-1v-32k	$15	$70	32K
step-1o-audio	$25	$60	?
step-2-16k-exp	$38	$120	16K
step-2-16k	$38	$120	16K
step-1x-edit	Free		?
step-1x-medium	$?	$?	?
step-2x-large	Free		?
step-asr-1.1-stream	$?	$?	?
step-asr-1.1	$?	$?	?
step-asr	$?	$?	?
step-audio-r1.1	Free		?
step-gui	Free		?
step-image-edit-2	$?	$?	?
step-tts-2	$?	$?	?
step-tts-mini	$?	$?	?
stepaudio-2-asr-pro	$?	$?	?
stepaudio-2.5-asr	$?	$?	?
stepaudio-2.5-tts	$?	$?	?

🏢 Baidu

ERNIE — models from China's search giant. 8 models available.

Model	Input $/1M	Output $/1M	Context	Tool Call	Reasoning
deepseek-v4-flash	$0.126	$0.252	1M	✅	✅
deepseek-v3.2	$0.252	$0.378	131K	✅	✅
minimax-m2.5	$0.27	$1.08	196K	✅	✅
qianfan-ocr-fast	$0.6799999999999999	$2.81	65K
glm-5	$0.7	$2.24	202K	✅	✅
glm-5.1	$0.98	$3.08	202K	✅	✅
deepseek-v4-pro	$1.521	$3.042	716K	✅	✅
cobuddy	Free		131K	✅

📊 Methodology

All data is sourced from first-party APIs — not third-party aggregators. Pricing, context windows, and capabilities are verified against official provider documentation. Aggregator providers (OpenRouter, Requesty, etc.) are labeled as such — they provide access to other providers' models.

🔗 More Resources

Interactive Catalog — search, filter, compare all models
Best AI Models — curated by use case
Free AI Models — 81 models at zero cost
LLM Pricing Comparison — detailed pricing tables
OpenAI Alternatives — 95 providers compared
GitHub Repository 🔓 Open Source AI Models (527 models) 🎨 Multimodal AI Models (1,548 models) State of AI Models 2025 — star, fork, contribute
Cheapest AI Models — lowest price LLMs
Reasoning Models Comparison — o1, R1, Claude, Gemini compared
Tool Calling Models Comparison — function calling LLMs
AI Model Pricing Calculator — LLM cost calculator
Best AI Models for Image Generation — DALL·E, Imagen, GPT-5 Image compared
Best AI Models for Vision — GPT-4o, Claude, Gemini vision compared
Structured Output Models Comparison — JSON mode, function calling compared

Small Language Models

🎯 AI Model Picker

⚡ GitHub Action