Browse 4,587 AI models across 95 providers. First-party data with real pricing, context windows, and capabilities.
All 95 providers sorted by number of models. Click a provider to see their models.
| Provider | Models | Cheapest Input $/1M | Max Context | Tool Call | Free |
|---|---|---|---|---|---|
| nanogpt (aggregator) | 547 | Aggregator | ? | 0 | |
| aihubmix (aggregator) | 476 | Aggregator | ? | 132 | |
| openrouter (aggregator) | 356 | Aggregator | 10M | 263 | β |
| martian (aggregator) | 304 | Aggregator | ? | 0 | |
| requesty (aggregator) | 277 | Aggregator | 1M | 251 | |
| 302ai (aggregator) | 268 | Aggregator | 2M | 190 | |
| auriko (aggregator) | 181 | Aggregator | 1M | 154 | β |
| llmgateway (aggregator) | 163 | Aggregator | ? | 158 | β |
| aimlapi | 147 | $0.007 | 2M | 21 | β |
| fastrouter (aggregator) | 120 | Aggregator | 2M | 94 | β |
| orcarouter (aggregator) | 120 | Aggregator | 1M | 102 | |
| cortecs (aggregator) | 105 | Aggregator | ? | 97 | |
| novitaai | 104 | $0.02 | 1M | 72 | β |
| vultr | 98 | $0.55 | 1M | 11 | |
| deepinfra | 88 | $0.01 | 1M | 0 | |
| venice (aggregator) | 75 | Aggregator | 2M | 64 | |
| jiekou (aggregator) | 73 | Aggregator | 2M | 73 | |
| meganova (aggregator) | 63 | Aggregator | 1M | 60 | β |
| alibaba | 62 | $0.15 | 1M | 62 | |
| ppio | 60 | $0.2145 | 1M | 46 | β |
| amazon-bedrock | 57 | $0.035 | 1M | 37 | |
| google-vertex | 38 | $0.07 | 1M | 32 | |
| siliconflow-cn | 37 | $0.5 | 262K | 2 | |
| stepfun | 31 | $0.7 | 256K | 0 | β |
| cloudflare | 30 | $0.017 | 327K | 15 | |
| databricks | 29 | $0.05 | 200K | 4 | |
| gmicloud | 29 | $0.07 | 1M | 11 | |
| openai | 28 | $0.02 | 1M | 18 | |
| siliconflow | 27 | $0.04 | 1M | 24 | |
| togetherai | 24 | $0.03 | 262K | 22 | |
| nebius | 23 | $0.02 | 1M | 21 | |
| 21 | $0.075 | 2M | 8 | β | |
| minimax | 21 | $2.1 | 204K | 0 | |
| voyage | 21 | $0.02 | ? | 0 | β |
| digitalocean | 20 | $0.05 | 1M | 14 | |
| inferencenet | 20 | $0.01 | 131K | 15 | |
| zhipuai | 20 | $0.1 | 1M | 20 | β |
| tencent-tokenhub | 19 | $1 | 1M | 16 | |
| mistral | 16 | $0.04 | 256K | 12 | β |
| moonshotai | 16 | $2 | 262K | 0 | |
| neuralwatt | 14 | $0.03 | ? | 14 | |
| tencent | 14 | $0.5 | 250K | 3 | β |
| scaleway | 13 | $0.15 | 131K | 6 | |
| chutes | 12 | $0.08 | 262K | 12 | |
| clarifai | 12 | $0.09 | 1M | 9 | |
| cloudferro-sherlock | 12 | $0.26 | 1M | 5 | |
| groq | 12 | $0.05 | 131K | 8 | |
| klusterai | 12 | $0.008 | 1M | 4 | |
| meta | 12 | $0.1 | 10M | 9 | |
| microsoft | 12 | $0.075 | 128K | 6 | |
| ovhcloud | 12 | $0.05 | 262K | 0 | |
| anthropic | 11 | $1 | 1M | 11 | |
| baichuan | 11 | $0.98 | 131K | 0 | β |
| cerebras | 11 | $0.1 | 131K | 9 | β |
| hpc-ai | 11 | $0.14 | 1M | 11 | |
| hyperbolic | 11 | $0.1 | 163K | 0 | |
| fireworks | 10 | $0.07 | 1M | 10 | |
| baseten | 9 | $0.1 | 1M | 9 | |
| baidu | 8 | $0.126 | 1M | 7 | β |
| evroc | 8 | $0.1 | 131K | 3 | |
| friendli | 8 | $0.1 | 262K | 8 | |
| upstage | 8 | $0.1 | 128K | 3 | |
| amazon | 7 | $0.035 | 1M | 7 | |
| arcee | 7 | $0.04 | 262K | 6 | β |
| berget | 7 | $0.2 | ? | 7 | |
| morph | 7 | $0.2 | 1M | 5 | |
| nousresearch | 7 | $0.06 | 131K | 7 | |
| sambanova | 7 | $0.22 | 196K | 0 | |
| dinference | 6 | $0.07 | 204K | 3 | |
| iflytek | 6 | $0.8 | 262K | 0 | β |
| submodel | 6 | $0.1 | 262K | 0 | |
| textsynth | 6 | $0.2 | 131K | 0 | |
| writer | 6 | $0.6 | 1M | 3 | |
| xai | 6 | $0.2 | 131K | 6 | |
| 01ai | 5 | $1 | 32K | 4 | |
| aion | 5 | $0.7 | 131K | 0 | |
| bytedance | 5 | $0.07 | 262K | 4 | |
| inception | 5 | $0.25 | 128K | 3 | |
| mixlayer | 5 | $0.1 | 131K | 5 | β |
| privatemode | 5 | $0.43 | 131K | 3 | |
| xiaomi | 5 | $0.1 | 1M | 5 | |
| deepseek | 4 | $0.14 | 1M | 4 | |
| perplexity | 4 | $1 | 200K | 4 | |
| inclusionai | 3 | $0.01 | 262K | 3 | |
| ai21 | 2 | $0.2 | 256K | 0 | |
| reka | 2 | $0.03 | 131K | 1 | |
| wafer | 2 | $0.6 | 262K | 2 |
GPT-4, GPT-4o, o1, o3 β the industry standard for LLMs. 28 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| text-embedding-3-small | $0.02 | $0 | 8K | ||
| gpt-4.1-nano | $0.1 | $0.4 | 1M | β | |
| text-embedding-ada-002 | $0.1 | $0 | 8K | ||
| text-embedding-3-large | $0.13 | $0 | 8K | ||
| gpt-4o-mini | $0.15 | $0.6 | 128K | β | |
| gpt-4.1-mini | $0.4 | $1.6 | 1M | β | |
| gpt-3.5-turbo | $0.5 | $1.5 | 16K | β | |
| o3-mini | $1.1 | $4.4 | 200K | β | β |
| o4-mini | $1.1 | $4.4 | 200K | β | β |
| codex-mini | $1.5 | $6 | 192K | β | |
| o1-mini | $1.5 | $6 | 128K | β | β |
| gpt-4.1 | $2 | $8 | 1M | β | |
| gpt-4o-audio | $2.5 | $10 | 128K | β | |
| gpt-4o | $2.5 | $10 | 128K | β | |
| gpt-3.5-turbo-16k | $3 | $4 | 16K | β | |
| gpt-4o-realtime | $5 | $20 | 128K | β | |
| gpt-4-turbo | $10 | $30 | 128K | β | |
| o3 | $10 | $40 | 200K | β | β |
| o1-realtime | $15 | $60 | 200K | β | β |
| o1 | $15 | $60 | 200K | β | β |
| gpt-4 | $30 | $60 | 8K | β | |
| gpt-4-32k | $60 | $120 | 32K | ||
| o1-pro | $150 | $600 | 200K | β | β |
| dall-e-2 | $? | $? | ? | ||
| dall-e-3 | $? | $? | ? | ||
| tts-1-hd | $? | $? | ? | ||
| tts-1 | $? | $? | ? | ||
| whisper-1 | $? | $? | ? |
Claude β known for safety, reasoning, and long context. 11 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| claude-haiku-4-5 | $1 | $5 | 200K | β | β |
| claude-sonnet-4-0 | $3 | $15 | 1M | β | β |
| claude-sonnet-4-5 | $3 | $15 | 1M | β | β |
| claude-sonnet-4-6 | $3 | $15 | 1M | β | β |
| claude-opus-4-5 | $5 | $25 | 200K | β | β |
| claude-opus-4-6 | $5 | $25 | 1M | β | β |
| claude-opus-4-7 | $5 | $25 | 1M | β | β |
| claude-opus-4-0 | $15 | $75 | 200K | β | β |
| claude-opus-4-1 | $15 | $75 | 200K | β | β |
| claude-opus-4-6-fast | $30 | $150 | 1M | β | β |
| claude-opus-4-7-fast | $30 | $150 | 1M | β | β |
Gemini β multimodal models with massive context windows. 21 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| gemini-1.5-flash-8b | $0.075 | $0.3 | 1M | β | |
| gemini-1.5-flash | $0.075 | $0.3 | 1M | β | |
| gemini-2.0-flash-lite | $0.075 | $0.3 | 1M | β | |
| gemini-2.0-flash | $0.1 | $0.4 | 1M | β | |
| gemini-2.5-flash-lite | $0.1 | $0.4 | 1M | β | |
| gemini-2.5-flash | $0.15 | $3.5 | 1M | β | β |
| gemini-1.5-pro | $1.25 | $5 | 2M | β | |
| gemini-2.5-pro | $1.25 | $10 | 1M | β | β |
| chirp-3.0-HD | $? | $? | ? | ||
| gemma-3-12b-it | Free | 131K | |||
| gemma-3-1b-it | Free | 131K | |||
| gemma-3-27b-it | Free | 131K | |||
| gemma-3-4b-it | Free | 131K | |||
| gemma-3n-E2B-it | Free | 131K | |||
| gemma-3n-E4B-it | Free | 131K | |||
| imagen-3.0-fast-generate | $? | $? | ? | ||
| imagen-3.0-generate | $? | $? | ? | ||
| imagen-4.0-fast-generate | $? | $? | ? | ||
| imagen-4.0-generate | $? | $? | ? | ||
| lyria-2.0 | $? | $? | ? | ||
| veo-2.0-generate | $? | $? | ? |
Llama β open-weight models you can run anywhere. 12 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| meta-llama-3.2-1b | $0.1 | $0.1 | 128K | ||
| meta-llama-3.2-3b | $0.15 | $0.15 | 128K | ||
| meta-llama-3.2-11b-vision | $0.16 | $0.16 | 128K | β | |
| meta-llama-4-scout | $0.17 | $0.66 | 10M | β | |
| meta-llama-3.1-8b | $0.22 | $0.22 | 128K | β | |
| meta-llama-4-maverick | $0.24 | $0.97 | 1M | β | |
| meta-llama-3-8b | $0.3 | $0.6 | 8K | ||
| meta-llama-3.1-70b | $0.72 | $0.72 | 128K | β | |
| meta-llama-3.2-90b-vision | $0.72 | $0.72 | 128K | β | |
| meta-llama-3.3-70b | $0.72 | $0.72 | 128K | β | |
| meta-llama-3.1-405b | $2.4 | $2.4 | 128K | β | |
| meta-llama-3-70b | $2.65 | $3.5 | 8K | β |
High-performance reasoning at competitive prices. 4 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| deepseek-chat | $0.14 | $0.28 | 1M | β | |
| deepseek-reasoner | $0.14 | $0.28 | 1M | β | β |
| deepseek-v4-flash | $0.14 | $0.28 | 1M | β | β |
| deepseek-v4-pro | $0.435 | $0.87 | 1M | β | β |
European AI with open and commercial models. 16 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| ministral-3b | $0.04 | $0.04 | 128K | β | |
| voxtral-mini | $0.04 | $0.04 | 128K | ||
| ministral-8b | $0.1 | $0.1 | 128K | β | |
| voxtral-small | $0.1 | $0.3 | 128K | ||
| mistral-7b | $0.15 | $0.2 | 32K | ||
| mistral-nemo | $0.15 | $0.15 | 128K | β | |
| mistral-small | $0.2 | $0.6 | 128K | β | |
| mistral-medium | $0.4 | $2 | 128K | β | |
| mixtral-8x7b | $0.45 | $0.7 | 32K | β | |
| magistral-small | $0.5 | $1.5 | 128K | β | β |
| mixtral-8x22b | $0.8 | $1.2 | 64K | β | |
| mistral-large | $2 | $6 | 128K | β | |
| pixtral-large | $2 | $6 | 128K | β | |
| mistral-large-2407 | $4 | $12 | 128K | β | |
| codestral | Free | 256K | |||
| devstral | Free | 128K | β |
Grok β models with real-time knowledge. 6 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| xai-grok-4-fast | $0.2 | $0.5 | 131K | β | |
| xai-grok-4.1 | $0.2 | $0.5 | 131K | β | β |
| xai-grok-3-mini | $0.25 | $1.27 | 131K | β | β |
| xai-grok-4.2 | $2 | $6 | 131K | β | β |
| xai-grok-3 | $3 | $15 | 131K | β | β |
| xai-grok-4 | $3 | $15 | 131K | β | β |
Managed access to multiple foundation models. 57 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| amazon-nova-micro | $0.035 | $0.14 | 128K | β | |
| google-gemma-3-4b | $0.04 | $0.08 | 131K | ||
| mistral-voxtral-mini | $0.04 | $0.04 | 128K | ||
| amazon-nova-lite | $0.06 | $0.24 | 300K | β | |
| nvidia-nemotron-nano-2 | $0.06 | $0.23 | 4K | ||
| nvidia-nemotron-nano-3-30b | $0.06 | $0.24 | 4K | ||
| openai-gpt-oss-20b | $0.07 | $0.3 | 131K | β | |
| openai-gpt-oss-safeguard-20b | $0.07 | $0.2 | 131K | β | |
| zai-glm-4-7-flash | $0.07 | $0.4 | 131K | β | |
| google-gemma-3-12b | $0.09 | $0.29 | 131K | ||
| meta-llama-3-2-1b | $0.1 | $0.1 | 128K | ||
| mistral-ministral-3b | $0.1 | $0.1 | 128K | ||
| mistral-voxtral-small | $0.1 | $0.3 | 128K | ||
| meta-llama-3-2-3b | $0.15 | $0.15 | 128K | ||
| mistral-ministral-8b | $0.15 | $0.15 | 128K | ||
| mistral-mistral-7b | $0.15 | $0.2 | 32K | ||
| nvidia-nemotron-3-super-120b | $0.15 | $0.65 | 4K | ||
| openai-gpt-oss-120b | $0.15 | $0.6 | 131K | β | |
| openai-gpt-oss-safeguard-120b | $0.15 | $0.6 | 131K | β | |
| qwen-qwen3-32b | $0.15 | $0.6 | 131K | β | |
| qwen-qwen3-coder-30b-a3b | $0.15 | $0.6 | 131K | β | |
| writer-palmyra-vision-7b | $0.15 | $0.6 | 8K | ||
| meta-llama-3-2-11b | $0.16 | $0.16 | 128K | β | |
| meta-llama-4-scout-17b | $0.17 | $0.66 | 1M | β | |
| mistral-ministral-14b | $0.2 | $0.2 | 128K | ||
| nvidia-nemotron-nano-2-vl | $0.2 | $0.6 | 4K | ||
| meta-llama-3-1-8b | $0.22 | $0.22 | 128K | β | |
| google-gemma-3-27b | $0.23 | $0.38 | 131K | ||
| meta-llama-4-maverick-17b | $0.24 | $0.97 | 1M | β | |
| meta-llama-3-8b | $0.3 | $0.6 | 8K | ||
| minimax-m2-1 | $0.3 | $1.2 | 1M | β | |
| minimax-m2-5 | $0.3 | $1.2 | 1M | β | |
| minimax-m2 | $0.3 | $1.2 | 1M | β | |
| amazon-nova-2-lite | $0.33 | $2.75 | 64K | β | |
| mistral-devstral | $0.4 | $2 | 128K | β | |
| mistral-mixtral-8x7b | $0.45 | $0.7 | 32K | ||
| mistral-magistral-small | $0.5 | $1.5 | 128K | β | |
| mistral-mistral-large-3 | $0.5 | $1.5 | 128K | β | |
| qwen-qwen3-coder-next | $0.5 | $1.2 | 131K | β | |
| qwen-qwen3-vl-235b-a22b | $0.53 | $2.66 | 131K | β | |
| kimi-k2-thinking | $0.6 | $2.5 | 131K | β | |
| moonshot-kimi-k2-5 | $0.6 | $3 | 131K | β | |
| zai-glm-4-7 | $0.6 | $2.2 | 131K | β | |
| deepseek-v3-2 | $0.62 | $1.85 | 65K | β | |
| meta-llama-3-1-70b | $0.72 | $0.72 | 128K | β | |
| meta-llama-3-2-90b | $0.72 | $0.72 | 128K | β | |
| meta-llama-3-3-70b | $0.72 | $0.72 | 128K | β | |
| amazon-nova-pro | $0.8 | $3.2 | 300K | β | |
| meta-llama-3-1-70b-latency-optimized | $0.9 | $0.9 | 128K | β | |
| amazon-nova-pro-latency-optimized | $1 | $4 | 300K | β | |
| mistral-mistral-small | $1 | $3 | 128K | β | |
| zai-glm-5 | $1 | $3.2 | 131K | β | |
| deepseek-r1 | $1.35 | $5.4 | 65K | ||
| mistral-pixtral-large | $2 | $6 | 128K | β | |
| amazon-nova-premier | $2.5 | $12.5 | 1M | β | |
| meta-llama-3-70b | $2.65 | $3.5 | 8K | ||
| mistral-mistral-large | $4 | $12 | 128K | β |
Ultra-fast inference with LPU hardware. 12 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| llama-3.1-8b-instant | $0.05 | $0.08 | 131K | β | |
| gpt-oss-20b | $0.075 | $0.3 | 131K | β | |
| gpt-oss-safeguard-20b | $0.075 | $0.3 | 131K | β | |
| llama-4-scout-17b-16e-instruct | $0.11 | $0.34 | 131K | β | |
| gpt-oss-120b | $0.15 | $0.6 | 131K | β | |
| qwen3-32b | $0.29 | $0.59 | 131K | β | |
| llama-3.3-70b-versatile | $0.59 | $0.79 | 131K | β | |
| kimi-k2-instruct-0905 | $1 | $3 | 131K | β | |
| orpheus-ar-sa | $? | $? | ? | ||
| orpheus-en | $? | $? | ? | ||
| whisper-large-v3-turbo | $? | $? | ? | ||
| whisper-large-v3 | $? | $? | ? |
Open-weight model hosting platform. 24 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| liquid-ai--LFM2-24B-A2B | $0.03 | $0.12 | 131K | β | |
| openai--gpt-oss-20b | $0.05 | $0.2 | 131K | β | |
| google--gemma-3n-E4B-it | $0.06 | $0.12 | 131K | ||
| Qwen--Qwen3.5-9B | $0.1 | $0.15 | 131K | β | |
| meta-llama--Meta-Llama-3.1-8B-Instruct-Lite | $0.1 | $0.1 | 131K | β | |
| essential-ai--Rnj-1-Instruct | $0.15 | $0.15 | 131K | ||
| openai--gpt-oss-120b | $0.15 | $0.6 | 131K | β | |
| Qwen--Qwen3-235B-A22B-FP8-Throughput | $0.2 | $0.6 | 131K | β | |
| MiniMaxAI--MiniMax-M2.5 | $0.3 | $1.2 | 131K | β | |
| MiniMaxAI--MiniMax-M2.7 | $0.3 | $1.2 | 131K | β | |
| Qwen--Qwen2.5-7B-Instruct-Turbo | $0.3 | $0.3 | 131K | β | |
| google--gemma-4-31B-it | $0.39 | $0.97 | 131K | β | |
| Qwen--Qwen3-Coder-Next | $0.5 | $1.2 | 131K | β | |
| Qwen--Qwen3.6-Plus | $0.5 | $3 | 131K | β | |
| moonshotai--Kimi-K2.5 | $0.5 | $2.8 | 131K | β | |
| Qwen--Qwen3.5-397B-A17B | $0.6 | $3.6 | 131K | β | |
| deepseek-ai--DeepSeek-V3.1 | $0.6 | $1.7 | 131K | β | |
| meta-llama--Llama-3.3-70B-Instruct-Turbo | $0.88 | $0.88 | 131K | β | |
| zai-org--GLM-5 | $1 | $3.2 | 131K | β | |
| moonshotai--Kimi-K2.6 | $1.2 | $4.5 | 262K | β | |
| cogito-ai--Cogito-v2.1-671B | $1.25 | $1.25 | 131K | β | β |
| zai-org--GLM-5.1 | $1.4 | $4.4 | 131K | β | |
| Qwen--Qwen3-Coder-480B-A35B-Instruct | $2 | $2 | 131K | β | |
| deepseek-ai--DeepSeek-V4-Pro | $2.1 | $4.4 | 131K | β | β |
Fast inference for open-source models. 10 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| gpt-oss-20b | $0.07 | $0.3 | 131K | β | |
| gpt-oss-120b | $0.15 | $0.6 | 131K | β | |
| llama4-scout-17b-16e-instruct | $0.18 | $0.59 | 131K | β | |
| minimax-m2.5 | $0.3 | $1.2 | 196K | β | |
| minimax-m2.7 | $0.3 | $1.2 | 196K | β | |
| qwen3.6-plus | $0.5 | $3 | 131K | β | |
| kimi-k2.5 | $0.6 | $3 | 262K | β | |
| kimi-k2.6 | $0.95 | $4 | 262K | β | |
| glm-5.1 | $1.4 | $4.4 | 202K | β | |
| deepseek-v4-pro | $1.74 | $3.48 | 1M | β | β |
Wafer-scale inference at extreme speed. 11 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| llama3.1-8b | $0.1 | $0.1 | 131K | β | |
| gpt-oss-120b | $0.35 | $0.75 | 131K | β | |
| qwen3-235b-instruct | $0.6 | $1.2 | 131K | β | |
| zai-glm-4.7 | $2.25 | $2.75 | 131K | β | |
| deepseek-r1-distill-llama-70b | Free | 131K | β | ||
| deepseek-r1-distill-llama-8b | Free | 131K | β | ||
| llama-3.3-70b | Free | 131K | β | ||
| llama-4-scout-17b-16e-instruct | Free | 131K | β | ||
| qwen-2.5-32b | Free | 131K | β | ||
| qwen-2.5-coder-32b | Free | 131K | β | ||
| qwen3-32b | Free | 131K | β |
DBRX and enterprise AI models. 29 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| databricks-gpt-5-nano | $0.05 | $0.4 | 200K | ||
| databricks-gpt-oss-20b | $0.07 | $0.3 | 131K | ||
| databricks-gemma-3-12b | $0.15 | $0.5 | 131K | ||
| databricks-gpt-oss-120b | $0.15 | $0.6 | 131K | ||
| databricks-meta-llama-3-1-8b-instruct | $0.15 | $0.45 | 131K | β | |
| databricks-qwen3-next-80b-a3b-instruct | $0.15 | $1.2 | 131K | β | |
| databricks-gpt-5-4-nano | $0.2 | $1.25 | 128K | ||
| databricks-gemini-3-1-flash-lite | $0.25 | $1.5 | 128K | ||
| databricks-gpt-5-1-codex-mini | $0.25 | $2 | 200K | ||
| databricks-gpt-5-mini | $0.25 | $2 | 200K | ||
| databricks-gemini-2-5-flash | $0.3 | $2.5 | 128K | ||
| databricks-llama-4-maverick | $0.5 | $1.5 | 131K | β | |
| databricks-meta-llama-3-3-70b-instruct | $0.5 | $1.5 | 131K | β | |
| databricks-gemini-3-flash | $0.63 | $3.75 | 128K | ||
| databricks-gpt-5-4-mini | $0.75 | $4.5 | 128K | ||
| databricks-claude-haiku-4-5 | $1 | $5 | 200K | ||
| databricks-gemini-2-5-pro | $1.25 | $10 | 128K | ||
| databricks-gpt-5-1-codex-max | $1.25 | $10 | 200K | ||
| databricks-gpt-5-1 | $1.25 | $10 | 200K | ||
| databricks-gpt-5 | $1.25 | $10 | 200K | ||
| databricks-gpt-5-2-codex | $1.75 | $14 | 200K | ||
| databricks-gpt-5-2 | $1.75 | $14 | 200K | ||
| databricks-gemini-3-1-pro | $2.5 | $15 | 128K | ||
| databricks-gpt-5-4 | $2.5 | $15 | 128K | ||
| databricks-claude-sonnet-4-5 | $3 | $15 | 200K | ||
| databricks-claude-sonnet-4 | $3 | $15 | 200K | ||
| databricks-claude-opus-4-5 | $5 | $25 | 200K | ||
| databricks-gpt-5-5 | $5 | $30 | 128K | ||
| databricks-claude-opus-4-1 | $15 | $75 | 200K |
Qwen β multilingual models from Alibaba Cloud. 62 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| qwen-flash | $0.15 | $1.5 | ? | β | β |
| qwen3.5-flash-2026-02-23 | $0.2 | $2 | 1M | β | |
| qwen3.5-flash | $0.2 | $2 | 1M | β | |
| qwen-flash-character | $0.25 | $1.5 | ? | β | β |
| qwen-turbo | $0.3 | $0.6 | ? | β | β |
| qwen3-0.6b | $0.3 | $1.2 | ? | β | β |
| qwen3-1.7b | $0.3 | $1.2 | ? | β | β |
| qwen3-4b | $0.3 | $1.2 | ? | β | β |
| qwen-omni-turbo | $0.4 | $25 | ? | β | β |
| qwen3.5-35b-a3b | $0.4 | $3.2 | 256K | β | |
| qwen-long-2025-01-25 | $0.5 | $2 | ? | β | β |
| qwen-long-latest | $0.5 | $2 | ? | β | β |
| qwen-long | $0.5 | $2 | ? | β | β |
| qwen2.5-7b-instruct-1m | $0.5 | $1 | ? | β | β |
| qwen2.5-7b-instruct | $0.5 | $1 | ? | β | β |
| qwen3-8b | $0.5 | $2 | ? | β | β |
| qwen-mt-lite | $0.6 | $1.6 | ? | β | β |
| qwen2.5-omni-7b | $0.6 | $38 | ? | β | β |
| qwen3.5-27b | $0.6 | $4.8 | 256K | β | |
| qwen-mt-flash | $0.7 | $1.95 | ? | β | β |
| qwen-mt-turbo | $0.7 | $1.95 | ? | β | β |
| qwen3-30b-a3b-instruct-2507 | $0.75 | $3 | ? | β | β |
| qwen3-30b-a3b | $0.75 | $3 | ? | β | β |
| qwen-plus-character | $0.8 | $2 | ? | β | β |
| qwen-plus | $0.8 | $2 | ? | β | β |
| qwen3.5-122b-a10b | $0.8 | $6.4 | 256K | β | |
| qwen3.5-plus-2026-02-15 | $0.8 | $4.8 | 1M | β | |
| qwen3.5-plus | $0.8 | $4.8 | 1M | β | |
| qwen2.5-14b-instruct-1m | $1 | $3 | ? | β | β |
| qwen2.5-14b-instruct | $1 | $3 | ? | β | β |
| qwen3-14b | $1 | $4 | ? | β | β |
| qwen3-coder-flash-2025-07-28 | $1 | $4 | ? | β | β |
| qwen3-coder-flash | $1 | $4 | ? | β | β |
| qwen3-coder-next | $1 | $4 | ? | β | β |
| qwen3-next-80b-a3b-instruct | $1 | $4 | ? | β | β |
| qwen2.5-vl-3b-instruct | $1.2 | $3.6 | ? | β | β |
| qwen3.5-397b-a17b | $1.2 | $7.2 | 256K | β | |
| qwen3.6-flash-2026-04-16 | $1.2 | $7.2 | 1M | β | |
| qwen3.6-flash | $1.2 | $7.2 | 1M | β | β |
| qwen3-coder-30b-a3b-instruct | $1.5 | $6 | ? | β | β |
| qwen-mt-plus | $1.8 | $5.4 | ? | β | β |
| qwen2.5-32b-instruct | $2 | $6 | ? | β | β |
| qwen2.5-vl-7b-instruct | $2 | $5 | ? | β | β |
| qwen3-235b-a22b-instruct-2507 | $2 | $8 | ? | β | β |
| qwen3-235b-a22b | $2 | $8 | ? | β | β |
| qwen3-32b | $2 | $8 | ? | β | β |
| qwen3.6-plus-2026-04-02 | $2 | $12 | 1M | β | |
| qwen3.6-plus | $2 | $12 | 1M | β | β |
| qwen-max | $2.4 | $9.6 | ? | β | β |
| qwen3-max-2026-01-23 | $2.5 | $10 | ? | β | β |
| qwen3-max | $2.5 | $10 | ? | β | β |
| qwen-plus-character-ja | $3.67 | $10.275 | ? | β | β |
| qwen2.5-72b-instruct | $4 | $12 | ? | β | β |
| qwen3-coder-plus-2025-07-22 | $4 | $16 | ? | β | β |
| qwen3-coder-plus-2025-09-23 | $4 | $16 | ? | β | β |
| qwen3-coder-plus | $4 | $16 | ? | β | β |
| qwen3-coder-480b-a35b-instruct | $6 | $24 | ? | β | β |
| qwen3-max-2025-09-23 | $6 | $24 | ? | β | β |
| qwen3-max-preview | $6 | $24 | ? | β | β |
| qwen2.5-vl-32b-instruct | $8 | $24 | ? | β | β |
| qwen3.6-max-preview | $9 | $54 | 256K | β | β |
| qwen2.5-vl-72b-instruct | $16 | $48 | ? | β | β |
Doubao β models from the TikTok parent company. 5 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| seed-1.6-flash | $0.07 | $0.3 | 262K | β | β |
| seed-2.0-mini | $0.1 | $0.4 | 262K | β | β |
| ui-tars-1.5-7b | $0.1 | $0.2 | 128K | ||
| seed-1.6 | $0.25 | $2 | 262K | β | β |
| seed-2.0-lite | $0.25 | $2 | 262K | β | β |
Chinese AI startup with competitive models. 21 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| M2-her | $2.1 | $8.4 | 64K | ||
| MiniMax-M2.1 | $2.1 | $8.4 | 204K | ||
| MiniMax-M2.5 | $2.1 | $8.4 | 204K | ||
| MiniMax-M2.7 | $2.1 | $8.4 | 204K | ||
| MiniMax-M2 | $2.1 | $8.4 | 204K | ||
| MiniMax-M2.1-highspeed | $4.2 | $16.8 | 204K | ||
| MiniMax-M2.5-highspeed | $4.2 | $16.8 | 204K | ||
| MiniMax-M2.7-highspeed | $4.2 | $16.8 | 204K | ||
| MiniMax-Hailuo-02 | $? | $? | ? | ||
| MiniMax-Hailuo-2.3-Fast | $? | $? | ? | ||
| MiniMax-Hailuo-2.3 | $? | $? | ? | ||
| image-01-live | $? | $? | ? | ||
| image-01 | $? | $? | ? | ||
| music-2.6 | $? | $? | ? | ||
| music-cover | $? | $? | ? | ||
| speech-02-hd | $? | $? | ? | ||
| speech-02-turbo | $? | $? | ? | ||
| speech-2.6-hd | $? | $? | ? | ||
| speech-2.6-turbo | $? | $? | ? | ||
| speech-2.8-hd | $? | $? | ? | ||
| speech-2.8-turbo | $? | $? | ? |
Kimi β long-context Chinese models. 16 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| moonshot-v1-8k-vision-preview | $2 | $10 | 8K | ||
| moonshot-v1-8k | $2 | $10 | 8K | ||
| kimi-k2-0711-preview | $4 | $16 | 131K | ||
| kimi-k2-0905-preview | $4 | $16 | 262K | ||
| kimi-k2-thinking | $4 | $16 | 262K | β | |
| kimi-k2.5 | $4 | $21 | 262K | β | |
| kimi-vl-a3b-thinking | $4 | $21 | 131K | β | |
| kimi-vl-a3b | $4 | $21 | 131K | ||
| moonshot-v1-32k-vision-preview | $5 | $20 | 32K | ||
| moonshot-v1-32k | $5 | $20 | 32K | ||
| kimi-k2.6-long | $6.5 | $27 | 262K | β | |
| kimi-k2.6 | $6.5 | $27 | 262K | β | |
| kimi-k2-thinking-turbo | $8 | $58 | 262K | β | |
| kimi-k2-turbo-preview | $8 | $58 | 262K | ||
| moonshot-v1-128k-vision-preview | $10 | $30 | 131K | ||
| moonshot-v1-128k | $10 | $30 | 131K |
Step β Chinese AI models with strong capabilities. 31 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| step-3.5-flash-2603 | $0.7 | $2.1 | 256K | ||
| step-3.5-flash | $0.7 | $2.1 | 256K | ||
| step-2-mini | $1 | $2 | 32K | ||
| step-3 | $1.5 | $4 | 64K | ||
| step-1o-turbo-vision | $2.5 | $8 | 32K | ||
| step-r1-v-mini | $2.5 | $8 | 100K | ||
| step-1-8k | $5 | $20 | 8K | ||
| step-1v-8k | $5 | $20 | 8K | ||
| step-audio-2 | $10 | $70 | ? | ||
| stepaudio-2.5-chat | $10 | $25 | ? | ||
| stepaudio-2.5-realtime | $10 | $70 | ? | ||
| step-1-32k | $15 | $70 | 32K | ||
| step-1o-vision-32k | $15 | $70 | 32K | ||
| step-1v-32k | $15 | $70 | 32K | ||
| step-1o-audio | $25 | $60 | ? | ||
| step-2-16k-exp | $38 | $120 | 16K | ||
| step-2-16k | $38 | $120 | 16K | ||
| step-1x-edit | Free | ? | |||
| step-1x-medium | $? | $? | ? | ||
| step-2x-large | Free | ? | |||
| step-asr-1.1-stream | $? | $? | ? | ||
| step-asr-1.1 | $? | $? | ? | ||
| step-asr | $? | $? | ? | ||
| step-audio-r1.1 | Free | ? | |||
| step-gui | Free | ? | |||
| step-image-edit-2 | $? | $? | ? | ||
| step-tts-2 | $? | $? | ? | ||
| step-tts-mini | $? | $? | ? | ||
| stepaudio-2-asr-pro | $? | $? | ? | ||
| stepaudio-2.5-asr | $? | $? | ? | ||
| stepaudio-2.5-tts | $? | $? | ? |
ERNIE β models from China's search giant. 8 models available.
| Model | Input $/1M | Output $/1M | Context | Tool Call | Reasoning |
|---|---|---|---|---|---|
| deepseek-v4-flash | $0.126 | $0.252 | 1M | β | β |
| deepseek-v3.2 | $0.252 | $0.378 | 131K | β | β |
| minimax-m2.5 | $0.27 | $1.08 | 196K | β | β |
| qianfan-ocr-fast | $0.6799999999999999 | $2.81 | 65K | ||
| glm-5 | $0.7 | $2.24 | 202K | β | β |
| glm-5.1 | $0.98 | $3.08 | 202K | β | β |
| deepseek-v4-pro | $1.521 | $3.042 | 716K | β | β |
| cobuddy | Free | 131K | β |
All data is sourced from first-party APIs β not third-party aggregators. Pricing, context windows, and capabilities are verified against official provider documentation. Aggregator providers (OpenRouter, Requesty, etc.) are labeled as such β they provide access to other providers' models.