Compare 2,350 AI models with tool/function calling across 95 providers. Find the best model for agents, automation, and API integration.
The top models with tool calling compared side by side.
| Model | Provider | Input $/1M | Output $/1M | Context | Reasoning |
|---|---|---|---|---|---|
| gpt-4o | openai | $2.5 | $10 | 128K | |
| gpt-4o-mini | openai | $0.15 | $0.6 | 128K | |
| gpt-4.1 | openai | $2 | $8 | 1M | |
| gpt-4.1-mini | openai | $0.4 | $1.6 | 1M | |
| gpt-4.1-nano | openai | $0.1 | $0.4 | 1M | |
| o3 | openai | $10 | $40 | 200K | β |
| o3-mini | openai | $1.1 | $4.4 | 200K | β |
| o4-mini | openai | $1.1 | $4.4 | 200K | β |
| gemini-2.0-flash | $0.1 | $0.4 | 1M | ||
| deepseek-chat | deepseek | $0.14 | $0.28 | 1M | |
| qwen3-235b-a22b | alibaba | $2 | $8 | ? | β |
| llama-4-maverick | digitalocean | $0.25 | $0.87 | 1M | |
| llama-4-scout | google-vertex | $0.25 | $0.7 | 1M |
Most affordable models with tool calling β for cost-sensitive agents and automation.
| Model | Provider | Input $/1M | Output $/1M | Context | Reasoning |
|---|---|---|---|---|---|
| ling-2.6-flash | inclusionai | $0.01 | $0.03 | 262K | |
| bdc-coder | inferencenet | $0.01 | $0.01 | 131K | |
| klusterai--Meta-Llama-3.1-8B-Instruct-Turbo | klusterai | $0.015 | $0.02 | 131K | |
| granite-4.0-h-micro | cloudflare | $0.017 | $0.112 | 131K | |
| llama-3.1-8b-instruct--fp-16 | inferencenet | $0.02 | $0.03 | 131K | |
| schematron-3b | inferencenet | $0.02 | $0.05 | 131K | |
| schematron-v3 | inferencenet | $0.02 | $0.05 | 131K | |
| gpt-oss-20b | inferencenet | $0.03 | $0.15 | 131K | |
| schematron-v2-turbo | inferencenet | $0.03 | $0.15 | 131K | |
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? | β |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K | β |
| liquid-ai--LFM2-24B-A2B | togetherai | $0.03 | $0.12 | 131K | |
| amazon-nova-micro | amazon | $0.035 | $0.14 | 128K | |
| amazon-nova-micro | amazon-bedrock | $0.035 | $0.14 | 128K | |
| mistral-nemo-12b-instruct--fp-8 | inferencenet | $0.0375 | $0.1 | 131K |
54 models with tool calling at zero cost β perfect for prototyping agents.
| Model | Provider | Context | Reasoning |
|---|---|---|---|
| openrouter--owl-alpha | openrouter | 1M | |
| deepseek--deepseek-v4-flash--free | openrouter | 1M | β |
| qwen--qwen3-coder--free | openrouter | 1M | |
| nvidia--nemotron-3-super-120b-a12b--free | openrouter | 1M | β |
| gemma-4-26b-a4b-it | auriko | 262K | β |
| gemma-4-31b-it | auriko | 262K | β |
| arcee-ai--trinity-large-thinking--free | openrouter | 262K | β |
| google--gemma-4-26b-a4b-it--free | openrouter | 262K | β |
| google--gemma-4-31b-it--free | openrouter | 262K | β |
| nvidia--nemotron-3-nano-omni-30b-a3b-reasoning--free | openrouter | 256K | β |
278 models with tool calling you can run locally β for privacy-first agents.
| Model | Provider | Context | Reasoning |
|---|---|---|---|
| google--gemma-4-31b-it | orcarouter | 1M | |
| qwen--qwen3.5-flash-2026-02-23 | orcarouter | 1M | |
| qwen--qwen3.5-flash | orcarouter | 1M | |
| qwen--qwen3.6-flash-2026-04-16 | orcarouter | 1M | |
| qwen--qwen3.6-flash | orcarouter | 1M | |
| meta-llama-4-maverick-17b | amazon-bedrock | 1M | |
| meta-llama-4-scout-17b | amazon-bedrock | 1M | |
| minimax-m2-1 | amazon-bedrock | 1M | |
| minimax-m2-5 | amazon-bedrock | 1M | |
| minimax-m2 | amazon-bedrock | 1M |
Models with both tool calling and reasoning β the most capable for complex agentic workflows that need planning and execution.
| Model | Provider | Input $/1M | Output $/1M | Context |
|---|---|---|---|---|
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K |
| gpt-oss-120b | inferencenet | $0.05 | $0.45 | 131K |
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K |
| qwen3-30b-a3b-fp8 | cloudflare | $0.051 | $0.335 | 40K |
| glm-4.7-flash | cloudflare | $0.06 | $0.4 | 131K |
| Nemotron-3-Nano-Omni | nebius | $0.06 | $0.24 | 128K |
| hermes-4-llama-3.1-8b | nousresearch | $0.06 | $0.12 | 131K |
| seed-1.6-flash | bytedance | $0.07 | $0.3 | 262K |
| ring-2.6-1t | inclusionai | $0.07 | $0.62 | 262K |
| zai-org--glm-4.7-flash | novitaai | $0.07 | $0.4 | 200K |
| microsoft-phi-4-mini-reasoning | microsoft | $0.075 | $0.3 | 128K |
| Qwen--Qwen3-32B-TEE | chutes | $0.08 | $0.24 | 40K |
| gpt-oss-120b | clarifai | $0.09 | $0.36 | 131K |
Models with tool calling and image understanding β for agents that need to see and act.
| Model | Provider | Input $/1M | Output $/1M | Context |
|---|---|---|---|---|
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? |
| qwen3.6-35b-fast | neuralwatt | $0.05 | $0.1 | ? |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K |
| amazon-nova-lite | amazon | $0.06 | $0.24 | 300K |
| amazon-nova-lite | amazon-bedrock | $0.06 | $0.24 | 300K |
| Nemotron-3-Nano-Omni | nebius | $0.06 | $0.24 | 128K |
| openai--gpt-5-nano | aimlapi | $0.065 | $0.52 | 400K |
| seed-1.6-flash | bytedance | $0.07 | $0.3 | 262K |
| gemini-1.5-flash-8b | $0.075 | $0.3 | 1M | |
| gemini-1.5-flash | $0.075 | $0.3 | 1M | |
| gemini-2.0-flash-lite | $0.075 | $0.3 | 1M | |
| gemini-2-0-flash-lite | google-vertex | $0.075 | $0.3 | 1M |
| microsoft-phi-4-mini-multimodal | microsoft | $0.08 | $0.32 | 128K |
| qwen--qwen3-vl-8b-instruct | novitaai | $0.08 | $0.5 | 131K |
| seed-2.0-mini | bytedance | $0.1 | $0.4 | 262K |
Models with tool calling and large context windows β for agents processing long documents or complex multi-step tasks.
| Model | Provider | Context | Input $/1M | Reasoning |
|---|---|---|---|---|
| ling-2.6-flash | inclusionai | 262K | $0.01 | |
| bdc-coder | inferencenet | 131K | $0.01 | |
| klusterai--Meta-Llama-3.1-8B-Instruct-Turbo | klusterai | 131K | $0.015 | |
| granite-4.0-h-micro | cloudflare | 131K | $0.017 | |
| llama-3.1-8b-instruct--fp-16 | inferencenet | 131K | $0.02 | |
| schematron-3b | inferencenet | 131K | $0.02 | |
| schematron-v3 | inferencenet | 131K | $0.02 | |
| gpt-oss-20b | inferencenet | 131K | $0.03 | |
| schematron-v2-turbo | inferencenet | 131K | $0.03 | |
| qwen--qwen3-4b-fp8 | novitaai | 128K | $0.03 | β |
| liquid-ai--LFM2-24B-A2B | togetherai | 131K | $0.03 | |
| amazon-nova-micro | amazon | 128K | $0.035 | |
| amazon-nova-micro | amazon-bedrock | 128K | $0.035 | |
| mistral-nemo-12b-instruct--fp-8 | inferencenet | 131K | $0.0375 | |
| klusterai--Meta-Llama-3.3-70B-Instruct-Turbo | klusterai | 131K | $0.038 |
All data is sourced from first-party APIs. Tool calling capability is defined by the provider's own classification β models that support function/tool calling via their API. Aggregator providers are excluded from ranking tables to avoid duplicate models.