🔧 Tool Calling AI Models Compared (2025)

Compare 2,350 AI models with tool/function calling across 95 providers. Find the best model for agents, automation, and API integration.

2,350 Tool Calling Models
95 Providers
81 Free
527 Open Weights
πŸ” Interactive Catalog ⭐ Star on GitHub
💡 What is tool calling? Tool calling (also called function calling) lets an LLM request calls to external APIs, databases, and services: the model emits a structured call, and the host application runs it and returns the result. This is the foundation of AI agents. Without tool calling, a model can only generate text; with it, models can search the web, run code, query databases, and take real-world actions.
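To make the request-and-execute cycle concrete, here is a minimal sketch of the application side of tool calling. It is not any provider's SDK: the `get_weather` tool, the `REGISTRY` mapping, and the shape of `simulated_call` are all hypothetical, and the sketch assumes the model returns the tool name plus its arguments as a JSON string, which is a common convention but varies by provider.

```python
import json

# Hypothetical tool: a stand-in for a real external API call.
def get_weather(city: str) -> dict:
    """Return canned weather data; a real tool would query a service."""
    return {"city": city, "temp_c": 21}

# Tool description in JSON Schema style (assumed format, sent to the model).
TOOLS = [{
    "name": "get_weather",
    "description": "Look up the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

# Registry mapping tool names to the Python callables that implement them.
REGISTRY = {"get_weather": get_weather}

def execute_tool_call(call: dict) -> str:
    """Run one model-requested call and return the result as a JSON string."""
    func = REGISTRY.get(call["name"])
    if func is None:
        return json.dumps({"error": f"unknown tool: {call['name']}"})
    args = json.loads(call["arguments"])  # assumed: arguments are a JSON string
    return json.dumps(func(**args))

# Simulated model output; a real one would come from a provider's API response.
simulated_call = {"name": "get_weather", "arguments": '{"city": "Oslo"}'}
result = execute_tool_call(simulated_call)
print(result)
```

In a full agent loop, the application would send `result` back to the model as a tool message so the model can use it in its next turn. Unknown tool names are returned as an error payload rather than raised, so the model has a chance to recover.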

πŸ† Flagship Tool Calling Models β€” Head to Head

The top models with tool calling compared side by side.

| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context | Reasoning |
| --- | --- | --- | --- | --- | --- |
| gpt-4o | openai | $2.5 | $10 | 128K | |
| gpt-4o-mini | openai | $0.15 | $0.6 | 128K | |
| gpt-4.1 | openai | $2 | $8 | 1M | |
| gpt-4.1-mini | openai | $0.4 | $1.6 | 1M | |
| gpt-4.1-nano | openai | $0.1 | $0.4 | 1M | |
| o3 | openai | $10 | $40 | 200K | ✅ |
| o3-mini | openai | $1.1 | $4.4 | 200K | ✅ |
| o4-mini | openai | $1.1 | $4.4 | 200K | ✅ |
| gemini-2.0-flash | google | $0.1 | $0.4 | 1M | |
| deepseek-chat | deepseek | $0.14 | $0.28 | 1M | |
| qwen3-235b-a22b | alibaba | $2 | $8 | ? | ✅ |
| llama-4-maverick | digitalocean | $0.25 | $0.87 | 1M | |
| llama-4-scout | google-vertex | $0.25 | $0.7 | 1M | |
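Because the prices above are quoted per one million tokens, the cost of a single request has to be scaled down from them. The sketch below shows the arithmetic, using three rows from the table. The `request_cost` helper and the 10,000-input / 2,000-output token workload are illustrative assumptions, not part of the catalog.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Cost in dollars of one request, given prices per 1M tokens."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000

# (input $/1M, output $/1M) copied from the flagship table above.
PRICES = {
    "gpt-4o-mini": (0.15, 0.60),
    "gpt-4.1": (2.00, 8.00),
    "o3": (10.00, 40.00),
}

# Hypothetical workload: 10,000 prompt tokens and 2,000 completion tokens.
for model, (inp, out) in PRICES.items():
    cost = request_cost(10_000, 2_000, inp, out)
    print(f"{model}: ${cost:.4f}")
```

For this workload the estimate comes to roughly $0.0027 for gpt-4o-mini, $0.036 for gpt-4.1, and $0.18 for o3. Agents often make several model calls per task and resend tool definitions and tool results as input each time, so real per-task costs can be a multiple of the single-request figure.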

💰 Cheapest Tool Calling Models

Most affordable models with tool calling, for cost-sensitive agents and automation.

| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context | Reasoning |
| --- | --- | --- | --- | --- | --- |
| ling-2.6-flash | inclusionai | $0.01 | $0.03 | 262K | |
| bdc-coder | inferencenet | $0.01 | $0.01 | 131K | |
| klusterai--Meta-Llama-3.1-8B-Instruct-Turbo | klusterai | $0.015 | $0.02 | 131K | |
| granite-4.0-h-micro | cloudflare | $0.017 | $0.112 | 131K | |
| llama-3.1-8b-instruct--fp-16 | inferencenet | $0.02 | $0.03 | 131K | |
| schematron-3b | inferencenet | $0.02 | $0.05 | 131K | |
| schematron-v3 | inferencenet | $0.02 | $0.05 | 131K | |
| gpt-oss-20b | inferencenet | $0.03 | $0.15 | 131K | |
| schematron-v2-turbo | inferencenet | $0.03 | $0.15 | 131K | |
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? | ✅ |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K | ✅ |
| liquid-ai--LFM2-24B-A2B | togetherai | $0.03 | $0.12 | 131K | |
| amazon-nova-micro | amazon | $0.035 | $0.14 | 128K | |
| amazon-nova-micro | amazon-bedrock | $0.035 | $0.14 | 128K | |
| mistral-nemo-12b-instruct--fp-8 | inferencenet | $0.0375 | $0.1 | 131K | |

🆓 Free Tool Calling Models

54 models with tool calling at zero cost, well suited to prototyping agents.

| Model | Provider | Context | Reasoning |
| --- | --- | --- | --- |
| openrouter--owl-alpha | openrouter | 1M | |
| deepseek--deepseek-v4-flash--free | openrouter | 1M | ✅ |
| qwen--qwen3-coder--free | openrouter | 1M | |
| nvidia--nemotron-3-super-120b-a12b--free | openrouter | 1M | ✅ |
| gemma-4-26b-a4b-it | auriko | 262K | ✅ |
| gemma-4-31b-it | auriko | 262K | ✅ |
| arcee-ai--trinity-large-thinking--free | openrouter | 262K | ✅ |
| google--gemma-4-26b-a4b-it--free | openrouter | 262K | ✅ |
| google--gemma-4-31b-it--free | openrouter | 262K | ✅ |
| nvidia--nemotron-3-nano-omni-30b-a3b-reasoning--free | openrouter | 256K | ✅ |

🔓 Open-Weight Tool Calling Models

278 models with tool calling whose weights you can download and run locally, for privacy-first agents.

| Model | Provider | Context | Reasoning |
| --- | --- | --- | --- |
| google--gemma-4-31b-it | orcarouter | 1M | |
| qwen--qwen3.5-flash-2026-02-23 | orcarouter | 1M | |
| qwen--qwen3.5-flash | orcarouter | 1M | |
| qwen--qwen3.6-flash-2026-04-16 | orcarouter | 1M | |
| qwen--qwen3.6-flash | orcarouter | 1M | |
| meta-llama-4-maverick-17b | amazon-bedrock | 1M | |
| meta-llama-4-scout-17b | amazon-bedrock | 1M | |
| minimax-m2-1 | amazon-bedrock | 1M | |
| minimax-m2-5 | amazon-bedrock | 1M | |
| minimax-m2 | amazon-bedrock | 1M | |

🧠 Tool Calling + Reasoning

Models with both tool calling and reasoning, suited to complex agentic workflows that need planning and execution. Listed by ascending input price.

| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context |
| --- | --- | --- | --- | --- |
| openai--gpt-oss-20b | neuralwatt | $0.03 | $0.16 | ? |
| qwen--qwen3-4b-fp8 | novitaai | $0.03 | $0.03 | 128K |
| gpt-oss-120b | inferencenet | $0.05 | $0.45 | 131K |
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K |
| qwen3-30b-a3b-fp8 | cloudflare | $0.051 | $0.335 | 40K |
| glm-4.7-flash | cloudflare | $0.06 | $0.4 | 131K |
| Nemotron-3-Nano-Omni | nebius | $0.06 | $0.24 | 128K |
| hermes-4-llama-3.1-8b | nousresearch | $0.06 | $0.12 | 131K |
| seed-1.6-flash | bytedance | $0.07 | $0.3 | 262K |
| ring-2.6-1t | inclusionai | $0.07 | $0.62 | 262K |
| zai-org--glm-4.7-flash | novitaai | $0.07 | $0.4 | 200K |
| microsoft-phi-4-mini-reasoning | microsoft | $0.075 | $0.3 | 128K |
| Qwen--Qwen3-32B-TEE | chutes | $0.08 | $0.24 | 40K |
| gpt-oss-120b | clarifai | $0.09 | $0.36 | 131K |

πŸ‘οΈ Tool Calling + Vision

Models with tool calling and image understanding β€” for agents that need to see and act.

| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context |
| --- | --- | --- | --- | --- |
| Qwen--Qwen3.6-35B-A3B | neuralwatt | $0.05 | $0.1 | ? |
| qwen3.6-35b-fast | neuralwatt | $0.05 | $0.1 | ? |
| openai--gpt-oss-120b | novitaai | $0.05 | $0.25 | 131K |
| amazon-nova-lite | amazon | $0.06 | $0.24 | 300K |
| amazon-nova-lite | amazon-bedrock | $0.06 | $0.24 | 300K |
| Nemotron-3-Nano-Omni | nebius | $0.06 | $0.24 | 128K |
| openai--gpt-5-nano | aimlapi | $0.065 | $0.52 | 400K |
| seed-1.6-flash | bytedance | $0.07 | $0.3 | 262K |
| gemini-1.5-flash-8b | google | $0.075 | $0.3 | 1M |
| gemini-1.5-flash | google | $0.075 | $0.3 | 1M |
| gemini-2.0-flash-lite | google | $0.075 | $0.3 | 1M |
| gemini-2-0-flash-lite | google-vertex | $0.075 | $0.3 | 1M |
| microsoft-phi-4-mini-multimodal | microsoft | $0.08 | $0.32 | 128K |
| qwen--qwen3-vl-8b-instruct | novitaai | $0.08 | $0.5 | 131K |
| seed-2.0-mini | bytedance | $0.1 | $0.4 | 262K |

πŸ“ Tool Calling + Large Context (128K+)

Models with tool calling and large context windows β€” for agents processing long documents or complex multi-step tasks.

| Model | Provider | Context | Input ($/1M tokens) | Reasoning |
| --- | --- | --- | --- | --- |
| ling-2.6-flash | inclusionai | 262K | $0.01 | |
| bdc-coder | inferencenet | 131K | $0.01 | |
| klusterai--Meta-Llama-3.1-8B-Instruct-Turbo | klusterai | 131K | $0.015 | |
| granite-4.0-h-micro | cloudflare | 131K | $0.017 | |
| llama-3.1-8b-instruct--fp-16 | inferencenet | 131K | $0.02 | |
| schematron-3b | inferencenet | 131K | $0.02 | |
| schematron-v3 | inferencenet | 131K | $0.02 | |
| gpt-oss-20b | inferencenet | 131K | $0.03 | |
| schematron-v2-turbo | inferencenet | 131K | $0.03 | |
| qwen--qwen3-4b-fp8 | novitaai | 128K | $0.03 | ✅ |
| liquid-ai--LFM2-24B-A2B | togetherai | 131K | $0.03 | |
| amazon-nova-micro | amazon | 128K | $0.035 | |
| amazon-nova-micro | amazon-bedrock | 128K | $0.035 | |
| mistral-nemo-12b-instruct--fp-8 | inferencenet | 131K | $0.0375 | |
| klusterai--Meta-Llama-3.3-70B-Instruct-Turbo | klusterai | 131K | $0.038 | |
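A 128K+ filter like the one behind this table can be sketched in a few lines. The example below parses context strings such as `262K`, `1M`, and `?`, keeps models at or above a threshold, and sorts them by input price. The four sample rows come from the tables on this page. The helper names are hypothetical, and treating `K` as 1,000 tokens (rather than 1,024) is an assumption.

```python
from typing import Optional

def parse_context(text: str) -> Optional[int]:
    """Convert '131K' or '1M' to a token count; return None if unknown ('?')."""
    text = text.strip().upper()
    multipliers = {"K": 1_000, "M": 1_000_000}  # assumed decimal units
    if text and text[-1] in multipliers and text[:-1].replace(".", "", 1).isdigit():
        return int(float(text[:-1]) * multipliers[text[-1]])
    return None

# (model, provider, context, input $/1M) sample rows from this page's tables.
ROWS = [
    ("ling-2.6-flash", "inclusionai", "262K", 0.01),
    ("qwen3-30b-a3b-fp8", "cloudflare", "40K", 0.051),
    ("openai--gpt-oss-20b", "neuralwatt", "?", 0.03),
    ("amazon-nova-micro", "amazon", "128K", 0.035),
]

def large_context(rows, min_tokens: int = 128_000):
    """Keep rows whose known context meets the threshold, cheapest first."""
    kept = [r for r in rows if (parse_context(r[2]) or 0) >= min_tokens]
    return sorted(kept, key=lambda r: r[3])

selected = large_context(ROWS)
for model, provider, context, price in selected:
    print(f"{model} ({provider}): {context}, ${price}/1M input")
```

Models with an unknown context (`?`) are dropped here rather than guessed at, which is the conservative choice when an agent must fit a long document into the window.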

📊 Methodology

All data is sourced from first-party APIs. Tool calling capability follows each provider's own classification: a model is included if its provider lists it as supporting function/tool calling via its API. Aggregator providers are excluded from ranking tables to avoid duplicate models.
