🔧 Tool Calling AI Models Compared (2025)

Compare 2,350 AI models with tool/function calling across 95 providers. Find the best model for agents, automation, and API integration.

2,350Tool Calling Models

95Providers

81Free

527Open Weights

🔍 Interactive Catalog ⭐ Star on GitHub

💡 What is tool calling? Tool calling (also called function calling) lets LLMs invoke external APIs, databases, and services. This is the foundation of AI agents — without tool calling, a model can only generate text. With it, models can search the web, run code, query databases, and take real-world actions.

🏆 Flagship Tool Calling Models — Head to Head

The top models with tool calling compared side by side.

Model	Provider	Input $/1M	Output $/1M	Context	Reasoning
gpt-4o	openai	$2.5	$10	128K
gpt-4o-mini	openai	$0.15	$0.6	128K
gpt-4.1	openai	$2	$8	1M
gpt-4.1-mini	openai	$0.4	$1.6	1M
gpt-4.1-nano	openai	$0.1	$0.4	1M
o3	openai	$10	$40	200K	✅
o3-mini	openai	$1.1	$4.4	200K	✅
o4-mini	openai	$1.1	$4.4	200K	✅
gemini-2.0-flash	google	$0.1	$0.4	1M
deepseek-chat	deepseek	$0.14	$0.28	1M
qwen3-235b-a22b	alibaba	$2	$8	?	✅
llama-4-maverick	digitalocean	$0.25	$0.87	1M
llama-4-scout	google-vertex	$0.25	$0.7	1M

💰 Cheapest Tool Calling Models

Most affordable models with tool calling — for cost-sensitive agents and automation.

Model	Provider	Input $/1M	Output $/1M	Context	Reasoning
ling-2.6-flash	inclusionai	$0.01	$0.03	262K
bdc-coder	inferencenet	$0.01	$0.01	131K
klusterai--Meta-Llama-3.1-8B-Instruct-Turbo	klusterai	$0.015	$0.02	131K
granite-4.0-h-micro	cloudflare	$0.017	$0.112	131K
llama-3.1-8b-instruct--fp-16	inferencenet	$0.02	$0.03	131K
schematron-3b	inferencenet	$0.02	$0.05	131K
schematron-v3	inferencenet	$0.02	$0.05	131K
gpt-oss-20b	inferencenet	$0.03	$0.15	131K
schematron-v2-turbo	inferencenet	$0.03	$0.15	131K
openai--gpt-oss-20b	neuralwatt	$0.03	$0.16	?	✅
qwen--qwen3-4b-fp8	novitaai	$0.03	$0.03	128K	✅
liquid-ai--LFM2-24B-A2B	togetherai	$0.03	$0.12	131K
amazon-nova-micro	amazon	$0.035	$0.14	128K
amazon-nova-micro	amazon-bedrock	$0.035	$0.14	128K
mistral-nemo-12b-instruct--fp-8	inferencenet	$0.0375	$0.1	131K

🆓 Free Tool Calling Models

54 models with tool calling at zero cost — perfect for prototyping agents.

Model	Provider	Context	Reasoning
openrouter--owl-alpha	openrouter	1M
deepseek--deepseek-v4-flash--free	openrouter	1M	✅
qwen--qwen3-coder--free	openrouter	1M
nvidia--nemotron-3-super-120b-a12b--free	openrouter	1M	✅
gemma-4-26b-a4b-it	auriko	262K	✅
gemma-4-31b-it	auriko	262K	✅
arcee-ai--trinity-large-thinking--free	openrouter	262K	✅
google--gemma-4-26b-a4b-it--free	openrouter	262K	✅
google--gemma-4-31b-it--free	openrouter	262K	✅
nvidia--nemotron-3-nano-omni-30b-a3b-reasoning--free	openrouter	256K	✅

🔓 Open-Weight Tool Calling Models

278 models with tool calling you can run locally — for privacy-first agents.

Model	Provider	Context
google--gemma-4-31b-it	orcarouter	1M
qwen--qwen3.5-flash-2026-02-23	orcarouter	1M
qwen--qwen3.5-flash	orcarouter	1M
qwen--qwen3.6-flash-2026-04-16	orcarouter	1M
qwen--qwen3.6-flash	orcarouter	1M
meta-llama-4-maverick-17b	amazon-bedrock	1M
meta-llama-4-scout-17b	amazon-bedrock	1M
minimax-m2-1	amazon-bedrock	1M
minimax-m2-5	amazon-bedrock	1M
minimax-m2	amazon-bedrock	1M

🧠 Tool Calling + Reasoning

Models with both tool calling and reasoning — the most capable for complex agentic workflows that need planning and execution.

Model	Provider	Input $/1M	Output $/1M	Context
openai--gpt-oss-20b	neuralwatt	$0.03	$0.16	?
qwen--qwen3-4b-fp8	novitaai	$0.03	$0.03	128K
gpt-oss-120b	inferencenet	$0.05	$0.45	131K
Qwen--Qwen3.6-35B-A3B	neuralwatt	$0.05	$0.1	?
openai--gpt-oss-120b	novitaai	$0.05	$0.25	131K
qwen3-30b-a3b-fp8	cloudflare	$0.051	$0.335	40K
glm-4.7-flash	cloudflare	$0.06	$0.4	131K
Nemotron-3-Nano-Omni	nebius	$0.06	$0.24	128K
hermes-4-llama-3.1-8b	nousresearch	$0.06	$0.12	131K
seed-1.6-flash	bytedance	$0.07	$0.3	262K
ring-2.6-1t	inclusionai	$0.07	$0.62	262K
zai-org--glm-4.7-flash	novitaai	$0.07	$0.4	200K
microsoft-phi-4-mini-reasoning	microsoft	$0.075	$0.3	128K
Qwen--Qwen3-32B-TEE	chutes	$0.08	$0.24	40K
gpt-oss-120b	clarifai	$0.09	$0.36	131K

👁️ Tool Calling + Vision

Models with tool calling and image understanding — for agents that need to see and act.

Model	Provider	Input $/1M	Output $/1M	Context
Qwen--Qwen3.6-35B-A3B	neuralwatt	$0.05	$0.1	?
qwen3.6-35b-fast	neuralwatt	$0.05	$0.1	?
openai--gpt-oss-120b	novitaai	$0.05	$0.25	131K
amazon-nova-lite	amazon	$0.06	$0.24	300K
amazon-nova-lite	amazon-bedrock	$0.06	$0.24	300K
Nemotron-3-Nano-Omni	nebius	$0.06	$0.24	128K
openai--gpt-5-nano	aimlapi	$0.065	$0.52	400K
seed-1.6-flash	bytedance	$0.07	$0.3	262K
gemini-1.5-flash-8b	google	$0.075	$0.3	1M
gemini-1.5-flash	google	$0.075	$0.3	1M
gemini-2.0-flash-lite	google	$0.075	$0.3	1M
gemini-2-0-flash-lite	google-vertex	$0.075	$0.3	1M
microsoft-phi-4-mini-multimodal	microsoft	$0.08	$0.32	128K
qwen--qwen3-vl-8b-instruct	novitaai	$0.08	$0.5	131K
seed-2.0-mini	bytedance	$0.1	$0.4	262K

📏 Tool Calling + Large Context (128K+)

Models with tool calling and large context windows — for agents processing long documents or complex multi-step tasks.

Model	Provider	Context	Input $/1M	Reasoning
ling-2.6-flash	inclusionai	262K	$0.01
bdc-coder	inferencenet	131K	$0.01
klusterai--Meta-Llama-3.1-8B-Instruct-Turbo	klusterai	131K	$0.015
granite-4.0-h-micro	cloudflare	131K	$0.017
llama-3.1-8b-instruct--fp-16	inferencenet	131K	$0.02
schematron-3b	inferencenet	131K	$0.02
schematron-v3	inferencenet	131K	$0.02
gpt-oss-20b	inferencenet	131K	$0.03
schematron-v2-turbo	inferencenet	131K	$0.03
qwen--qwen3-4b-fp8	novitaai	128K	$0.03	✅
liquid-ai--LFM2-24B-A2B	togetherai	131K	$0.03
amazon-nova-micro	amazon	128K	$0.035
amazon-nova-micro	amazon-bedrock	128K	$0.035
mistral-nemo-12b-instruct--fp-8	inferencenet	131K	$0.0375
klusterai--Meta-Llama-3.3-70B-Instruct-Turbo	klusterai	131K	$0.038

📊 Methodology

All data is sourced from first-party APIs. Tool calling capability is defined by the provider's own classification — models that support function/tool calling via their API. Aggregator providers are excluded from ranking tables to avoid duplicate models.

🔗 More Resources

Interactive Catalog — search, filter, compare all models
AI Model Pricing Calculator — LLM cost calculator
Best AI Models for Agents — agentic model comparison
Best AI Models — curated by use case
Best AI Models for Coding — code-focused comparison
Reasoning Models Comparison — o1, R1, Claude, Gemini
Free AI Models — 81 models at zero cost
LLM Pricing Comparison — detailed pricing tables
Cheapest AI Models — lowest price LLMs
OpenAI Alternatives — 95 providers compared
AI Models by Provider — browse by provider
Context Window Comparison — largest context LLMs
GitHub Repository 🔓 Open Source AI Models (527 models) 🎨 Multimodal AI Models (1,548 models) State of AI Models 2025 Benchmarks — star, fork, contribute
Best AI Models for Image Generation — DALL·E, Imagen, GPT-5 Image compared
Best AI Models for Vision — GPT-4o, Claude, Gemini vision compared
Structured Output Models Comparison — JSON mode, function calling compared

Small Language Models

🎯 AI Model Picker

⚡ GitHub Action