Jonatan Mata (jonmatum.com)
© 2026 Jonatan Mata. All rights reserved.
Concepts

Function Calling

LLM capability to generate structured calls to external functions based on natural language, enabling integration with APIs, databases, and real-world tools.

evergreen · #function-calling #tool-use #llm #api #json #structured-output

What it is

Function calling is the capability of an LLM to decide when to invoke an external function and generate the necessary arguments in structured format (typically JSON). The model doesn't execute the function — it generates the call specification that the host system executes.

This capability transforms LLMs from text generators into action orchestrators.

How it works

  1. Definition: the model is provided a schema of available functions (name, description, parameters)
  2. Decision: given a query, the model decides whether to call a function or respond directly
  3. Generation: if it decides to call, it generates JSON with the function name and arguments
  4. Execution: the host system executes the actual function
  5. Continuation: the result is returned to the model to formulate the final response

Example flow

User: "What's the weather in Madrid?"

Model generates:
{
  "function": "get_weather",
  "arguments": { "city": "Madrid", "units": "celsius" }
}

System executes get_weather("Madrid", "celsius") → "18°C, partly cloudy"

Model responds: "In Madrid it's 18°C with partly cloudy skies."
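The host-side half of this flow can be sketched in a few lines of Python. Everything here is illustrative: get_weather is a stub standing in for a real API, and the registry and parsing logic are one possible design, not a provider's API.

```python
import json

def get_weather(city: str, units: str = "celsius") -> str:
    # Stub standing in for a real weather API call.
    return f"18°{'C' if units == 'celsius' else 'F'}, partly cloudy"

# Hypothetical registry mapping function names to implementations.
REGISTRY = {"get_weather": get_weather}

def execute_call(raw_call: str) -> str:
    """Parse the model-generated JSON call and run the matching function."""
    call = json.loads(raw_call)
    fn = REGISTRY[call["function"]]
    return fn(**call["arguments"])

# The JSON the model generated in the example above:
model_output = '{"function": "get_weather", "arguments": {"city": "Madrid", "units": "celsius"}}'
result = execute_call(model_output)  # "18°C, partly cloudy"
```

The result string is what the host would send back to the model in step 5 (continuation).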

Example with the Anthropic API

import anthropic
 
client = anthropic.Anthropic()
 
tools = [{
    "name": "get_weather",
    "description": "Gets the current weather for a city",
    "input_schema": {
        "type": "object",
        "properties": {
            "city": {"type": "string", "description": "City name"},
            "units": {"type": "string", "enum": ["celsius", "fahrenheit"]}
        },
        "required": ["city"]
    }
}]
 
response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    tools=tools,
    messages=[{"role": "user", "content": "What's the weather in Madrid?"}]
)
# response.content includes a tool_use block with name and input
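Processing that response can be sketched as follows. The real SDK returns typed block objects (block.type, block.name, block.input); the dicts below simulate the equivalent wire-format shape so the example stays self-contained.

```python
def extract_tool_calls(content_blocks):
    """Return (name, input) pairs for every tool_use block in a response."""
    return [
        (block["name"], block["input"])
        for block in content_blocks
        if block["type"] == "tool_use"
    ]

# Simulated response.content: a text block followed by a tool_use block.
simulated_content = [
    {"type": "text", "text": "I'll check the weather."},
    {"type": "tool_use", "id": "toolu_01", "name": "get_weather",
     "input": {"city": "Madrid", "units": "celsius"}},
]
calls = extract_tool_calls(simulated_content)
# calls == [("get_weather", {"city": "Madrid", "units": "celsius"})]
```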

Provider comparison

Feature         OpenAI                 Anthropic         Gemini              Bedrock
Parallel calls  Yes                    Yes               Yes                 Model-dependent
Tool streaming  Yes                    Yes               Yes                 Yes
Forced mode     tool_choice: required  tool_choice: any  Mode configuration  Via toolChoice
Schema format   JSON Schema            JSON Schema       Protobuf or JSON    JSON Schema

Schema design

Function description quality determines model accuracy:

  • Descriptive names: search_knowledge_base is better than search
  • Specific descriptions: "Search documents by semantic similarity in the knowledge base" is better than "Search stuff"
  • Enums over free strings: use "enum": ["celsius", "fahrenheit"] instead of "type": "string"
  • Optional parameters with defaults: reduce the model's cognitive load
  • Examples in descriptions: help the model understand expected format

Relationship with MCP

The Model Context Protocol standardizes how models discover and call tools. Function calling is the underlying mechanism; MCP is the protocol that makes it interoperable across different systems.

Usage patterns

  • Simple tools: calculator, unit conversion, date/time
  • External APIs: weather, search, databases
  • System actions: create files, send emails, execute code
  • Parallel calls: multiple functions in a single response — the model generates an array of calls that the system executes concurrently
  • Chained calls: one function's result feeds the next — the model receives the result and decides the next step
  • Structured output: using function calling not to execute functions but to force the model to generate JSON with a specific schema
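The parallel-call pattern can be sketched with a thread pool: the model emits several calls in one turn and the host runs them concurrently. Both functions here are stubs standing in for real API-backed tools.

```python
from concurrent.futures import ThreadPoolExecutor

# Stubs standing in for real API-backed tools.
def get_weather(city):
    return f"weather for {city}"

def get_time(city):
    return f"time in {city}"

REGISTRY = {"get_weather": get_weather, "get_time": get_time}

def run_parallel(calls):
    """Execute a batch of model-generated calls concurrently."""
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(REGISTRY[c["function"]], **c["arguments"])
            for c in calls
        ]
        return [f.result() for f in futures]

# A parallel batch as the model might generate it in one response:
batch = [
    {"function": "get_weather", "arguments": {"city": "Madrid"}},
    {"function": "get_time", "arguments": {"city": "Tokyo"}},
]
results = run_parallel(batch)
# results == ["weather for Madrid", "time in Tokyo"]
```

Results come back in the same order as the calls, which makes it straightforward to pair each result with its originating tool_use block.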

Considerations

  • Validation: always validate generated arguments before execution
  • Security: limit which functions are available based on context
  • Error handling: the model should be able to interpret and recover from errors
  • Clear descriptions: function description quality directly affects accuracy

Why it matters

Function calling is the mechanism that turns LLMs from text generators into agents that interact with the real world. Without it, models can only respond with text. With it, they can query databases, call APIs, and execute concrete actions.

References

  • Tool Use — Anthropic — Anthropic, 2024. Tool use guide with Claude.
  • Function Calling — Gemini — Google, 2024. Function calling implementation in Gemini.
  • Function Calling — OpenAI Cookbook — OpenAI, 2024. Practical guide with examples.
  • Tool Use — Amazon Bedrock — AWS, 2024. Tool use documentation for Bedrock.
  • Function Calling — Mistral AI — Mistral AI, 2024. Implementation in Mistral models.

Related content

  • AI Agents

    Autonomous systems that combine language models with reasoning, memory, and tool use to execute complex multi-step tasks with minimal human intervention.

  • Model Context Protocol (MCP)

    Open protocol created by Anthropic that standardizes how AI applications connect with external tools, data, and services through a universal interface.

  • Large Language Models

    Massive neural networks based on the Transformer architecture, trained on enormous text corpora to understand and generate natural language with emergent capabilities like reasoning, translation, and code generation.

  • Agentic Workflows

    Design patterns where AI agents execute complex multi-step tasks autonomously, combining reasoning, tool use, and iterative decision-making.

  • Building a Second Brain in Public

    Chronicle of building a second brain with a knowledge graph, bilingual pipeline, and agent endpoints — in days, not weeks, and what that teaches about the gap between theory and working systems.

  • Tool Use Patterns

    Design strategies and patterns for AI agents to select, invoke, and combine external tools effectively to complete complex tasks.

  • Chain-of-Thought

    Prompting technique that improves LLM reasoning by asking them to decompose complex problems into explicit intermediate steps before reaching a conclusion.

  • AI Orchestration

    Patterns and frameworks for coordinating multiple AI models, tools, and data sources in production pipelines, managing flow between components, memory, and error recovery.
