Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with improvements across system design, code security, and specification adherence. The model is designed for extended autonomous operation, maintaining task continuity across sessions and providing fact-based progress tracking. Sonnet 4.5 also introduces stronger agentic capabilities, including improved tool orchestration, speculative parallel execution, and more efficient context and memory management. With enhanced context tracking and awareness of token usage across tool calls, it is particularly well-suited for multi-context and long-running workflows. Use cases span software engineering, cybersecurity, financial analysis, research agents, and other domains requiring sustained reasoning and tool use.
By anthropic
Recent activity on Claude Sonnet 4.5
Throughput (tokens/s)
Provider          Min     Max      Avg
Anthropic         4.09    36.35    9.85
Amazon Bedrock    2.73    26.34    15.61
First Token Latency (ms)
Provider          Min     Max      Avg
Anthropic         1996    2712     2179.60
Amazon Bedrock    3595    12714    6812.33
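As a rough way to read the throughput and first-token latency figures together, total response time can be approximated as first-token latency plus output length divided by throughput. The sketch below assumes throughput describes output tokens generated after the first token; the function name `estimate_response_seconds` and the 500-token output length are illustrative and not part of any ZenMux API.

```python
def estimate_response_seconds(first_token_ms: float, tokens_per_s: float, output_tokens: int) -> float:
    """Approximate total response time: time to first token plus generation time."""
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return first_token_ms / 1000.0 + output_tokens / tokens_per_s


# Average figures reported above for each provider; 500 output tokens is an arbitrary example.
anthropic_s = estimate_response_seconds(2179.60, 9.85, 500)
bedrock_s = estimate_response_seconds(6812.33, 15.61, 500)
print(f"Anthropic: {anthropic_s:.1f} s, Amazon Bedrock: {bedrock_s:.1f} s")
```

Because the averages hide a wide min-to-max spread, this is only an order-of-magnitude estimate.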
Providers for Claude Sonnet 4.5
ZenMux routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.
Latency: 2.89 s
Throughput: 43.29 tps
Uptime: 100.00%
Recent uptime: Oct 10, 2025 - 3 PM, 100.00%
Price
Tiered pricing: 0 <= Input < 200k
Input: $3 / M tokens
Output: $15 / M tokens
Cache read: $0.3 / M tokens
Cache write 5m: $3.75 / M tokens
Cache write 1h: $6 / M tokens
Cache write: -
Web search: $0.01 / request
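The per-million-token rates above translate into a request cost by simple arithmetic. The following is a minimal sketch, assuming the request falls in the listed tier (input below 200k tokens) and that cached tokens are billed separately from regular input tokens; the names `PRICE_PER_MTOK` and `estimate_cost_usd` are hypothetical helpers, not part of any SDK.

```python
# Prices in USD per million tokens, copied from the listing above.
PRICE_PER_MTOK = {
    "input": 3.00,
    "output": 15.00,
    "cache_read": 0.30,
    "cache_write_5m": 3.75,
    "cache_write_1h": 6.00,
}
WEB_SEARCH_PER_REQUEST = 0.01  # USD per web search request


def estimate_cost_usd(tokens: dict, web_searches: int = 0) -> float:
    """Sum token counts times the per-million rates, plus web search requests."""
    total = 0.0
    for kind, count in tokens.items():
        if kind not in PRICE_PER_MTOK:
            raise KeyError(f"unknown token kind: {kind}")
        total += count / 1_000_000 * PRICE_PER_MTOK[kind]
    return total + web_searches * WEB_SEARCH_PER_REQUEST


# 100k input tokens, 2k output tokens, one web search.
cost = estimate_cost_usd({"input": 100_000, "output": 2_000}, web_searches=1)
print(f"${cost:.2f}")  # prints $0.34
```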
Model limitation
Context: 200.00K
Max output: 64.00K
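One way to apply these two limits on the client side is to clamp the requested output budget before sending a request. This sketch assumes that "K" means 1,000 tokens and that prompt and output tokens share the same context window; both are assumptions, and `clamp_max_completion_tokens` is a hypothetical helper.

```python
CONTEXT_LIMIT = 200_000  # "200.00K" from the listing, assuming K = 1,000 tokens
MAX_OUTPUT = 64_000      # "64.00K" from the listing, same assumption


def clamp_max_completion_tokens(prompt_tokens: int, requested: int) -> int:
    """Return an output budget that respects both listed limits.

    Assumes the prompt and the output share the context window.
    """
    remaining = CONTEXT_LIMIT - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested, MAX_OUTPUT, remaining)


print(clamp_max_completion_tokens(prompt_tokens=190_000, requested=64_000))
```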
Supported Parameters
Supported: max_completion_tokens, temperature, top_p, stop, tools, tool_choice, parallel_tool_calls
Not supported (-): frequency_penalty, presence_penalty, seed, logit_bias, logprobs, top_logprobs, response_format
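A client can use this list to drop parameters the provider does not accept before sending a request. The sketch below assumes that a "-" next to a parameter in the listing means it is not supported; `SUPPORTED_PARAMS` and `split_params` are illustrative names, and the function is meant for optional sampling and tool parameters rather than required fields such as `model` and `messages`.

```python
# Parameters listed without a "-" marker (assumed to mean "supported").
SUPPORTED_PARAMS = {
    "max_completion_tokens", "temperature", "top_p",
    "stop", "tools", "tool_choice", "parallel_tool_calls",
}


def split_params(params: dict) -> tuple[dict, list]:
    """Separate request parameters into those listed as supported and the rest."""
    kept = {k: v for k, v in params.items() if k in SUPPORTED_PARAMS}
    dropped = sorted(k for k in params if k not in SUPPORTED_PARAMS)
    return kept, dropped


kept, dropped = split_params({"temperature": 0.2, "seed": 7, "logprobs": True})
print(kept)     # {'temperature': 0.2}
print(dropped)  # ['logprobs', 'seed']
```

Returning the dropped names, rather than silently discarding them, lets the caller log or reject requests that depend on an unsupported option.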
Model Protocol Compatibility: openai, anthropic
Data policy
Prompt training: false
Prompt logging: 30-day retention
Moderation: Responsibility of developer
Latency: -
Throughput: -
Uptime: 100.00%
Recent uptime: Oct 10, 2025 - 3 PM, 100.00%
Price
Tiered pricing: 0 <= Input < 200k
Input: $3 / M tokens
Output: $15 / M tokens
Cache read: $0.3 / M tokens
Cache write 5m: $3.75 / M tokens
Cache write 1h: $6 / M tokens
Cache write: -
Web search: $0.01 / request
Model limitation
Context: 200.00K
Max output: 64.00K
Supported Parameters
Supported: max_completion_tokens, temperature, top_p, stop, tools, tool_choice, parallel_tool_calls
Not supported (-): frequency_penalty, presence_penalty, seed, logit_bias, logprobs, top_logprobs, response_format
Model Protocol Compatibility: openai, anthropic
Data policy
Prompt training: false
Prompt logging: 30-day retention
Moderation: Responsibility of developer
Sample code and API for Claude Sonnet 4.5
ZenMux normalizes requests and responses across providers for you.
python
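from openai import OpenAI

# Point the OpenAI client at the ZenMux endpoint instead of the default OpenAI URL.
client = OpenAI(
    base_url="https://zenmux.ai/api/v1",
    api_key="<ZenMux_API_KEY>",  # replace with your own key
)

# Send a single user message. The content list here holds one text part;
# no image is attached to the request.
completion = client.chat.completions.create(
    model="anthropic/claude-sonnet-4.5",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What is in this image?",
                }
            ],
        }
    ],
)

# Print the text of the first returned choice.
print(completion.choices[0].message.content)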
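The listing marks `tools`, `tool_choice`, `max_completion_tokens`, and `temperature` as supported, so a request that uses them might be assembled as in the sketch below. The `get_weather` tool, its schema, and the `build_request` helper are hypothetical and exist only for illustration; the tool definition follows the OpenAI chat-completions function format, and the block only builds and prints the request, so it runs without a network call or API key.

```python
import json

# Hypothetical tool definition in the OpenAI chat-completions "function" format.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",  # illustrative name, not a real ZenMux tool
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_request(prompt: str) -> dict:
    """Assemble keyword arguments for client.chat.completions.create."""
    return {
        "model": "anthropic/claude-sonnet-4.5",
        "messages": [{"role": "user", "content": prompt}],
        "tools": [get_weather_tool],
        "tool_choice": "auto",
        "max_completion_tokens": 1024,
        "temperature": 0.2,
    }


request = build_request("What is the weather in Paris?")
print(json.dumps(request, indent=2))
# With a configured client: completion = client.chat.completions.create(**request)
```

Keeping request construction in a plain function makes it easy to inspect or unit-test the payload before any tokens are billed.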