Thank you for choosing our service. Add credits today to receive an additional 20% bonus. Add Credits

Anthropic: Claude Sonnet 4.5

anthropic/claude-sonnet-4.5

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with improvements across system design, code security, and specification adherence. The model is designed for extended autonomous operation, maintaining task continuity across sessions and providing fact-based progress tracking. Sonnet 4.5 also introduces stronger agentic capabilities, including improved tool orchestration, speculative parallel execution, and more efficient context and memory management. With enhanced context tracking and awareness of token usage across tool calls, it is particularly well-suited for multi-context and long-running workflows. Use cases span software engineering, cybersecurity, financial analysis, research agents, and other domains requiring sustained reasoning and tool use.

ByanthropicInput typeOutput type

Recent activity on Claude Sonnet 4.5

Tokens processed per day

Thoughput

(tokens/s)

Providers	Min (tokens/s)	Max (tokens/s)	Avg (tokens/s)
Anthropic	4.09	36.35	9.85
Amazon Bedrock	2.73	26.34	15.61

First Token Latency

(ms)

Providers	Min (ms)	Max (ms)	Avg (ms)
Anthropic	1996	2712	2179.60
Amazon Bedrock	3595	12714	6812.33

Providers for Claude Sonnet 4.5

ZenMux Provider to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Anthropic

Latency

2.89

Throughput

43.29

tps

Uptime

100.00

Recent uptime

Oct 10,2025 - 3 PM100.00%

Price

Tiered pricing

0 <= Input < 200k

Input

$ 3

/ M tokens

Output

$ 15

/ M tokens

Cache read

$ 0.3

/ M tokens

Cache write 5m

$ 3.75

/ M tokens

Cache write 1h

$ 6

/ M tokens

Cache write

Web search

$ 0.01

/ request

Model limitation

Context

200.00K

Max output

64.00K

Supported Parameters

max_completion_tokens

temperature

top_p

frequency_penalty

presence_penalty

seed

logit_bias

logprobs

top_logprobs

response_format

stop

tools

tool_choice

parallel_tool_calls

Model Protocol Compatibility

openai

anthropic

Data policy

Prompt training

false

Prompt Logging

30 day retention

Moderation

Responsibility of developer

Status Page

status page

Amazon Bedrock

Latency

Throughput

Uptime

100.00

Recent uptime

Oct 10,2025 - 3 PM100.00%

Price

Tiered pricing

0 <= Input < 200k

Input

$ 3

/ M tokens

Output

$ 15

/ M tokens

Cache read

$ 0.3

/ M tokens

Cache write 5m

$ 3.75

/ M tokens

Cache write 1h

$ 6

/ M tokens

Cache write

Web search

$ 0.01

/ request

Model limitation

Context

200.00K

Max output

64.00K

Supported Parameters

max_completion_tokens

temperature

top_p

frequency_penalty

presence_penalty

seed

logit_bias

logprobs

top_logprobs

response_format

stop

tools

tool_choice

parallel_tool_calls

Model Protocol Compatibility

openai

anthropic

Data policy

Prompt training

false

Prompt Logging

30 day retention

Moderation

Responsibility of developer

Status Page

status page

Sample code and API for Claude Sonnet 4.5

ZenMux normalizes requests and responses across providers for you.

OpenAI-PythonPythonTypeScriptOpenAI-TypeScriptcURL

python
from openai import OpenAI

client = OpenAI(
  base_url="https://zenmux.ai/api/v1",
  api_key="<ZenMux_API_KEY>",
)

completion = client.chat.completions.create(
  model="anthropic/claude-sonnet-4.5",
  messages=[
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is in this image?"
        }
      ]
    }
  ]
)
print(completion.choices[0].message.content)