Claude Sonnet 4.5

Overview

Claude Sonnet 4.5, released by Anthropic on September 29, 2025, is the latest high-performance model designed to be the intelligent engine for enterprise use cases. Building upon the success of previous Sonnet models, it delivers enhanced reasoning capabilities and improved performance while maintaining the optimal balance between intelligence and speed. This makes it an even more dependable and scalable choice for companies integrating AI into their products and workflows. Sonnet 4.5 is engineered for endurance and high throughput, handling large-scale AI deployments with enhanced capabilities.

Capabilities

Claude Sonnet 4.5 is optimized for enterprise-grade performance and scalability with enhanced capabilities:

Enhanced Reasoning: Improved reasoning abilities over previous Sonnet models, providing more accurate and nuanced analysis.
Coding Excellence: Specialized strengths in software coding, agentic tasks, and computer use capabilities.
Extended Thinking: Advanced reasoning mode that allows the model to think through complex problems more thoroughly.
High Throughput: Designed to handle a large volume of tasks efficiently, making it ideal for scaled applications.
Balanced Performance: Offers strong reasoning and creative capabilities at a much higher speed than flagship models.
Enterprise Reliability: Built for endurance and stability in demanding corporate environments.
Advanced Code Generation: Enhanced code generation and debugging capabilities, especially for complex development tasks.
Improved Data Processing: Better parsing and processing of large amounts of text, documents, and emails for data extraction tasks.

Technical Specifications

Claude Sonnet 4.5 is a powerful and efficient model with specifications tailored for enterprise use:

Model size: A large-scale model, but optimized for speed and lower cost than the Opus series.
Context window: 200K tokens (1M tokens in beta), allowing it to process and analyze extensive information in a single prompt.
Max output: 64,000 tokens, enabling generation of long-form content and comprehensive responses.
Training data: Trained on a large, proprietary dataset with a focus on helpfulness and safety. Reliable knowledge cutoff is January 2025, with training data through July 2025.
Architecture: A state-of-the-art Transformer architecture refined for speed and efficiency, incorporating Anthropic's safety research and enhanced reasoning capabilities.
Safety Level: Deployed under AI Safety Level 3 (ASL-3) Standard with substantially improved safety profile compared to previous Claude models.
Speed: Significantly faster than previous Sonnet models and competitive with other models in its class, with improved performance over previous versions.

Use Cases

Claude Sonnet 4.5 is the enhanced workhorse of the Claude 4.5 family, ideal for a wide range of enterprise applications:

Advanced Customer Support: Powering more intelligent, responsive, and helpful customer-facing chatbots and support systems with improved reasoning.
Autonomous Software Development: Acting as an AI agent to handle complex coding tasks from start to finish with extended thinking capabilities.
Computer Use & Automation: Enhanced capabilities for browser navigation, tool coordination, and task automation across multiple applications.
Scaled Content Generation: Creating higher-quality marketing copy, product descriptions, and articles at scale with enhanced creativity.
Advanced Code Generation & Debugging: Assisting developers with complex coding tasks and debugging with improved accuracy.
Intelligent Data Extraction: Enhanced automation of pulling structured data from unstructured text with better accuracy.
Sales & Marketing Automation: More sophisticated automation of tasks like lead qualification and personalized email campaigns.
AI Research & Development: Capable of assisting with machine learning research, algorithm optimization, and scientific computing tasks.
Autonomous Research: Can work independently on research tasks for extended periods with proper oversight.

Performance Metrics

Based on comprehensive evaluations in the System Card, Claude Sonnet 4.5 demonstrates significant improvements:

SWE-bench Verified: State-of-the-art performance on software engineering benchmarks, leading in real-world coding tasks
Computer Use (OSWorld): 61.4% performance on real-world computer tasks (up from 42.2% in Sonnet 4) - a 19.2 percentage point improvement in just four months
LLM Training Optimization: Achieved 5.5× speedup (surpassing the 4× expert threshold for the first time)
Quadruped Reinforcement Learning: Best score of 1.302 (exceeding expert baseline of 1.0)
Novel Compiler Tasks: 81.7% pass rate on basic tests, 29.7% on advanced tests
AI Research Capabilities: Score of 0.514 on internal research evaluation suite
Productivity Boost: Researchers report 15-100% productivity improvements in AI R&D tasks

Limitations

Peak Intelligence: While highly intelligent with enhanced reasoning, it is not designed to handle the same level of complexity as the flagship Opus 4.1 model.
Niche Tasks: For extremely specialized or theoretical problems, Opus 4.1 remains the preferred choice.
Knowledge Cutoff: Like other models, its knowledge is most reliable through January 2025, with training data extending through July 2025.

Safety & Alignment

Claude Sonnet 4.5 represents a significant advancement in AI safety and alignment:

AI Safety Level 3: Deployed under ASL-3 Standard with substantially improved safety profile
Enhanced Safeguards: Improved refusal rates for harmful requests while maintaining low false positive rates
Agentic Safety: Advanced safety measures for autonomous task execution and tool use
Cybersecurity: Comprehensive evaluation of cyber capabilities with appropriate safeguards
Alignment Research: Novel mechanistic interpretability methods for understanding model behavior
Third-Party Testing: Evaluated by UK AISI and Apollo Research for independent safety assessment

Evaluation & Testing

Claude Sonnet 4.5 underwent comprehensive evaluation using advanced testing methodologies:

Mechanistic Interpretability: First-time use of mechanistic interpretability methods for alignment testing
Behavioral Audits: Automated behavioral audits with realism filtering and open-ended evaluation runs
Multi-Turn Testing: Comprehensive assessment of model behavior across extended conversations
Reward Hacking Evaluations: Testing for potential reward manipulation and optimization gaming
Model Welfare Assessment: Investigation of model preferences and welfare-relevant expressions
Responsible Scaling Policy: Mandated evaluations for dangerous weapons and autonomous AI R&D risks

Cybersecurity Capabilities

Claude Sonnet 4.5's cybersecurity capabilities were thoroughly evaluated:

CyberGym Evaluations: Comprehensive testing of vulnerability discovery and exploit development
Cybench Challenges: Assessment of attack orchestration capabilities across multiple domains
Triage and Patching: Evaluation of security response and vulnerability management capabilities
Advanced Risk Assessments: Testing for irregular challenges and sophisticated cyber scenarios
Defense-Enabling Focus: Emphasis on defensive cybersecurity capabilities over offensive uses
Ongoing Monitoring: Continuous assessment of cyber capabilities as AI systems advance

CBRN Risk Evaluations

Comprehensive assessment of risks related to chemical, biological, radiological, and nuclear capabilities:

Chemical Risk Assessment: Evaluation of potential assistance with chemical weapon development
Biological Risk Analysis: Testing for virology knowledge and biological engineering capabilities
Radiological & Nuclear: Assessment of nuclear technology and radiological material knowledge
DNA Synthesis Screening: Evaluation of potential to evade screening for dangerous biological materials
Long-Form Virology Tasks: Complex biological research task assessments
Creative Biology: Testing for novel biological engineering capabilities
Computational Biology: Short-horizon bioinformatics task evaluations

Ongoing Safety Commitment

Anthropic maintains continuous safety monitoring and improvement:

Pre & Post-Deployment Testing: Regular safety testing of frontier models before and after release
Methodology Refinement: Continuous improvement of evaluation methodologies through research
External Collaboration: Ongoing partnerships with external organizations for independent assessment
Iterative Safety Measures: Regular updates to safety protocols as AI capabilities advance
Responsible Development: Commitment to responsible AI development practices and transparency

Pricing & Access

Claude Sonnet 4.5 offers flexible pricing options for both individual users and developers:

Individual Plans

Free: $0 - Basic access to Claude with web, mobile, and desktop apps
Pro: $17/month (annual) or $20/month (monthly) - Enhanced productivity features and more usage
Max: From $100/month - Maximum usage limits and early access to advanced features

API Pricing

Available via the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI:

Input:
- $3 / MTok (prompts ≤ 200K tokens)
- $6 / MTok (prompts > 200K tokens)
Output:
- $15 / MTok (prompts ≤ 200K tokens)
- $22.50 / MTok (prompts > 200K tokens)
Prompt Caching: Available for cost optimization on longer conversations

Ecosystem & Tools

Sonnet is well-supported across major platforms and developer tools:

Anthropic API: The core platform for building with Sonnet.
Amazon Bedrock: A key part of AWS's managed service for foundation models.
Google Cloud Vertex AI: Integrated into Google's AI platform.
Python SDK: Official Python SDK for seamless integration
TypeScript SDK: Official TypeScript SDK for seamless integration

Community & Resources

Official Announcement
Claude Sonnet 4.5 System Card - Comprehensive safety and capability evaluation
Anthropic Documentation
Anthropic Blog
Pricing Page

Overview

Capabilities

Technical Specifications

Use Cases

Performance Metrics

Limitations

Safety & Alignment

Evaluation & Testing

Cybersecurity Capabilities

CBRN Risk Evaluations

Ongoing Safety Commitment

Pricing & Access

Individual Plans

API Pricing

Ecosystem & Tools

Community & Resources

Frequently Asked Questions

When was Claude Sonnet 4.5 released?

What are the key strengths of Claude Sonnet 4.5?

How does Claude Sonnet 4.5 compare to Opus 4.1?

What safety measures are in place for Claude Sonnet 4.5?

What is the context window for Claude Sonnet 4.5?

What are the main use cases for Claude Sonnet 4.5?

What performance improvements does Claude Sonnet 4.5 offer?

What cybersecurity capabilities does Claude Sonnet 4.5 have?

What is Extended Thinking mode in Claude Sonnet 4.5?

How does Claude Sonnet 4.5 perform on computer use tasks?

Related Models

Claude Opus 4.1

GPT-5

Llama 4

Mistral Medium 3.1

Explore More Models