Claude Sonnet 4.6
stable, Positive
vs
competes with (1)
GPT-5.3
stable, Positive
Coverage (30d): 9 vs 15
This Week: 0 vs 4
Evidence: 1 article
Relationships: 1

Timeline

GPT-5.3, 2026-03-26

Achieved 100% resident identification accuracy in a safety evaluation for a care home smart speaker system.

Claude Sonnet 4.6, 2026-03-26

Used in prompt compression study analyzing 358 successful runs from 1,199 real orchestration instructions

Claude Sonnet 4.6, 2026-03-20

Anthropic released Claude Sonnet 4.6 with native chain-of-thought reasoning mode for complex coding tasks

Claude Sonnet 4.6, 2026-03-17

Service disruption with elevated error rates reported on status page

Claude Sonnet 4.6, 2026-03-16

Release of Claude Sonnet 4.5 model by Anthropic

GPT-5.3, 2026-03-07

Released as OpenAI's most capable frontier model with unified coding, reasoning, and computer operation capabilities

GPT-5.3, 2026-03-06

Demonstrated surpassing human baselines on OSWorld benchmark with 75% score

GPT-5.3, 2026-03-05

OpenAI releases GPT-5.4 with native computer use, tool search, and 1M token context window

GPT-5.3, 2026-03-01

Expected to follow shortly after DeepSeek v4 release

GPT-5.3, 2026-02-28

Identified as experiencing up to 33% accuracy degradation in extended conversations according to new research

Ecosystem

Claude Sonnet 4.6

developed: Anthropic (5 src)
uses: Chain-of-Thought (1 src)

GPT-5.3

developed by: OpenAI
developed: OpenAI (5 src)
uses: Playwright (1 src)
uses: Rust (1 src)
competes with: Claude Sonnet 4.6 (1 src)
uses: GPT-5.3-Codex (1 src)

Benchmarks

Benchmark             Claude Sonnet 4.6    GPT-5.3
MMLU-Pro              85                   -
Arena Elo             1470                 -
SWE-bench Verified    79.6                 -
GPQA                  -                    92
SWE-bench Pro         -                    56.8

("-" marks a benchmark with no value listed for that model.)

Evidence (1 article)