Timeline
Model appears to have been removed or changed from Claude Code platform
Study finds GPT-4 generates product ideas scoring 2.5x higher in creativity than human crowdworkers.
Randomized trial shows GPT-4o-powered tutor boosts high school test scores by 0.15 standard deviations
Demonstration of advanced financial analysis capabilities through prompt engineering
Estimated to have around 1.76 trillion parameters, representing current state-of-the-art scale
Release delayed by 10 days due to safety considerations
Research published showing GPT-4o's multimodal capabilities outperform unimodal versions in predicting item complexity
Capable of generating convincing synthetic media for disinformation
Version 4.6 update released with 'beastly' performance for agentic tasks and computer interaction.
Study published in Nature reveals AI assistance boosts individual productivity but reduces collective creativity and solution diversity
Ecosystem
Claude 3.5 Sonnet
GPT-4o
Benchmarks
Evidence (5 articles)
AI Benchmarks Hit Saturation Point: What Comes Next for Performance Measurement?
Feb 23, 2026GLM-5.1 Released by Zhipu AI, Claiming Performance Close to GPT-4o and Claude 3.5
Mar 27, 2026Memory Sparse Attention (MSA) Achieves 100M Token Context with Near-Linear Complexity
Mar 29, 2026Rumor: Anthropic Preparing 'Mythos' and 'Capybara' Model Launches, Potentially Challenging GPT-4o
Mar 28, 2026Alibaba Launches Qwen3.6-Plus with 1M-Token Context, Targeting AI Agent and Coding Workloads
Apr 3, 2026