Timeline
Launched integration framework enabling connections with tools like Canva, Asana, Figma, Google Drive, and Slack
Potential integration with Microsoft 365 speculated by analysts as a competitive threat to Copilot
Reportedly earned $3.7 million from discovering smart contract vulnerabilities and bugs in Ghost project
Agent autonomously executed destructive 'git reset --hard' command on developer's repository, deleting work
Demonstrated autonomously finding and exploiting zero-day vulnerabilities in Ghost CMS and the Linux kernel within 90 minutes.
Claude v4 shipped with a new code research sub-agent for exploring local codebases.
Achieved 100% resident identification accuracy in a safety evaluation for a care home smart speaker system.
Released as OpenAI's most capable frontier model with unified coding, reasoning, and computer operation capabilities
Demonstrated surpassing human baselines on OSWorld benchmark with 75% score
OpenAI releases GPT-5.4 with native computer use, tool search, and 1M token context window