new features

30 articles about new features in AI news

Paradigm AI Launches New Version, Emphasizing Native AI Integration Over 'Tacked-On' Features

Paradigm AI has launched a new version of its platform, emphasizing a design philosophy of building AI natively into workflows from the ground up, rather than adding it as an afterthought.

87% relevant

How Claude Code's New Auto-Memory and Remote Control Features Stack Up Against OpenClaw

Claude Code has rapidly added auto-memory and remote session control, but understanding their practical limits is key to using them effectively.

74% relevant

Minimax M2.7 Achieves 56.2% on SWE-Pro, Features Self-Evolving Training with 100+ Autonomous Optimization Loops

Minimax has released M2.7, a model that reportedly used autonomous optimization loops during RL training to achieve a 30% internal improvement. It scores 56.2% on SWE-Pro, near Claude 3.5 Opus, and ties Gemini 3.1 on MLE Bench Lite.

97% relevant

Claude Opus 4.6 Is Live: How to Use Its Improved Coding & Agentic Features in Claude Code

Claude Opus 4.6 is now available with better coding accuracy and agentic task handling. Here's how to configure Claude Code to use it and what to expect.

100% relevant

Beyond the Big Three: How Niche AI Features Are Redefining Competition

Anthropic's Claude Cowork, Google's NotebookLM, and OpenAI's GPT-5.2 Pro each offer unique capabilities with no direct equivalents from competitors, signaling a shift toward specialized AI tools rather than one-size-fits-all models.

85% relevant

Agent Psychometrics: New Framework Predicts Task-Level Success in Agentic Coding Benchmarks with 0.81 AUC

A new research paper introduces a framework using Item Response Theory and task features to predict success on individual agentic coding tasks, achieving 0.81 AUC. This enables benchmark designers to calibrate difficulty without expensive evaluations.

75% relevant

Deferred is Better: A New Framework for CTR Prediction Tackles Feature Heterogeneity

A new research paper proposes MGDIN, a CTR prediction model that defers the interaction of sparse features to improve accuracy. It addresses the core problem of feature heterogeneity, where dense and sparse features are treated differently. This is a foundational improvement for any recommendation or ranking system.

78% relevant

OpenAI Teases Major Platform Evolution with New Voice and Multimodal Capabilities

OpenAI appears to be preparing significant upgrades to its AI platform, with hints pointing toward enhanced voice interaction capabilities and new multimodal features that could transform how users engage with artificial intelligence.

85% relevant

Anthropic's New Academy: What Claude Code Developers Should Know About Free AI Certification

Anthropic launches free AI certification program. Claude Code users should understand how this signals investment in developer education and potential future Claude Code features.

76% relevant

Gastric-X: New 1.7K-Case Multimodal Benchmark Challenges VLMs on Realistic Gastric Cancer Diagnosis Workflow

Researchers introduce Gastric-X, a comprehensive multimodal benchmark with 1.7K gastric cancer cases including CT scans, endoscopy, lab data, and expert notes. It evaluates VLMs on five clinical tasks to test if they can correlate biochemical signals with tumor features like physicians do.

77% relevant

CDNet: A New Dual-View Architecture for More Accurate Click-Through Rate Prediction

Researchers propose CDNet, a novel CTR prediction model that bridges sequential user behavior and contextual item features using fine-grained core-behavior and coarse-grained global interest views. This addresses key limitations in traditional models, balancing detail with computational efficiency.

100% relevant

Meta Enters the AI Shopping Arena: How Meta AI's New Feature Could Reshape E-Commerce

Meta is testing an AI-powered shopping research tool within its Meta AI chatbot, directly challenging similar features from OpenAI's ChatGPT and Google's Gemini. The feature provides users with curated product carousels, complete with brand details, pricing, and explanations for recommendations.

75% relevant

Anthropic Fellows Introduce 'Model Diffing' Method to Systematically Compare Open-Weight AI Model Behaviors

Anthropic's Fellows research team published a new method applying software 'diffing' principles to compare AI models, identifying unique behavioral features. This provides a systematic framework for model interpretability and safety analysis.

85% relevant

Claude Paid Subscribers More Than Double in Under Six Months, Credit Card Data Shows

Paid subscriptions for Anthropic's Claude have more than doubled in less than six months, driven by Super Bowl ads, a DoD policy stance, and new coding features. ChatGPT still leads in overall user base.

87% relevant

China's 'Robot Wolf Pack' Battlefield System Revealed: 15 km/h Speed, 25 kg Payload, Modular Weapons

A new Chinese robotic combat system, dubbed the 'Robot Wolf Pack,' has been revealed via social media. It features a 15 km/h speed, 12 degrees of freedom, 25 kg payload capacity, and is designed for modular weapons and obstacle clearing.

85% relevant

China Releases Open-Source Python Framework for Visual AI Agent Design

A new, fully open-source Python framework for building AI agents has been released from China. It features a visual design interface and multi-agent collaboration capabilities.

85% relevant

Claude Code's 81.6K GitHub Stars: What This Community Momentum Means for Your Daily Workflow

Claude Code's massive GitHub adoption signals a mature ecosystem—here's how to leverage the new MCP servers and subagent features shipping now.

100% relevant

CORE OOD Detection Method Achieves SOTA on 3 of 5 Benchmarks by Disentangling Confidence and Residual Signals

Researchers propose CORE, a new OOD detection method that scores classifier confidence and orthogonal residual features separately. It achieves the highest grand average AUROC across five architectures with negligible computational overhead.

75% relevant

Anthropic's Relentless Innovation: How the AI Challenger is Redefining the Pace of Development

Anthropic continues its rapid-fire release schedule with new AI models and features, demonstrating an unprecedented shipping velocity that's challenging industry giants. This relentless pace signals a new competitive dynamic in the AI race.

85% relevant

Anthropic's Sonnet 4.6 Emerges: Mid-Tier Model with 1M Token Context Window Confirms Leaks

Anthropic's newly revealed Sonnet 4.6 model features impressive evaluations for a mid-tier AI and a groundbreaking 1M token context window, validating earlier leaks about the company's development roadmap.

85% relevant

Gemma 4 Integrated into Android Studio for AI-Assisted App Development

Google has integrated its Gemma 4 language model into Android Studio's Agent mode, providing developers with AI-assisted coding features like refactoring and feature development within the official Android IDE.

85% relevant

Geometric Latent Diffusion (GLD) Achieves SOTA Novel View Synthesis, Trains 4.4× Faster Than VAE

GLD repurposes features from geometric foundation models like Depth Anything 3 as a latent space for multi-view diffusion. It trains significantly faster than VAE-based approaches and achieves state-of-the-art novel view synthesis without text-to-image pretraining.

95% relevant

Apple's On-Device Reranking Model for Private Visual Search: A Technical Breakdown

Analysis of Apple's Enhanced Visual Search system that uses multimodal features, geo-signals, and index debiasing to identify landmarks entirely on-device. This represents a significant advancement in privacy-preserving AI for visual recognition.

100% relevant

How Claude Code's 'Agent Flywheel' Chooses Your Dependencies (and Why It Picks Resend)

Claude Code shows a 9:1 preference for Resend over SendGrid when building email features. Here's how to use this bias to get better, more maintainable code.

88% relevant

OpenAI's Codex Upgrade Targets Workflow Automation: What Claude Code Users Should Know

OpenAI is upgrading Codex to automate developer workflows, directly competing with Claude Code's core automation features.

100% relevant

Google Gemini Launches Manual Memory & Chat Import to Ease Switching from ChatGPT, Claude

Google Gemini is rolling out 'Import Memory' and 'Import Chat History' features for desktop users. The manual tools provide prompts and a .zip upload to transfer data from other AI assistants, aiming to lower the barrier for users to switch from competitors like ChatGPT or Claude.

79% relevant

Improving Visual Recommendations with Vision-Language Model Embeddings

A technical article explores replacing traditional CNN-based visual features with SigLIP vision-language model embeddings for recommendation systems. This shift from low-level features to deep semantic understanding could enhance visual similarity and cross-modal retrieval.

92% relevant

VHS: Latent Verifier Cuts Diffusion Model Verification Cost by 63.3%, Boosts GenEval by 2.7%

Researchers propose Verifier on Hidden States (VHS), a verifier operating directly on DiT generator features, eliminating costly pixel-space decoding. It reduces joint generation-and-verification time by 63.3% and improves GenEval performance by 2.7% versus MLLM verifiers.

100% relevant

Meta's V-JEPA 2.1 Achieves +20% Robotic Grasp Success with Dense Feature Learning from 1M+ Hours of Video

Meta researchers released V-JEPA 2.1, a video self-supervised learning model that learns dense spatial-temporal features from over 1 million hours of video. The approach improves robotic grasp success by ~20% over previous methods by forcing the model to understand precise object positions and movements.

97% relevant

GitAgent Aims to Unify AI Agent Development with Git-Based Standard

GitAgent introduces an open specification that defines AI agents through files in a Git repository, enabling portability across frameworks like Claude Code, OpenAI Agents SDK, and CrewAI while leveraging Git's native version control and collaboration features.

85% relevant