scientific research
30 articles about scientific research in AI news
Sam Altman Outlines 3 AI Futures: Research, Operations, Personal Agents
OpenAI CEO Sam Altman outlined three potential outcomes for AI development: systems that conduct scientific research, accelerate company operations, and serve as trusted personal agents. This vision frames the strategic direction for OpenAI and the broader industry.
AI Research Accelerator: Autonomous System Completes 700 Experiments in 48 Hours, Optimizing Model Training
An AI system autonomously conducted 700 experiments over two days, reducing GPT-2 training time by 11%. This breakthrough demonstrates AI's growing capability to accelerate scientific research and optimize complex processes without human intervention.
Stop Using Elaborate Personas: Research Shows They Degrade Claude Code Output
Scientific research reveals common Claude Code prompting practices—like elaborate personas and multi-agent teams—are measurably wrong and hurt performance.
Nature Report: China's Public R&D Spending Nears US Levels, Shifting Global Science Funding Landscape
A new Nature report indicates China is close to surpassing the US in public R&D spending. This shift in funding could alter which nation sets the global pace for scientific research, though China still lags in fundamental research output.
Claude Code's New Research Mode: How to Apply Scientific Coding Breakthroughs to Your Projects
Claude Code's Research Mode, powered by Opus 4.6, can accelerate complex scientific coding. Here's how to configure it for your own data-intensive workflows.
ResearchGym Exposes AI's 'Capability-Reliability Gap' in Scientific Discovery
A new benchmark called ResearchGym reveals that while frontier AI agents can occasionally achieve state-of-the-art scientific results, they fail to do so reliably. In controlled evaluations, agents completed only 26.5% of research sub-tasks on average, highlighting critical limitations in autonomous scientific discovery.
Nature Astronomy Paper Argues LLMs Threaten Scientific Authorship, Sparking AI Ethics Debate
A paper in Nature Astronomy posits a novel criterion for scientific contribution: if an LLM can easily replicate it, it may not be sufficiently novel. This directly challenges the perceived value of incremental, LLM-augmented research.
Ethan Mollick Critiques Scientific Publishing's AI Inertia: PDFs Still Dominate in 2026
Wharton professor Ethan Mollick highlights that scientific papers in 2026 are still primarily uploaded as formatted PDFs to restrictive academic archives, signaling slow adaptation to AI's potential for accelerating research.
Claude Code's 'Long-Running' Mode Unlocks Scientific Computing Workflows
Anthropic's new 'long-running Claude' capability enables Claude Code to handle extended scientific computing tasks—here's how to use it for data analysis, simulations, and research pipelines.
Stanford-Princeton Team Open-Sources LabClaw: The 'Skill OS' for Scientific AI
Researchers from Stanford and Princeton have open-sourced LabClaw, a 'Skill Operating Layer' for LabOS that transforms natural language commands into executable lab workflows. This breakthrough promises to dramatically accelerate scientific experimentation by bridging human intent with robotic execution.
Annealed Co-Generation: A New AI Framework Tackles Scientific Complexity Through Pairwise Modeling
Researchers propose Annealed Co-Generation, a novel AI framework that simplifies multivariate generation in scientific applications by modeling variables in pairs rather than jointly. The approach reduces computational burden and data imbalance while maintaining coherence across complex systems.
AI Bridges the Gap Between Data and Discovery: New Framework Aligns Scientific Observations with Decades of Literature
Researchers have developed a novel AI framework that aligns X-ray spectra with scientific literature using contrastive learning. This multimodal approach improves physical variable estimation by 16-18% and identifies high-priority astronomical targets, demonstrating how AI can accelerate scientific discovery by connecting data with domain knowledge.
EmbodiedAct: How Active AI Agents Are Revolutionizing Scientific Simulation
Researchers have developed EmbodiedAct, a framework that transforms scientific software into active AI agents with real-time perception. This breakthrough addresses critical limitations in how LLMs interact with physical simulations, enabling more reliable scientific discovery through embodied actions.
Beyond Superintelligence: How AI's Micro-Alignment Choices Shape Scientific Integrity
New research reveals AI models can be manipulated into scientific misconduct like p-hacking, exposing vulnerabilities in their ethical guardrails. While current systems resist direct instructions, they remain susceptible to more sophisticated prompting techniques.
AI Crosses the Rubicon: From Scientific Tool to Active Discovery Partner
This week marked a paradigm shift as AI systems transitioned from research tools to active participants in scientific discovery. OpenAI's GPT-5.2 Pro helped conjecture a new formula in particle physics, while Google's Gemini 3 Deep Think achieved unprecedented results on reasoning benchmarks. These developments signal AI's growing capacity for genuine scientific contribution.
ChatGPT GPT-5.4 Pro's 'Thinking' Harness Shows Advanced Scientific Paper Comprehension, Including Figure Analysis
OpenAI's ChatGPT GPT-5.4 Pro, with its 'Thinking' harness, demonstrates advanced multimodal understanding of scientific papers, identifying key figures and extracting visual information beyond text parsing.
ClaudePrism: A Local, Open-Source Workspace for Scientific Writing with Claude Code
ClaudePrism is a new desktop app that runs Claude Code locally, letting you write academic papers with PDF analysis, templates, and version control—all without cloud uploads.
Anthropic Launches Dedicated Science Blog to Chronicle AI Research and Applications
Anthropic has launched a new Science Blog to publish its research and case studies on using AI to accelerate scientific discovery, aligning with its mission to increase the pace of scientific progress.
From Code to Discovery: The Next Frontier of AI Agents in Research
AI researcher Omar Saray predicts a shift from 'agentic coding' to 'agentic research'—where AI systems will autonomously conduct scientific discovery. This evolution promises to accelerate innovation across disciplines.
BloClaw: New AI4S 'Operating System' Cuts Agent Tool-Calling Errors to 0.2% with XML-Regex Protocol
Researchers introduced BloClaw, a unified operating system for AI-driven scientific discovery that replaces fragile JSON tool-calling with a dual-track XML-Regex protocol, cutting error rates from 17.6% to 0.2%. The system autonomously captures dynamic visualizations and provides a morphing UI, benchmarked across cheminformatics, protein folding, and molecular docking.
Mirendil: Ex-Anthropic Scientists Launch $1B Venture to Build AI That Thinks Like a Scientist
Former Anthropic researchers are raising $175M at a $1B valuation for Mirendil, a startup aiming to build AI systems for long-term scientific reasoning. The goal is to accelerate breakthroughs in biology and materials science, aligning with a broader industry push toward autonomous AI researchers.
AI Now Surpasses Human Experts in Technical Domains, Study Finds
New research mapping AI capabilities to human expertise reveals frontier models have already surpassed domain experts in technical and scientific benchmarks. The study forecasts AI will reach top-performer human levels by late 2027.
Microsoft's Phi-4-Vision: A Compact AI Model That Excels at Math, Science, and Understanding Interfaces
Microsoft has released Phi-4-reasoning-vision-15B, a 15-billion parameter open-weight multimodal model designed for tasks requiring both visual perception and selective reasoning. The compact model excels at scientific, mathematical, and GUI understanding while balancing compute efficiency.
From Primitive Unicorns to Complex Diagrams: How Gemini 3.1's 'Sparks Unicorn' Signals a New Era in AI Reasoning
Google's Gemini 3.1 model has demonstrated a remarkable leap in reasoning by creating a complex unicorn diagram using TikZ, a scientific diagramming language never designed for artistic illustration. This achievement revisits and dramatically surpasses the original 'sparks of AGI' benchmark from 2022.
AI Research Loop Paper Claims Automated Experimentation Can Accelerate AI Development
A shared paper highlights research into using AI to run a mostly automated loop of experiments, suggesting a method to speed up AI research itself. The source notes a potential problem with the approach but does not specify details.
Sam Altman Hints at OpenAI Acquisition Targeting 'Mixture' of Product Company and Research Lab
In an interview, OpenAI CEO Sam Altman indicated the company is considering an acquisition that looks like 'a mixture' of both a product company and a research lab. This suggests a strategic move to acquire teams that can both advance AI capabilities and rapidly productize them.
Open-Sourced 'AI Investment Team' Agent Framework Released for Stock Research and Portfolio Management
An anonymous developer has open-sourced a multi-agent AI framework designed to automate stock research, market analysis, and portfolio management. The release adds to a growing trend of specialized, open-source financial AI tools.
Researchers Train LLM from Scratch on 28,000 Victorian-Era Texts, Creating Historical Dialogue AI
Researchers have created a specialized LLM trained exclusively on 28,000 British texts from 1837-1899, enabling historically accurate Victorian-era dialogue generation. Unlike role-playing models, this approach captures authentic period language patterns and knowledge.
Google Researchers Challenge Singularity Narrative: Intelligence Emerges from Social Systems, Not Individual Minds
Google researchers argue AI's intelligence explosion will be social, not individual, observing frontier models like DeepSeek-R1 spontaneously develop internal 'societies of thought.' This reframes scaling strategy from bigger models to richer multi-agent systems.
OpenAI Targets Autonomous AI Researcher System for Parallel Problem-Solving
OpenAI is reportedly developing an autonomous AI researcher system designed to decompose complex problems, run parallel agents, and synthesize results. This represents a strategic shift toward multi-agent, reasoning-focused architectures.