code generation

30 articles about code generation in AI news

ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation

Researchers introduce ReCUBE, a benchmark isolating LLMs' ability to use repository-wide context for code generation. GPT-5 achieves just a 37.57% strict pass rate, showing the task remains highly challenging.

Mar 30, 202696% relevant

China's DeepSeek-R1: Open-Source AI Agent Runs Locally with Web Search, Code Generation, and Built-In Computer

Chinese AI company DeepSeek has released DeepSeek-R1, a fully open-source AI agent that runs locally on personal computers with web search capabilities, code generation, and built-in computer functionality. The model represents a significant move toward accessible, self-contained AI systems outside the dominant U.S. ecosystem.

Mar 23, 202699% relevant

Cursor Composer2 Launches on Fireworks AI Platform, Adds RL to Code Generation Stack

Cursor Composer2, the next iteration of Cursor's AI-powered code generation system, is now available via the Fireworks AI platform. This release introduces reinforcement learning (RL) components alongside standard inference, expanding the technical approach beyond the initial version.

Mar 20, 202685% relevant

Very Rubin Platform Launches: AI-Powered Code Generation and Debugging Tool

Very Rubin, a new AI platform for software development, has launched. It offers real-time code generation, debugging, and optimization through a browser-based interface.

Mar 16, 202685% relevant

WiseTech Cuts 2,000 Engineers, Citing AI Code Generation as Primary Driver

Logistics software giant WiseTech has laid off 2,000 engineers, stating AI now writes the code. This move highlights a strategic pivot where knowing what to build is becoming the core skill, not writing the code itself.

Apr 5, 202685% relevant

Meta's New AI Checklist Forces Models to Show Their Work, Revolutionizing Code Generation

Meta researchers have developed a mandatory checklist system that requires AI models to trace code execution line-by-line rather than making blind guesses. This breakthrough addresses fundamental reliability issues in AI-generated code by enforcing step-by-step reasoning.

Mar 4, 202685% relevant

ByteDance's CUDA Agent: The AI System Outperforming Human Experts in GPU Code Generation

ByteDance has unveiled CUDA Agent, a large-scale reinforcement learning system that generates high-performance CUDA kernels. The system achieves state-of-the-art results, outperforming torch.compile by up to 100% and beating leading AI models like Claude Opus 4.5 and Gemini 3 Pro by approximately 40% on the most challenging tasks.

Mar 2, 202695% relevant

DeepSeek-R1 Scores 79.8% on SWE-Bench Verified, Matching Claude 3.5 Sonnet in Code Generation

DeepSeek's new R1 reasoning model achieved 79.8% on SWE-Bench Verified, matching Claude 3.5 Sonnet's performance. This marks significant progress in AI's ability to solve real-world coding problems.

Mar 17, 202685% relevant

Sam Altman Envisions Codex Desktop Evolving into Unified AI Agent Controlling Computers

Sam Altman discussed the Codex Desktop ecosystem evolving toward a unified AI agent that can control computers, access user data, and work across multiple surfaces. This vision points toward AI systems moving beyond code generation to become proactive, cross-platform assistants.

Apr 2, 202689% relevant

OpenAI Expands Codex Plugin Ecosystem to Slack, Figma, Notion, and Gmail

OpenAI has rolled out new plugins connecting its Codex model to productivity tools like Slack, Figma, Notion, and Gmail, moving code generation beyond the IDE into broader workflows.

Mar 28, 202687% relevant

Stop Asking Claude Code to Write Code—Start Asking It to Read Your Codebase

Experienced users report the biggest productivity gain isn't code generation—it's using Claude as a superhuman codebase navigator that can trace data flows and debug across files instantly.

Mar 16, 2026100% relevant

Beyond Prompt Engineering: Claude Code Emerges as a Comprehensive AI Development Platform

Anthropic's Claude Code represents a paradigm shift from simple prompt tools to full AI engineering systems, offering integrated development environments, automated workflows, and sophisticated code generation capabilities that transform how developers build software.

Mar 11, 202685% relevant

Claude Octopus: GitHub Tool Enables Claude Code to Run Gemini and Codex Simultaneously

A developer discovered Claude Octopus, a GitHub repository that allows Anthropic's Claude Code to execute prompts across Google's Gemini and OpenAI's Codex models concurrently. The tool appears to enable parallel code generation from multiple AI assistants.

Mar 16, 202689% relevant

CLAUDE.md Promises 63% Reduction in Claude Output Tokens with Drop-in Prompt File

A new prompt engineering file called CLAUDE.md claims to reduce Claude's output token usage by 63% without code changes. The drop-in file aims to make Claude's code generation more efficient by structuring its responses.

Apr 1, 202687% relevant

CMU Research Identifies 'Biggest Unlock' for Coding Agents: Strategic Test Execution

New research from Carnegie Mellon University suggests the key advancement for AI coding agents lies not in raw code generation, but in developing strategies for how to run and interpret tests. This shifts focus from LLM capability to agentic reasoning.

Mar 31, 202687% relevant

Amazon's Reinforcement Fine-Tuning Revolution: How Nova Models Learn Through Feedback, Not Imitation

Amazon introduces reinforcement fine-tuning for its Nova AI models, shifting from imitation-based learning to evaluation-driven training. This approach enables enterprises to customize models using feedback signals rather than just examples, with applications from code generation to customer service.

Feb 26, 202675% relevant

Cisco Launches DevNet MCP Server: Bring API Docs Directly to Your AI Coding Assistant

Cisco's new DevNet Content Search MCP Server integrates Cisco's API documentation directly into Claude Code and other MCP-compatible IDEs, providing context-aware code generation without browser switching.

Mar 12, 202691% relevant

Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation

A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.

Apr 3, 202691% relevant

ByteDance's Helios: A 14B Parameter Video Generation Model Running at 19.5 FPS on a Single H100 GPU

ByteDance has introduced Helios, a 14-billion parameter video generation model that reportedly runs at 19.5 frames per second on a single NVIDIA H100 GPU. This represents a significant step in making high-quality, real-time video synthesis more computationally accessible.

Mar 23, 202695% relevant

Learning to Disprove: LLMs Fine-Tuned for Formal Counterexample Generation in Lean 4

Researchers propose a method to train LLMs for formal counterexample generation, a neglected skill in mathematical AI. Their symbolic mutation strategy and multi-reward framework improve performance on three new benchmarks.

Mar 23, 202677% relevant

Annealed Co-Generation: A New AI Framework Tackles Scientific Complexity Through Pairwise Modeling

Researchers propose Annealed Co-Generation, a novel AI framework that simplifies multivariate generation in scientific applications by modeling variables in pairs rather than jointly. The approach reduces computational burden and data imbalance while maintaining coherence across complex systems.

Mar 10, 202675% relevant

Evo LLM Unifies Autoregressive and Diffusion AI, Achieving New Balance in Language Generation

Researchers introduce Evo, a novel large language model architecture that bridges autoregressive and diffusion-based text generation. By treating language creation as a continuous evolutionary flow, Evo adaptively balances confident refinement with exploratory planning, achieving state-of-the-art results across 15 benchmarks while maintaining fast inference speeds.

Mar 10, 202675% relevant

Google's Gemini 3.1 Flash Image: A New Contender in the AI Visual Generation Race

Google is reportedly developing Gemini 3.1 Flash Image, a specialized image generation model that could challenge Midjourney and DALL-E 3. This lightweight variant promises faster, more efficient visual creation while expanding Google's multimodal AI ecosystem.

Feb 25, 202685% relevant

PartRAG Revolutionizes 3D Generation with Retrieval-Augmented Part-Level Control

Researchers introduce PartRAG, a breakthrough framework that combines retrieval-augmented generation with diffusion transformers for precise part-level 3D creation and editing from single images. The system achieves superior geometric accuracy while enabling localized modifications without regenerating entire objects.

Feb 20, 202670% relevant

Anthropic's Claude Code Launches Autonomous Code Review, Pushing AI Beyond Simple Generation

Anthropic has launched Code Review in Claude Code, a multi-agent system that automatically analyzes AI-generated code for logic errors and security vulnerabilities. This represents a shift from AI as a coding assistant to an autonomous reviewer capable of complex, multi-step reasoning.

Mar 9, 202684% relevant

Renoise AI Tool Enables Programmatic Video Generation, Promising Faster Production

Renoise has launched an AI tool that generates videos through code rather than traditional editing. The platform claims to produce high-quality videos more easily and faster than previous methods.

Mar 23, 202685% relevant

Survey Benchmarks Four Approaches to Synthetic Brain Signal Generation for BCI Data Scarcity

A comprehensive survey categorizes and benchmarks four methodological approaches to generating synthetic brain signals for BCIs, addressing data scarcity and privacy constraints. The authors provide an open-source codebase for comparing knowledge-based, feature-based, model-based, and translation-based generative algorithms.

Mar 16, 202684% relevant

Wikigen: Automate GitHub Wiki Generation with a Single CLI Command

Wikigen is a Go CLI that uses Claude Code to analyze your repo and generate comprehensive GitHub Wiki documentation automatically.

Mar 15, 2026100% relevant

Developer Declares 'Closed SaaS Feels Like a Generation Ago' as AI-Powered Open Source Tools Surpass Paid Subscriptions

Developer George Pu announced he's canceling multiple SaaS subscriptions, citing that AI-enhanced, production-ready open-source alternatives from GitHub repositories now outperform the paid tools he used a year ago.

Mar 31, 202687% relevant

Building a Next-Generation Recommendation System with AI Agents, RAG, and Machine Learning

A technical guide outlines a hybrid architecture for recommendation systems that combines AI agents for reasoning, RAG for context, and traditional ML for prediction. This represents an evolution beyond basic collaborative filtering toward systems that understand user intent and context.

Mar 25, 2026100% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety