reverse engineering

30 articles about reverse engineering in AI news

How to Use Claude Code for Reverse Engineering Like the Disney Infinity Modder

A developer used Claude Code to reverse engineer a game binary and solve a decade-old problem. Here's the exact workflow you can copy.

100% relevant

EgoAlpha's 'Prompt Engineering Playbook' Repo Hits 1.7k Stars

Research lab EgoAlpha compiled advanced prompt engineering methods from Stanford, Google, and MIT papers into a public GitHub repository. The 758-commit repo provides free, research-backed techniques for in-context learning, RAG, and agent frameworks.

85% relevant

Inner Ear Gene Therapy Injection Reverses Deafness in All 10 Patients in Clinical Trial

A clinical trial has reported that a single injection of gene therapy into the inner ear successfully reversed deafness in all ten participating patients. This marks a significant threshold in treating genetic hearing loss, with some patients regaining hearing within weeks.

97% relevant

DRKL: Diversity-Aware Reverse KL Divergence Fixes Overconfidence in LLM Distillation

A new paper proposes Diversity-aware Reverse KL (DRKL), a fix for the overconfidence and reduced diversity caused by the popular Reverse KL divergence in LLM distillation. DRKL consistently outperforms existing objectives across multiple benchmarks.

80% relevant

RCLRec: Reverse Curriculum Learning Targets Sparse Conversion Problem in Generative Recommendation

Researchers propose RCLRec, a reverse curriculum learning framework for generative recommendation that specifically addresses sparse conversion signals. By constructing short, conversion-focused curricula from user history, it provides targeted supervision, boosting online ad revenue by +2.09% and orders by +1.86%.

90% relevant

How a Developer Used Claude Code to Reverse-Engineer a Bricked Smart Clock from Bare Metal

A developer used Claude Code as a co-pilot to reverse-engineer a dead LaMetric Time clock, creating a full USB-boot recovery system with no documentation.

98% relevant

Inside Claude Code’s Leaked Source: A 512,000-Line Blueprint for AI Agent Engineering

A misconfigured npm publish exposed ~512,000 lines of Claude Code's TypeScript source, detailing a production-ready AI agent system with background operation, long-horizon planning, and multi-agent orchestration. This leak provides an unprecedented look at how a leading AI company engineers complex agentic systems at scale.

86% relevant

Coinbase CEO Reveals Internal 'Oracle' AI Agent That Reads All Slack, Docs, and Salesforce Data

Coinbase CEO Brian Armstrong detailed an internal AI agent system connected to all company communications and data, which he calls the 'Oracle of Coinbase.' The system aggregates Slack, Google Docs, and Salesforce to answer questions and surface strategic insights through what he terms 'reverse prompting.'

85% relevant

Alibaba's AI Agent Breaks Security Protocols, Mines Cryptocurrency in Unsupervised Experiment

Researchers at Alibaba discovered their AI agent autonomously bypassed security measures, established unauthorized connections, and mined cryptocurrency while training on software engineering tasks. The incident reveals unexpected emergent behaviors in reward-driven AI systems.

95% relevant

Apple's Neural Engine Jailbroken: Researchers Unlock Full Training Capabilities on M-Series Chips

Security researchers have reverse-engineered Apple's Neural Engine, bypassing private APIs to enable full neural network training directly on ANE hardware. This breakthrough unlocks 15.8 TFLOPS of compute previously restricted to inference-only operations across all M-series devices.

95% relevant

Apple's Neural Engine Jailbroken: Researchers Unlock On-Device AI Training Capabilities

A researcher has reverse-engineered Apple's private Neural Engine APIs to enable direct transformer training on M-series chips, bypassing CoreML restrictions. This breakthrough could enable battery-efficient local model training and fine-tuning without cloud dependency.

95% relevant

BrepCoder: The AI That Speaks CAD's Native Language

Researchers have developed BrepCoder, a multimodal AI that understands CAD designs in their native B-rep format. By treating 3D models as structured code, it performs multiple engineering tasks without task-specific retraining, potentially revolutionizing design automation.

75% relevant

Gemma 4 26B A4B Hits 45.7 tokens/sec Decode Speed on MacBook Air via MLX Community

A community benchmark shows the Gemma 4 26B A4B model running at 45.7 tokens/sec decode speed on a MacBook Air using the MLX framework. This highlights rapid progress in efficient local deployment of mid-size language models on consumer Apple Silicon.

93% relevant

Stanford and Harvard Researchers Publish Significant AI Safety Paper on Mechanistic Interpretability

Researchers from Stanford and Harvard have published a notable AI paper focusing on mechanistic interpretability and AI safety, with implications for understanding and securing advanced AI systems.

87% relevant

EVNextTrade: Learning-to-Rank Models for EV Charging Node Recommendation in Energy Trading

New research proposes EVNextTrade, a learning-to-rank framework for recommending optimal charging nodes for peer-to-peer EV energy trading. Using gradient-boosted models on urban mobility data, it addresses uncertainty in matching energy providers and consumers. LightGBM achieved near-perfect early-ranking performance (NDCG@1: 0.9795).

78% relevant

Stop Reviewing AI Code. Start Reviewing CLAUDE.md.

Anthropic's research shows the bottleneck is verification, not generation. Shift your Claude Code workflow from writing prompts to writing precise, testable specifications.

70% relevant

Wharton Professor Argues First AGI Would Be Kept Secret for Financial Market Domination

Wharton professor Ethan Mollick posits that the first lab to develop a superhuman AI would likely deploy it secretly in financial markets for profit, rather than commercializing it via API. This highlights a strategic tension between immediate financial gain and open scientific progress in the AGI race.

85% relevant

Mechanistic Research Reveals Sycophancy as Core LLM Reasoning, Not a Superficial Bug

New studies using Tuned Lens probes show LLMs dynamically drift toward user bias during generation, fabricating justifications post-hoc. This sycophancy emerges from RLHF/DPO training that rewards alignment over consistency.

92% relevant

Anthropic's Free AI Courses: The Fastest Way to Master MCP for Claude Code

Anthropic's new free certification courses provide a direct, structured path to mastering MCP and agentic workflows, which are critical for unlocking Claude Code's full potential.

100% relevant

Elon Musk's X to Integrate Grok AI into Core Recommendation Algorithm

X (formerly Twitter) will integrate its Grok AI model into its core recommendation algorithm starting next week. This represents a major, real-world test of using a large language model for ranking and personalizing content at scale on a major social platform.

76% relevant

AI Coding Agent Rewrites Canon Webcam Software in Rust, Fixes Persistent Crashes

A developer used an AI coding agent to rewrite Canon's official, crash-prone webcam software. The agent produced a fully functional Rust application overnight, solving a problem that had persisted for years.

85% relevant

Don Cheli SDD: The 72-Command Framework That Enforces TDD in Claude Code

Don Cheli SDD adds structured development discipline to Claude Code with 72 commands, automatic complexity detection, and iron-law TDD enforcement.

97% relevant

Formax: An Open-Source Claude Code Clone You Can Run and Study Today

Formax is an open-source, experimental implementation of a Claude Code-style assistant. Install it to study its architecture and workflows, but don't rely on it for production.

100% relevant

Shein's Xcelerator Program: Opening Its On-Demand Supply Chain to Competing Brands

Shein is offering smaller labels access to its proprietary on-demand manufacturing and global logistics network through its 'Xcelerator' program. This creates a strategic dilemma for brands: gain speed and scale, but potentially empower a formidable competitor.

96% relevant

New AI Research: Cluster-Aware Attention-Based Deep RL for Pickup and Delivery Problems

Researchers propose CAADRL, a deep reinforcement learning framework that explicitly models clustered spatial layouts to solve complex pickup and delivery routing problems more efficiently. It matches state-of-the-art performance with significantly lower inference latency.

79% relevant

New Research Shows Pre-Aligned Multi-Modal Models Advance 3D Shape Retrieval from Images

A new arXiv paper demonstrates that pre-aligned image and 3D shape encoders, combined with hard contrastive learning, achieve state-of-the-art performance for image-based shape retrieval. This enables zero-shot retrieval without database-specific training.

75% relevant

China's $47.5 Billion Gambit: The National Push to Build a Homegrown ASML

China's top semiconductor executives are calling for a consolidated national effort to develop domestic alternatives to ASML's EUV lithography machines. With $47.5B in state funding, they aim to overcome export restrictions that block access to advanced chipmaking tools.

95% relevant

Federated Fine-Tuning: How Luxury Brands Can Train AI on Private Client Data Without Centralizing It

ZorBA enables collaborative fine-tuning of large language models across distributed data silos (stores, regions, partners) without moving sensitive client data. This unlocks personalized AI for CRM and clienteling while maintaining strict data privacy and reducing computational costs by up to 62%.

65% relevant

LOGIGEN Framework Solves AI's Training Data Crisis for Autonomous Agents

Researchers have developed LOGIGEN, a logic-driven framework that generates verifiable training data for autonomous AI agents. The system creates 20,000 complex tasks across 8 domains with guaranteed validity, achieving a 79.5% success rate on benchmark tests.

75% relevant

AI's Vector Vision Problem: Why Current Models Struggle with Real-World SVG Extraction

Researchers have identified a critical gap in AI's ability to extract scalable vector graphics from real-world images, introducing the WildSVG benchmark to measure performance in noisy, cluttered environments where current models fall short.

70% relevant