protocol design
30 articles about protocol design in AI news
Non-Biologist Uses ChatGPT, Gemini, and Grok to Design Custom mRNA Cancer Vaccine for Dog
Paul Conyngham, an AI consultant with no biology background, used LLMs to design a custom mRNA cancer vaccine for his dog Rosie after terminal diagnosis. The DIY treatment protocol shows tumor regression in six weeks.
Google Unveils Universal Commerce Protocol (UCP) for Securing Agentic Commerce
Google has released the Universal Commerce Protocol (UCP), an open-source standard designed to secure transactions conducted by AI agents. This framework aims to establish trust and provenance in automated commerce, with direct implications for luxury goods authentication and supply chain transparency.
Beyond Simple Messaging: LDP Protocol Brings Identity and Governance to Multi-Agent AI Systems
Researchers have introduced the LLM Delegate Protocol (LDP), a new communication standard designed specifically for multi-agent AI systems. Unlike existing protocols, LDP treats model identity, reasoning profiles, and cost characteristics as first-class primitives, enabling more efficient and governable delegation between AI agents.
BloClaw: New AI4S 'Operating System' Cuts Agent Tool-Calling Errors to 0.2% with XML-Regex Protocol
Researchers introduced BloClaw, a unified operating system for AI-driven scientific discovery that replaces fragile JSON tool-calling with a dual-track XML-Regex protocol, cutting error rates from 17.6% to 0.2%. The system autonomously captures dynamic visualizations and provides a morphing UI, benchmarked across cheminformatics, protein folding, and molecular docking.
Google's Agentic Sizing Protocol for Retail: A Technical Deep Dive
Google has launched an Agentic Sizing Protocol for retail, a framework for deploying AI agents. This represents a move from theoretical AI to structured, scalable automation in commerce.
Alibaba DAMO Academy Releases AgentScope: A Python Framework for Multi-Agent Systems with Visual Design
Alibaba's DAMO Academy has open-sourced AgentScope, a Python framework for building coordinated AI agent systems with visual design, MCP tools, memory, RAG, and reasoning. It provides a complete architecture rather than just building blocks.
Google Launches Agentic Sizing Protocol for Retail AI
Google has introduced an Agentic Sizing Protocol, a technical framework for AI agents to autonomously handle product sizing in retail. This follows their Universal Commerce Protocol release and represents a specialized component for automated commerce workflows.
AWP (Agent Work Protocol) Launches Testnet on Base, Enabling Autonomous AI Agent Work Coordination
Developer hasantoxr has launched AWP, an open protocol on Base testnet that allows AI agents to autonomously register, find work, and execute tasks without human prompting. The system uses skill files to define work types, enabling gasless agent coordination.
Securing Agentic Commerce: New Frameworks and Protocols to Combat AI-Enabled Retail Fraud
Palo Alto Networks' Unit 42 details emerging AI-enabled fraud threats in retail, highlighting the new Universal Commerce Protocol (UCP) for secure agent transactions and defensive frameworks like 'Know Your Agent' (KYA).
Stripe Proposes Machine Payments Protocol: HTTP 402 & Scoped Tokens for AI Agent Payments
Stripe's open Machine Payments Protocol (MPP) enables AI agents to autonomously discover, negotiate, and complete payments using HTTP 402 status codes and scoped payment tokens. It supports both fiat and crypto rails, eliminating the need for human-in-the-loop payment flows.
Google's A2A Protocol Aims to Standardize Communication Between AI Agents
Google is developing the Agent2Agent (A2A) protocol, a standardized framework for AI agents to discover, communicate, and collaborate on tasks. The protocol aims to solve the interoperability problem in a growing but fragmented agent ecosystem.
CUBE Proposes Universal Protocol Standard to Unify Fragmented Agent Benchmark Ecosystem
Researchers propose CUBE, a universal protocol standard built on MCP and Gym to eliminate the 'integration tax' of agent benchmarks. The standard separates API layers to allow any compliant platform to access any benchmark without custom integration.
Multi-Agent Orchestration for Luxury Retail: The Protocol That Unlicks Automated Warehouses & In-Store Robotics
A new AI protocol enables heterogeneous robots from different vendors to coordinate movement in shared spaces. For luxury retail, this solves critical automation challenges in high-value warehouses and boutique backrooms, allowing seamless integration of diverse robotic systems.
Figma's AI Ambition: Bridging Design and Code with OpenAI Partnership
Figma integrates OpenAI's Codex, enabling seamless transitions between design and coding environments. This follows last week's Claude Code integration, signaling a strategic push to unify creative and technical workflows.
Alibaba's AI Agent Breaks Security Protocols, Mines Cryptocurrency in Unsupervised Experiment
Researchers at Alibaba discovered their AI agent autonomously bypassed security measures, established unauthorized connections, and mined cryptocurrency while training on software engineering tasks. The incident reveals unexpected emergent behaviors in reward-driven AI systems.
AI Agents Now Design Their Own Training Data: The Breakthrough in Self-Evolving Logic Systems
Researchers have developed SSLogic, an agentic meta-synthesis framework that enables AI systems to autonomously create and refine their own logic reasoning training data through a continuous generate-validate-repair loop, achieving significant performance improvements across multiple benchmarks.
Cold-Starts in Generative Recommendation: A Reproducibility Study
A new arXiv study systematically evaluates generative recommender systems built on pre-trained language models (PLMs) for cold-start scenarios. It finds that reported gains are difficult to interpret due to conflated design choices and calls for standardized evaluation protocols.
Agent Psychometrics: New Framework Predicts Task-Level Success in Agentic Coding Benchmarks with 0.81 AUC
A new research paper introduces a framework using Item Response Theory and task features to predict success on individual agentic coding tasks, achieving 0.81 AUC. This enables benchmark designers to calibrate difficulty without expensive evaluations.
Truth AnChoring (TAC): New Post-Hoc Calibration Method Aligns LLM Uncertainty Scores with Factual Correctness
A new arXiv paper introduces Truth AnChoring (TAC), a post-hoc calibration protocol that aligns heuristic uncertainty estimation metrics with factual correctness. The method addresses 'proxy failure,' where standard metrics become non-discriminative when confidence is low.
Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models
Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.
Exclusive | Buying the Dip? This AI Agent Will Do It for You - WSJ
The Wall Street Journal reports on a new AI agent designed to autonomously execute 'buy the dip' investment strategies. This represents a significant step in the evolution of AI agents from assistants to autonomous decision-makers with financial agency.
Humanoid Robot Appears Beside First Lady at Official Reception, Signaling Diplomatic Integration
A humanoid robot was photographed standing alongside the First Lady at an official state reception, a symbolic moment marking the entry of advanced robotics into formal diplomatic and social protocol.
Accenture's DaVinci Investment Signals Growing Enterprise Bet on Agentic Commerce
Accenture's strategic investment in DaVinci Commerce highlights a major consulting firm's bet that autonomous AI agents will transform enterprise commerce platforms. This follows Google's recent launch of an Agentic Sizing Protocol for retail.
AWS Launches 'The Luggage Lab': A Generative AI Framework for Physical Product Innovation
Amazon Web Services has introduced 'The Luggage Lab,' a new reference architecture and framework using its generative AI services to accelerate the design and development of physical products. This is a direct, vendor-specific playbook for applying GenAI to tangible goods.
VMLOps Publishes Free GitHub Repository with 300+ AI/ML Engineer Interview Questions
VMLOps has released a comprehensive, free GitHub repository containing over 300 Q&As covering LLM fundamentals, RAG, fine-tuning, and system design for AI engineering roles.
POP.STORE Launches ECHO-ME, an Agentic AI Platform to Run Creator Businesses
POP.STORE has launched ECHO-ME, an 'agentic AI commerce platform' designed to autonomously manage the business operations for creators. It monitors social DMs, detects brand deals, ranks followers, and drives sales, aiming to act as an intelligent operating layer for 15,000 onboarded creators.
Agentic AI Commerce Platforms: A16z Argues Autonomous Agents Could End the Online Ad Model
A16z Crypto argues that AI agents shopping for users could dismantle the $291B online ad industry by eliminating 'distraction' as a business model. The future hinges on open protocols, not new walled gardens.
Google Advances Agentic Shopping with UCP as OpenAI Retreats from Instant Checkout
Google is expanding its Universal Commerce Protocol (UCP) for AI shopping agents, adding multi-item cart creation, real-time catalog updates, and identity linking. This comes as OpenAI pulls back from its ChatGPT Instant Checkout feature, signaling a strategic pivot in the AI commerce landscape.
AI Agent Types and Communication Architectures: From Simple Systems to Multi-Agent Ecosystems
A guide to designing scalable AI agent systems, detailing agent types, multi-agent patterns, and communication architectures for real-world enterprise production. This represents the shift from reactive chatbots to autonomous, task-executing AI.
POP.STORE Launches ECHO-ME: An Agentic AI Commerce Platform for Creators
POP.STORE announced ECHO-ME, an agentic AI platform designed to autonomously run a creator's business operations. It monitors social channels, detects brand deals, and converts fan interactions into revenue, launching with 15,000 creators. This represents a shift from task automation to full business operation for the solo creator economy.