object detection

30 articles about object detection in AI news

YOLO26 Eliminates NMS Bottleneck, Revolutionizing Real-Time Object Detection

YOLO26 introduces a groundbreaking single-pass architecture that eliminates the need for Non-Maximum Suppression, dramatically accelerating inference speeds while maintaining high detection accuracy for up to 300 objects per image.

85% relevant

Roboflow's RF-DETR Model Ported to Apple MLX, Enabling Real-Time On-Device Instance Segmentation

Roboflow's RF-DETR object detection model is now available on Apple's MLX framework, enabling real-time instance segmentation on Apple Silicon devices. This port unlocks new on-device visual analysis applications for robotics and augmented vision-language models.

89% relevant

VGGT-Det: How AI Is Learning to See in 3D Without Camera Calibration

Researchers have developed VGGT-Det, a breakthrough framework for multi-view 3D object detection that works without calibrated camera poses. The system mines internal geometric priors through attention mechanisms, outperforming traditional methods in indoor environments.

85% relevant

mmAnomaly: New Multi-Modal Framework Uses Conditional Latent Diffusion to Achieve 94% F1 Score for mmWave Anomaly Detection

Researchers introduced mmAnomaly, a multi-modal anomaly detection system that uses a conditional latent diffusion model to synthesize expected mmWave spectra from visual context, achieving up to a 94% F1 score for detecting concealed weapons and through-wall anomalies.

72% relevant

New AI Framework Uses Diffusion Models to Authenticate Anti-Counterfeit Codes

Researchers propose a novel diffusion-based AI system to authenticate Copy Detection Patterns (CDPs), a key anti-counterfeiting technology. It outperforms existing methods by classifying printer signatures, showing resilience against unseen counterfeits.

89% relevant

RF-DETR: A Real-Time Transformer Architecture That Surpasses 60 mAP on COCO

RF-DETR is a new lightweight detection transformer using neural architecture search and internet-scale pre-training. It's the first real-time detector to exceed 60 mAP on COCO, addressing generalization issues in current models.

85% relevant

GPT-5.2 Pro Emerges as Powerful Fact-Checking Assistant, Transforming Verification Workflows

OpenAI's GPT-5.2 Pro demonstrates remarkable fact-checking capabilities, automatically identifying objections, caveats, and mathematical errors in written content. This represents a significant advancement in AI-assisted verification previously limited to specialized domains.

85% relevant

Beyond the Hype: The New Open Benchmark Putting Every AI Code Review Tool to the Test

A new open benchmarking platform allows developers to test their custom AI code review bots against eight leading commercial tools using real-world data. This transparent approach moves beyond marketing claims to provide objective performance comparisons.

85% relevant

Humanoid Robot Deployed for Traffic Control in Shenzhen, China

A humanoid robot equipped with cameras and AI has been deployed to direct traffic at a busy intersection in Shenzhen, China. This represents a real-world test of embodied AI for public infrastructure management.

85% relevant

Simon Willison's 'scan-for-secrets' CLI Tool Detects API Keys in Logs

Simon Willison built 'scan-for-secrets', a Python CLI tool for scanning log files for accidentally exposed API keys. It's a lightweight utility for developers to sanitize data before sharing.

75% relevant

Bones Studio Demos Motion-Capture-to-Robot Pipeline for Home Tasks

Bones Studio released a demo showing its 'Captured → Labeled → Transferred' pipeline. It uses optical motion capture to record human tasks, then transfers the data for a humanoid robot to replicate the actions in simulation.

85% relevant

Hasan Toor Announces 'First AI Sales Tool That Does the Whole Job' in Cryptic Tweet

AI influencer Hasan Toor posted a tweet claiming a new AI sales tool is the first to handle the entire sales job, not just data or enrichment. No product name, company, or technical specifications were provided.

89% relevant

Computer Vision Is Transforming Retail Loss Prevention

The article discusses the growing adoption of computer vision systems in retail to prevent theft, manage inventory, and enhance store security. This represents a direct application of AI to a long-standing, costly industry problem.

100% relevant

AI Model Analyzes Blood Proteins to Diagnose Alzheimer's, Parkinson's, ALS, and Stroke with 17,187-Patient Study

An AI model can diagnose Alzheimer's, Parkinson's, ALS, frontotemporal dementia, and stroke from a single blood sample by analyzing protein profiles. It outperformed symptom-based diagnosis at predicting future cognitive decline in a Nature-published study of 17,187 people.

97% relevant

Axios NPM Package Under Active Supply Chain Attack, Potentially Impacts 100M+ Weekly Installs

The widely-used JavaScript HTTP client library Axios may be compromised via a malicious dependency in its latest release, exhibiting malware-like behavior including shell execution and artifact cleanup. With over 100 million weekly downloads, this represents a critical software supply chain threat.

99% relevant

Atlanta Startup Deploys AI-Powered Robot Dogs for Nighttime Neighborhood Security

A U.S. startup based in Atlanta is deploying quadrupedal robots for autonomous nighttime neighborhood patrols. The units are designed to detect intruders and alert residents, representing a commercial pivot for legged robotics.

85% relevant

Stop Reviewing AI Code. Start Reviewing CLAUDE.md.

Anthropic's research shows the bottleneck is verification, not generation. Shift your Claude Code workflow from writing prompts to writing precise, testable specifications.

70% relevant

GUIDE: A New Benchmark Reveals AI's Struggle to Understand User Intent in GUI Software

Researchers introduce GUIDE, a benchmark for evaluating AI's ability to understand user behavior and intent in open-ended GUI tasks. Across 10 software applications, state-of-the-art models struggled, highlighting a critical gap between automation and true collaborative assistance.

74% relevant

How to Build a 3D Engine with Claude Code: The Demoscene Case Study

A developer used Claude Code to build a complete 3D engine from scratch. Here are the actionable prompting techniques and CLAUDE.md strategies that made it work.

90% relevant

Study of 280,000 Samples Shows AI Detectors Fail on Short Coursework and STEM Writing, Flagging Real Student Work

A comprehensive study testing 13 AI detectors on 280,000+ samples found they perform unreliably, especially on short assignments and STEM writing, where real student work is often flagged as AI-generated due to formulaic language.

87% relevant

AgenticGEO: Self-Evolving AI Framework for Generative Search Engine Optimization Outperforms 14 Baselines

Researchers propose AgenticGEO, an AI framework that evolves content strategies to maximize inclusion in generative search engine outputs. It uses MAP-Elites and a Co-Evolving Critic to reduce costly API calls, achieving state-of-the-art performance across 3 datasets.

91% relevant

AgentComm-Bench Exposes Catastrophic Failure Modes in Cooperative Embodied AI Under Real-World Network Conditions

Researchers introduce AgentComm-Bench, a benchmark that stress-tests multi-agent embodied AI systems under six real-world network impairments. It reveals performance drops of over 96% in navigation and 85% in perception F1, highlighting a critical gap between lab evaluations and deployable systems.

100% relevant

OpenClaw AI Agent Adds Real-Time Vision to Meta Ray-Ban Smart Glasses via Gemini Live API

An open-source project enables Meta Ray-Ban smart glasses to function as a real-time AI assistant. It streams the glasses' camera feed (~1fps) to Gemini Live for visual context, then delegates actions via the OpenClaw agent framework.

85% relevant

Deloitte on Driving Adoption of the 'Human with Agentic AI' Era

Deloitte outlines the shift to a 'human with agentic AI' paradigm, where autonomous AI agents act as proactive partners. This requires new organizational strategies to integrate agents that can preserve institutional knowledge and interface with legacy systems.

79% relevant

ORCA Dexterity Open-Sources Three 3D-Printable Robotic Hands with Self-Dislocating Joints for ~$2,200

ORCA Dexterity released STL files for three tendon-driven anthropomorphic robotic hands featuring self-dislocating joints for reliability. The OrcaHand Touch variant includes high-resolution fingertip sensors with 83 taxels per fingertip at 1mm resolution.

97% relevant

Efficient Fine-Tuning of Vision-Language Models with LoRA & Quantization

A technical guide details methods for fine-tuning large VLMs like GPT-4V and LLaVA using Low-Rank Adaptation (LoRA) and quantization. This reduces computational cost and memory footprint, making custom VLM training more accessible.

80% relevant

Financial AI Audit Test Reveals LLMs Struggle with Complex Rule-Based Reasoning

Researchers introduce FinRule-Bench, a new benchmark testing how well large language models can audit financial statements against accounting principles. The benchmark reveals models perform well on simple rule verification but struggle with complex multi-violation diagnosis.

79% relevant

AI Breakthrough: Single Model Masters Multiple Code Analysis Tasks with Minimal Training

Researchers demonstrate that parameter-efficient fine-tuning enables large language models to perform diverse code analysis tasks simultaneously, matching full fine-tuning performance while reducing computational costs by up to 85%.

83% relevant

The AI Trap: How Professors Are Fighting Back Against Student Over-Reliance on Language Models

University professors are deploying 'trap words' in digital assignments to catch students who blindly use AI for complex cognitive tasks. While science departments embrace these tools, literature professors report a collapse in students' ability to synthesize information independently.

85% relevant

Vision AI Trends 2026: Manufacturing, Warehouse Automation, and Luxury Authentication Enter Visual Data Era

A 2026 trends report highlights Vision AI's expansion into manufacturing quality inspection, warehouse automation, and luxury brand authentication, marking a shift toward 3D visual data systems. This reflects the maturation of computer vision beyond basic recognition into operational and trust applications.

95% relevant