medical imaging
30 articles about medical imaging in AI news
MAIL Network: A Breakthrough in Efficient and Robust Multimodal Medical AI
Researchers have developed MAIL and Robust-MAIL networks that overcome key limitations in multimodal medical imaging analysis, achieving up to 9.34% performance gains while reducing computational costs by 78.3% and enhancing adversarial robustness.
GPT-5 Shows Promise as Clinical Assistant but Can't Replace Specialized Medical AI
New research evaluates GPT-5's clinical reasoning capabilities, finding significant improvements over GPT-4o in medical text analysis but limitations in specialized imaging tasks. The study reveals generalist AI models are advancing toward integrated clinical reasoning but still trail domain-specific systems in critical diagnostic areas.
CoRe Framework Integrates Equivariant Contrastive Learning for Medical Image Registration, Surpassing Baseline Methods
Researchers propose CoRe, a medical image registration framework that jointly optimizes an equivariant contrastive learning objective with the registration task. The method learns deformation-invariant feature representations, improving performance on abdominal and thoracic registration tasks.
ReXInTheWild Benchmark Reveals VLMs Struggle with Medical Photos: Gemini-3 Leads at 78%, MedGemma Trails at 37%
Researchers introduced ReXInTheWild, a benchmark of 955 clinician-verified questions based on 484 real medical photographs. Leading multimodal models show wide performance gaps, with Gemini-3 scoring 78% accuracy while the specialized MedGemma model achieved only 37%.
Health AI Benchmarks Show 'Validity Gap': 0.6% of Queries Use Raw Medical Records, 5.5% Cover Chronic Care
Analysis of 18,707 health queries across six public benchmarks reveals a structural misalignment with clinical reality. Benchmarks over-index on wellness data (17.7%) while under-representing lab values (5.2%), imaging (3.8%), and safety-critical scenarios.
Microsoft Releases GigaTIME: AI Model Generates Protein Maps from Standard Medical Images
Microsoft has released GigaTIME, an AI model that generates detailed spatial protein maps from standard, low-cost medical images like H&E stains. This could significantly reduce the cost and time of cancer tissue analysis.
Engineer Uses ChatGPT and Google to Self-Diagnose Rare Spinal Condition After 17-Month Medical Odyssey
A software engineer with no medical training used ChatGPT-4o and Google to correctly diagnose his own rare spinal CSF leak after 17 months of failed specialist consultations. The case highlights AI's emerging role as a diagnostic aid in complex medical scenarios.
Musk Predicts Humanoid Robots Will Democratize Elite Medical Care Worldwide
Elon Musk claims humanoid robots with advanced dexterity will soon deliver medical care superior to today's best hospitals to every person on Earth, outperforming current human surgical standards.
The Hidden Achilles' Heel of AI Imaging: How Tiny Mismatches Cripple Compressive Vision Systems
New research reveals that state-of-the-art AI for compressive imaging catastrophically fails when its mathematical assumptions about hardware don't match reality. The InverseNet benchmark shows performance drops of 10-21 dB, eliminating AI's advantage over classical methods in real-world deployment.
MedFeat: How AI is Revolutionizing Medical Feature Engineering with Model-Aware Intelligence
Researchers have developed MedFeat, an innovative framework that combines large language models with clinical expertise to create smarter features for medical predictions. Unlike traditional approaches, MedFeat incorporates model awareness and explainability to generate features that improve accuracy and generalization across healthcare settings.
How AI Overfitting Masks Medical Breakthroughs: fMRI Study Reveals Critical Flaw in Parkinson's Detection
New research reveals that standard AI evaluation methods for detecting early Parkinson's disease from brain scans suffer from severe data leakage, creating misleading near-perfect results. When properly tested, lightweight models outperform complex ones in data-scarce medical applications.
Medical AI Breakthrough: New Method Teaches Vision-Language Models to Understand Clinical Negation
Researchers have developed a novel fine-tuning technique that significantly improves how medical vision-language models understand negation in clinical reports. The method uses causal tracing to identify which neural network layers are most responsible for processing negative statements, then selectively trains those layers.
Microsoft & CUHK Debut 'Medical AI Scientist' Agent That Generates Ideas, Runs Experiments, and Writes Papers
Microsoft Research and CUHK have developed an autonomous AI agent that can formulate research ideas, execute experiments, and author papers, achieving near-MICCAI quality on 171 clinical cases across 19 tasks.
Claude AI Diagnoses Positional Headache in Complex Medical Case After Specialists Failed
A 62-year-old patient with multiple chronic conditions and positional migraines received a correct diagnosis and treatment plan from Claude AI after years of unsuccessful specialist visits. The $317 CPAP machine it recommended solved the previously unexplained condition.
Beyond the Hype: New Benchmark Reveals When AI Truly Benefits from Combining Medical Data
A comprehensive new study systematically benchmarks multimodal AI fusion of Electronic Health Records and chest X-rays, revealing precisely when combining data types improves clinical predictions and when it fails. The research provides crucial guidance for developing effective and reliable AI systems for healthcare deployment.
Neko Health Launches $400 AI-Powered Full-Body Health Scans in New York This Spring
Neko Health, the $1.8B startup founded by Spotify's Daniel Ek, is launching its AI-driven full-body health screening service in the US. The $400 scan uses imaging and blood tests to screen for cancer, heart disease, and diabetes risk, though medical experts are divided on its efficacy.
SteerViT Enables Natural Language Control of Vision Transformer Attention Maps
Researchers introduced SteerViT, a method that modifies Vision Transformers to accept natural language instructions, enabling users to steer the model's visual attention toward specific objects or concepts while maintaining representation quality.
AI Model Analyzes Blood Proteins to Diagnose Alzheimer's, Parkinson's, ALS, and Stroke with 17,187-Patient Study
An AI model can diagnose Alzheimer's, Parkinson's, ALS, frontotemporal dementia, and stroke from a single blood sample by analyzing protein profiles. It outperformed symptom-based diagnosis at predicting future cognitive decline in a Nature-published study of 17,187 people.
AI's 'Hollowing Out' Effect: How Automation Targets High-Value, High-Skill Tasks First
A viral commentary by George Pu posits that AI's primary impact isn't mass job elimination but the systematic automation of a role's most valuable, specialized, and well-compensated tasks, leaving workers with diminished, less critical duties.
Multimodal RAG System for Chest X-Ray Reports Achieves 0.95 Recall@5, Reduces Hallucinations with Citation Constraints
Researchers developed a multimodal retrieval-augmented generation system for drafting radiology impressions that fuses image and text embeddings. The system achieves Recall@5 above 0.95 on clinically relevant findings and enforces citation coverage to prevent hallucinations.
Gastric-X: New 1.7K-Case Multimodal Benchmark Challenges VLMs on Realistic Gastric Cancer Diagnosis Workflow
Researchers introduce Gastric-X, a comprehensive multimodal benchmark with 1.7K gastric cancer cases including CT scans, endoscopy, lab data, and expert notes. It evaluates VLMs on five clinical tasks to test if they can correlate biochemical signals with tumor features like physicians do.
FedAgain: Dual-Trust Federated Learning Boosts Kidney Stone ID Accuracy to 94.7% on MyStone Dataset
Researchers propose FedAgain, a trust-based federated learning framework that dynamically weights client contributions using benchmark reliability and model divergence. It achieves 94.7% accuracy on kidney stone identification while maintaining robustness against corrupted data from multiple hospitals.
Microsoft's AI Converts Standard Pathology Slides to Spatial Proteomics Maps, Cutting Costs and Time
Microsoft researchers developed an AI method to generate spatial proteomics data from routine H&E-stained pathology slides. This bypasses expensive, specialized equipment, potentially accelerating cancer analysis and expanding access.
Microsoft's GigaTIME AI Predicts Protein Maps from $5 Tissue Slides, Revealing 1,234 New Survival Correlations
Microsoft released GigaTIME, an AI model that predicts expensive protein maps from cheap tissue slides. Trained on 40M cells from 14,256 patients, it discovered 1,234 new protein-survival connections.
NVIDIA Releases Brain MRI Generation Model on Hugging Face: 3D Latent Diffusion for T1, FLAIR, T2, and SWI Scans
NVIDIA has open-sourced a 3D latent diffusion model for generating high-resolution brain MRI scans across four modalities. The model claims state-of-the-art FID scores and 33× faster inference than prior methods.
Beyond Simple Recognition: How DeepIntuit Teaches AI to 'Reason' About Videos
Researchers have developed DeepIntuit, a new AI framework that moves video classification from simple pattern imitation to intuitive reasoning. The system uses vision-language models and reinforcement learning to handle complex, real-world video variations where traditional models fail.
When AI Gets Stumped: Study Reveals Language Models' 'Brain Activity' Collapses Under Pressure
New research shows that when large language models encounter difficult questions, their internal representations dramatically shrink and simplify. This 'activity collapse' reveals fundamental limitations in how current AI processes complex reasoning tasks.
Granulon AI Model Bridges Vision-Language Gap with Adaptive Granularity
Researchers propose Granulon, a new multimodal AI that dynamically adjusts visual analysis granularity based on text queries. The DINOv3-based model improves accuracy by ~30% and reduces hallucinations by ~20% compared to CLIP-based systems.
HyperTokens Break the Forgetting Cycle: A New Architecture for Continual Multimodal AI Learning
Researchers introduce HyperTokens, a transformer-based system that generates task-specific tokens on demand for continual video-language learning. This approach dramatically reduces catastrophic forgetting while maintaining fixed memory costs, enabling AI models to learn sequentially without losing previous knowledge.
AI's Hidden Reasoning Flaw: New Framework Tackles Multimodal Hallucinations at Their Source
Researchers introduce PaLMR, a novel framework that addresses a critical weakness in multimodal AI: 'process hallucinations,' where models give correct answers but for the wrong visual reasons. By aligning both outcomes and reasoning processes, PaLMR significantly improves visual reasoning fidelity.