technology release

30 articles about technology release in AI news

Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal Capabilities

Google has released the Gemma 4 family of open-weight models, derived from Gemini 3 technology. The four models, ranging from 2B to 31B parameters and including a Mixture-of-Experts variant, are available under a permissive Apache 2.0 license and feature multimodal processing.

100% relevant

OpenAI Image Generation V2 Release Imminent, Per Leak

A post from a known leaker indicates OpenAI's next image generation model, potentially DALL-E 4, is about to be released. This would mark a major competitive move in the rapidly evolving text-to-image space.

85% relevant

Google Gemma 4 Model Reportedly in Testing, Signaling Next-Gen Open-Weight LLM Release

A developer reports that Google's Gemma 4 model is 'incoming' and currently being tested. This suggests the next iteration of Google's open-weight language model family is nearing release.

87% relevant

Tongyi Lab Releases World's First Open-Source Multi-Speaker AI Dubbing Model

Alibaba's Tongyi Lab has released the first open-source AI model capable of dubbing multi-speaker conversations, addressing one of the hardest problems in AI video generation. The model synchronizes voice with lip movements across multiple speakers in a single pass.

85% relevant

CATCHES Launches Generative AI with Physics-Based Sizing Technology for Fashion E-Commerce

CATCHES has launched a generative AI platform for fashion e-commerce featuring physics-based sizing technology. The launch is in partnership with luxury brand AMIRI and is powered by NVIDIA's AI infrastructure. This directly targets a core pain point in online apparel retail: fit uncertainty and high return rates.

100% relevant

Massive Open-Source Dataset of Computer Screen Recordings Released to Train AI Agents

Researchers have released the world's largest open-source dataset of computer-use recordings on Hugging Face. The collection contains 48,478 screen recording videos totaling approximately 12,300 hours of professional software usage, licensed under CC-BY-4.0 for AI training and evaluation.

97% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

85% relevant

Google's Gemini 3.1 Flash Emerges as OpenAI Prepares Next Major Release

Google announces Gemini 3.1 Flash as a lightweight AI model optimized for speed and efficiency, arriving just as OpenAI prepares its next major release. This development signals intensifying competition in the AI space with implications for developers and enterprise users.

85% relevant

NVIDIA Releases NVPanoptix-3D on Hugging Face: Single-Image 3D Indoor Scene Reconstruction

NVIDIA has open-sourced NVPanoptix-3D, a model that reconstructs complete 3D indoor scenes—including panoptic segmentation, depth, and geometry—from a single RGB image in one forward pass.

90% relevant

Kling AI Video Platform Goes Global: How 3.0 Release Redefines Accessible Cinematic AI

Kling AI has launched its 3.0 platform worldwide, offering 1080p cinematic video generation and advanced motion control. This marks a significant step toward professional-grade AI video tools becoming accessible to global creators.

85% relevant

Nvidia DLSS 4.5 Launches with Enhanced AI Frame Generation and Ray Reconstruction

Nvidia has released DLSS 4.5, a major update to its AI-powered upscaling technology featuring new frame generation modes and improved ray reconstruction. The update is available now for GeForce RTX 40 and 50 Series GPUs.

85% relevant

The Self-Improving AI Era Begins: GPT-5.4 and Autonomous Research Breakthroughs

OpenAI's GPT-5.4 release and Andrej Karpathy's autonomous AI research experiment signal a paradigm shift where AI systems can now improve their own underlying technology. This marks the beginning of closed-loop AI self-improvement.

75% relevant

Google's Gemma 4 Emerges: The Next Generation of Open AI Models

Google has announced the upcoming release of Gemma 4, the next iteration of its open-source AI model family. This development signals Google's continued commitment to accessible AI technology and intensified competition in the open model space.

85% relevant

OpenAI's Audio Revolution: New Voice Models Signal Major AI Advancements

OpenAI appears poised to release new audio models that could significantly enhance voice interaction capabilities. This development follows recent trademark filings and suggests major improvements to voice mode technology.

85% relevant

AI Giants Poised for Breakthrough: 1 Trillion Parameter Models with Million-Token Context Windows

Industry insiders hint at imminent releases of AI models with unprecedented scale—1 trillion parameters and 1 million token context windows. This represents a quantum leap in AI capability that could transform how we interact with technology.

85% relevant

Bones Studio Demos Motion-Capture-to-Robot Pipeline for Home Tasks

Bones Studio released a demo showing its 'Captured → Labeled → Transferred' pipeline. It uses optical motion capture to record human tasks, then transfers the data for a humanoid robot to replicate the actions in simulation.

85% relevant

Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation

A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.

91% relevant

Atomic Bot Launches Native App to Simplify OpenClaw (Clawdbot) Setup on macOS and Windows

Atomic Bot has released a native, open-source desktop application that simplifies the notoriously complex setup process for the OpenClaw AI agent. The app allows users to install and configure OpenClaw with one click on macOS and Windows, with Linux support planned.

85% relevant

Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models

Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.

93% relevant

Qwen 3.6 Plus Preview Launches on OpenRouter with Free 1M Token Context, Disrupting API Pricing

Alibaba's Qwen team has released a preview of Qwen 3.6 Plus on OpenRouter with a 1 million token context window, charging $0 for both input and output tokens. This directly undercuts paid long-context offerings from Anthropic and OpenAI.

97% relevant

OpenAI Publishes Codex Use-Case Gallery with Practical Examples for Developers

OpenAI has released a public gallery of practical examples demonstrating how to use its Codex model for real-world programming tasks. The resource provides concrete prompts and outputs for developers building with the API.

85% relevant

Cohere Transcribe: 2B-Parameter Open-Source ASR Model Achieves 5.42% WER, Topping Hugging Face Leaderboard

Cohere released Transcribe, a 2B-parameter open-source speech recognition model. It claims a 5.42% average word error rate, beating OpenAI Whisper v3 and topping the Hugging Face Open ASR Leaderboard.

100% relevant

LVMH Executive Makes Personal Investment in Generative AI Virtual Try-On Startup

An LVMH executive has personally invested in a generative AI-powered virtual try-on technology startup. This signals high-level, direct belief in the technology's potential to impact the luxury customer journey, beyond corporate R&D.

100% relevant

Anthropic's Legal AI Plugin Triggers Market Shift as Legal Data Provider Stocks Decline

Anthropic's release of a legal plugin for its Claude Cowork agent system has reportedly caused a decline in legal data provider stocks, highlighting the competitive pressure AI agents place on traditional legal tech.

75% relevant

OpenAI Launches GPT-5.4 Mini and Nano: Smaller, Cheaper Variants with Same Reasoning Modes

OpenAI has released GPT-5.4 mini and nano, two more affordable variants of its GPT-5.4 model. The nano version is positioned as the smallest and most cost-effective option in the lineup.

85% relevant

Waves Audio Launches Lightning V3.1: 10-Second Voice Cloning with 44.1kHz Studio Quality

Waves Audio released Lightning V3.1, a voice cloning model that creates studio-quality voice replicas from just 10 seconds of audio with under 100ms latency. The update supports over 50 languages and targets real-time applications.

87% relevant

Luma Labs Launches Uni-1: An Autoregressive Transformer for Image Generation with a Pre-Generation Reasoning Phase

Luma Labs has released Uni-1, a foundational image model that uses an autoregressive transformer to reason about user intent before generating pixels. It aims to address the 'intent gap' common in diffusion models by adding a structured reasoning step.

88% relevant

Google Unveils Universal Commerce Protocol (UCP) for Securing Agentic Commerce

Google has released the Universal Commerce Protocol (UCP), an open-source standard designed to secure transactions conducted by AI agents. This framework aims to establish trust and provenance in automated commerce, with direct implications for luxury goods authentication and supply chain transparency.

70% relevant

NVIDIA DLSS 5 Demo Shows 3D Guided Neural Rendering for Next-Gen Upscaling

A leaked demo of NVIDIA's upcoming DLSS 5 technology showcases 3D guided neural rendering, promising a significant leap in image reconstruction quality for real-time graphics.

85% relevant

NVIDIA DLSS 5 Leak Suggests AI Frame Generation Without Motion Vectors

A leaked NVIDIA roadmap slide suggests DLSS 5 will use a new 'AI Frame Generation' technique that does not rely on traditional motion vectors, potentially simplifying game integration. The feature is slated for a 2026 release.

85% relevant