technology release
30 articles about technology release in AI news
Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal Capabilities
Google has released the Gemma 4 family of open-weight models, derived from Gemini 3 technology. The four models, ranging from 2B to 31B parameters and including a Mixture-of-Experts variant, are available under a permissive Apache 2.0 license and feature multimodal processing.
OpenAI Image Generation V2 Release Imminent, Per Leak
A post from a known leaker indicates OpenAI's next image generation model, potentially DALL-E 4, is about to be released. This would mark a major competitive move in the rapidly evolving text-to-image space.
Google Gemma 4 Model Reportedly in Testing, Signaling Next-Gen Open-Weight LLM Release
A developer reports that Google's Gemma 4 model is 'incoming' and currently being tested. This suggests the next iteration of Google's open-weight language model family is nearing release.
Tongyi Lab Releases World's First Open-Source Multi-Speaker AI Dubbing Model
Alibaba's Tongyi Lab has released the first open-source AI model capable of dubbing multi-speaker conversations, addressing one of the hardest problems in AI video generation. The model synchronizes voice with lip movements across multiple speakers in a single pass.
CATCHES Launches Generative AI with Physics-Based Sizing Technology for Fashion E-Commerce
CATCHES has launched a generative AI platform for fashion e-commerce featuring physics-based sizing technology. The launch is in partnership with luxury brand AMIRI and is powered by NVIDIA's AI infrastructure. This directly targets a core pain point in online apparel retail: fit uncertainty and high return rates.
Massive Open-Source Dataset of Computer Screen Recordings Released to Train AI Agents
Researchers have released the world's largest open-source dataset of computer-use recordings on Hugging Face. The collection contains 48,478 screen recording videos totaling approximately 12,300 hours of professional software usage, licensed under CC-BY-4.0 for AI training and evaluation.
Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments
OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.
Google's Gemini 3.1 Flash Emerges as OpenAI Prepares Next Major Release
Google announces Gemini 3.1 Flash as a lightweight AI model optimized for speed and efficiency, arriving just as OpenAI prepares its next major release. This development signals intensifying competition in the AI space with implications for developers and enterprise users.
NVIDIA Releases NVPanoptix-3D on Hugging Face: Single-Image 3D Indoor Scene Reconstruction
NVIDIA has open-sourced NVPanoptix-3D, a model that reconstructs complete 3D indoor scenes—including panoptic segmentation, depth, and geometry—from a single RGB image in one forward pass.
Kling AI Video Platform Goes Global: How 3.0 Release Redefines Accessible Cinematic AI
Kling AI has launched its 3.0 platform worldwide, offering 1080p cinematic video generation and advanced motion control. This marks a significant step toward professional-grade AI video tools becoming accessible to global creators.
NVIDIA DLSS 4.5 Launches with Enhanced AI Frame Generation and Ray Reconstruction
NVIDIA has released DLSS 4.5, a major update to its AI-powered upscaling technology featuring new frame generation modes and improved ray reconstruction. The update is available now for GeForce RTX 40 and 50 Series GPUs.
The Self-Improving AI Era Begins: GPT-5.4 and Autonomous Research Breakthroughs
OpenAI's GPT-5.4 release and Andrej Karpathy's autonomous AI research experiment signal a shift toward AI systems that can improve their own underlying technology. The article frames this as the beginning of closed-loop AI self-improvement.
Google's Gemma 4 Emerges: The Next Generation of Open AI Models
Google has announced the upcoming release of Gemma 4, the next iteration of its open-source AI model family. This development signals Google's continued commitment to accessible AI technology and intensified competition in the open model space.
OpenAI's Audio Revolution: New Voice Models Signal Major AI Advancements
OpenAI appears poised to release new audio models that could significantly enhance voice interaction capabilities. This development follows recent trademark filings and suggests major improvements to voice mode technology.
AI Giants Poised for Breakthrough: 1 Trillion Parameter Models with Million-Token Context Windows
Industry insiders hint at imminent releases of AI models with 1 trillion parameters and 1-million-token context windows. If accurate, this would be a substantial jump in scale that could change how people interact with technology.
Bones Studio Demos Motion-Capture-to-Robot Pipeline for Home Tasks
Bones Studio released a demo showing its 'Captured → Labeled → Transferred' pipeline. It uses optical motion capture to record human tasks, then transfers the data for a humanoid robot to replicate the actions in simulation.
Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation
A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.
Atomic Bot Launches Native App to Simplify OpenClaw (Clawdbot) Setup on macOS and Windows
Atomic Bot has released a native, open-source desktop application that simplifies the notoriously complex setup process for the OpenClaw AI agent. The app allows users to install and configure OpenClaw with one click on macOS and Windows, with Linux support planned.
Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models
Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.
Qwen 3.6 Plus Preview Launches on OpenRouter with Free 1M Token Context, Disrupting API Pricing
Alibaba's Qwen team has released a preview of Qwen 3.6 Plus on OpenRouter with a 1 million token context window, charging $0 for both input and output tokens. This directly undercuts paid long-context offerings from Anthropic and OpenAI.
OpenAI Publishes Codex Use-Case Gallery with Practical Examples for Developers
OpenAI has released a public gallery of practical examples demonstrating how to use its Codex model for real-world programming tasks. The resource provides concrete prompts and outputs for developers building with the API.
Cohere Transcribe: 2B-Parameter Open-Source ASR Model Achieves 5.42% WER, Topping Hugging Face Leaderboard
Cohere has released Transcribe, a 2B-parameter open-source speech recognition model. Cohere reports a 5.42% average word error rate (WER), beating OpenAI Whisper v3 and topping the Hugging Face Open ASR Leaderboard.
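For readers unfamiliar with the metric, word error rate is the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. The sketch below is a minimal illustration of that definition only. It is not Cohere's or the leaderboard's evaluation code, the function name `word_error_rate` is hypothetical, and it assumes plain whitespace tokenization with no text normalization (real benchmarks typically normalize casing and punctuation first).

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()  # assumption: whitespace tokenization only
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the): 2 errors / 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, a 5.42% WER corresponds to roughly 5 to 6 word-level errors per 100 reference words, averaged over the evaluation sets.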
LVMH Executive Makes Personal Investment in Generative AI Virtual Try-On Startup
An LVMH executive has personally invested in a generative AI-powered virtual try-on technology startup. This signals high-level, direct belief in the technology's potential to impact the luxury customer journey, beyond corporate R&D.
Anthropic's Legal AI Plugin Triggers Market Shift as Legal Data Provider Stocks Decline
Anthropic's release of a legal plugin for its Claude Cowork agent system has reportedly caused a decline in legal data provider stocks, highlighting the competitive pressure AI agents place on traditional legal tech.
OpenAI Launches GPT-5.4 Mini and Nano: Smaller, Cheaper Variants with Same Reasoning Modes
OpenAI has released GPT-5.4 mini and nano, two more affordable variants of its GPT-5.4 model. The nano version is positioned as the smallest and most cost-effective option in the lineup.
Waves Audio Launches Lightning V3.1: 10-Second Voice Cloning with 44.1 kHz Studio Quality
Waves Audio released Lightning V3.1, a voice cloning model that creates studio-quality voice replicas from just 10 seconds of audio with under 100 ms latency. The update supports over 50 languages and targets real-time applications.
Luma Labs Launches Uni-1: An Autoregressive Transformer for Image Generation with a Pre-Generation Reasoning Phase
Luma Labs has released Uni-1, a foundational image model that uses an autoregressive transformer to reason about user intent before generating pixels. It aims to address the 'intent gap' common in diffusion models by adding a structured reasoning step.
Google Unveils Universal Commerce Protocol (UCP) for Securing Agentic Commerce
Google has released the Universal Commerce Protocol (UCP), an open-source standard designed to secure transactions conducted by AI agents. This framework aims to establish trust and provenance in automated commerce, with direct implications for luxury goods authentication and supply chain transparency.
NVIDIA DLSS 5 Demo Shows 3D Guided Neural Rendering for Next-Gen Upscaling
A leaked demo of NVIDIA's upcoming DLSS 5 technology showcases 3D guided neural rendering, promising a significant leap in image reconstruction quality for real-time graphics.
NVIDIA DLSS 5 Leak Suggests AI Frame Generation Without Motion Vectors
A leaked NVIDIA roadmap slide suggests DLSS 5 will use a new 'AI Frame Generation' technique that does not rely on traditional motion vectors, potentially simplifying game integration. The feature is slated for a 2026 release.