Sony Pixel Power calrec Sony

NVIDIA Accelerates Google DeepMind's DiffusionGemma for Local AI

10/06/2026

Today, Google DeepMind released DiffusionGemma - an experimental open model built for exceptionally fast text generation. NVIDIA has optimized DiffusionGemma to run even faster across NVIDIA GeForce RTX GPUs, the NVIDIA RTX PRO platform and NVIDIA DGX Spark systems, from local PCs to the cloud.

Rather than generating text one word at a time, DiffusionGemma generates multiple words in parallel to output whole blocks of text, opening a new, low-latency frontier for the kind of single-user workloads that developers, researchers and AI enthusiasts run every day.

Features of the new model include:

Parallel generation: DiffusionGemma denoises up to 256 tokens per step instead of predicting one at a time.

Built on Gemma 4: DiffusionGemma is built on Gemma 4, a 26-billion-parameter mixture-of-experts model that activates just 3.8 billion parameters per step, pairing a diffusion head with Google's Gemma 4 architecture.

Up to 4x faster performance: The boost means fast text generation, where single-user generation usually stalls - on local hardware.

Open and local: DiffusionGemma is open weights under a permissive Apache 2.0 license and runs entirely on RTX and DGX Spark - no cloud, no per-token cost - with day-zero support in Hugging Face Transformers, vLLM and Unsloth.

A Different Way to Generate Text Almost every large language model (LLM) in wide use today is autoregressive - meaning it generates text one token at a time, with each new word depending on the one before it. That sequential process is what makes interactive AI feel like it's typing.

DiffusionGemma takes a different path. Built on the Gemma 4 26B mixture-of-experts architecture, it generates text the way diffusion models generate images: by starting from noise and refining a whole block of text at once. Each step denoises up to 256 tokens in parallel rather than emitting a single token and waiting to compute the next.

The result is a model that thinks in blocks instead of sequentially. For latency-sensitive, single-user work - such as interactive chat, agentic loops or on-device assistants that plan and act - that parallelism translates into responses fast enough to keep pace with how developers think and iterate.

DiffusionGemma Flies on NVIDIA GPUs Generating one token at a time is fundamentally a memory-bound problem - a traditional LLM spends most of its time waiting on memory bandwidth, not doing math, which leaves a lot of compute on the table.

Diffusion flips the equation. Pulling a full 256-token block through the transformer in parallel is a compute-bound workload - exactly what NVIDIA GPUs are built for. NVIDIA Tensor Cores accelerate the dense parallel math, and the CUDA software stack lets the model run efficiently from day one without bespoke tuning. In short, the model's design plays directly to the GPU' s strengths.

That shows up in the numbers. DiffusionGemma delivers 1,000 tokens/sec on a single NVIDIA H100 Tensor Core GPU, 150 tokens/sec on NVIDIA DGX Spark and up to 2,000 tokens/sec on NVIDIA DGX Station - roughly 4x faster than an equivalent autoregressive model running in the same single-user regime.

That advantage holds across NVIDIA's full lineup, running:

Locally on the NVIDIA DGX Spark deskside personal AI supercomputer - powered by the NVIDIA GB10 Grace Blackwell Superchip with 128GB of unified memory - with the preinstalled NVIDIA AI software stack ready for prototyping, fine-tuning and fully local agent workflows.

On NVIDIA RTX PRO 6000 workstations, providing developers, researchers and AI professionals with the headroom to run local low-latency generation and agentic loops as part of a professional workflow.

On DGX Station, delivering best-in-class, local high-speed inference with up to 2,000 tokens/sec for low-latency text generation and agentic loops with 748GB of coherent memory.

On GeForce RTX GPUs, with llama.cpp support coming soon.

Get Started Locally The fastest way to start testing and prototyping the model is through Hugging Face Transformers, which runs DiffusionGemma on a GeForce RTX 5090 or DGX Spark out of the box. For higher-throughput inference, vLLM provides day-zero serving support.

For adapting the model to a specific task or domain, fine-tuning is available through Unsloth and NVIDIA NeMo framework, with ready-made DGX Spark playbooks to get a local environment running quickly. Check out the vLLM playbooks for DGX Spark , RTX PRO and DGX Station.

Try Diffusion Gemma on Hugging Face or test it for free using NVIDIA-hosted application programming interfaces at build.nvidia.com.

Go deeper on the architecture and local deployment by reading the NVIDIA technical blog and the Google DeepMind announcement.

#ICYMI: The Latest From RTX AI Garage NVIDIA researchers released SANA-WM, an open source world model that turns a single image and a camera path into a minute-long, 720p video with precise 6-DoF control. At just 2.6 billion parameters, its distilled version generates a full 60-second clip in 34 seconds on a single NVIDIA GeForce RTX 5090 GPU using the NVFP4 format - delivering up to 36x higher throughput than comparable open models while running on one GPU. Read the paper.

Building Windows agents just got a full toolset - NVIDIA and Microsoft rolled out turnkey agent sandboxing on native Windows - Microsoft eXecution Containers plus the NVIDIA OpenShell runtime - alongside up to 2x faster agentic inference and native Windows support for Hermes Agent.

DGX Spark goes from unboxing to a running agent in minutes - A streamlined NVIDIA NemoClaw install gets developers to a working local agent fast, with Qwen3.6-35B running up to 2.6x faster on vLLM. And the new cluster assistant in NVIDIA Sync links up to four DGX Spark units into one 512GB pool - enough for 400-billion-parameter models.

Plug in to RTX Spark on Fac
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-local-gemma-diffusion/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

11/06/2026

NABs LeGeyt Urges Congress to Limit NFL's Antitrust Exemption

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Fubo Inks New Distribution Agreement with NBCUniversal

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Kiloview to Showcase Broadcast-Grade AV-over-IP Solutions...

Kiloview, a leading innovator in AV-over-IP video solutions, will return to InfoComm 2026 (Booth# N8327) with broadcast-grade AV-over-IP solutions designed for ...

11/06/2026

Australian Games Industry Glossary of Terms

Australian Games Industry Glossary of Terms 10 June 2026 From DAU and EULA to COT and QADE, here's a list of game industry terms, industry jargon and their...

11/06/2026

Berklee's Tonya Butler Named Music Business Educator of the Year

Berklee's Tonya Butler Named Music Business Educator of the Year The Music Business Association honored Butler at its annual Bizzy Awards. June 10, 2026 ...

11/06/2026

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement The five-time Grammy-winning engineer and producer, known for her longstanding...

10/06/2026

SVG Sit-Down: Team Whistle's Joe Caporoso on Building World Cup Content Around Fans, Culture, IRL Experiences

DAZN-owned digital-media company launches three fan-first series leaning into cr...

10/06/2026

Clear-Com Appoints Jason Dino as Southwest Regional Sales Manager

Clear-Com has announced the appointment of Jason Dino as Southwest Regional Sales Manager USA, covering Southern California and the Southwest region. Dino joins...

10/06/2026

Caretta Research: 2026 World Cup Revenue Growth Due to More Matches; Rights Revenue Up 32%

An 11% decrease in number of global broadcast deals reflects the organization...

10/06/2026

Women Without Boundaries Awards Are Back!

The Women Without Boundaries Awards recognize women whose work is advancing the future of media, broadcast, AV, workplace technology, digital experience, and re...

10/06/2026

On Eve of World Cup Kickoff, FIFA and HBS Offer Deep Dive into IBC Operations, Commentary, and Ref Cam

Today is match day minus two for FIFA and HBS. On Thursday, there will be two ma...

10/06/2026

SES Supporting World's Biggest Soccer Tournament Broadcast Distribution Worldwide

SES is supporting broadcast distribution of the world's biggest football tou...

10/06/2026

BirdDog Achieves Full NDI 6.3 Compatibility Across Entire Product Line

NDI has announced that BirdDog has become the first hardware manufacturer to achieve full NDI 6.3 compatibility across its complete lineup of cameras, encoders,...

10/06/2026

Emmy Award-Winning Audio Team To Present at SVG Audio Symposium

Vince Caputo and Scott Carter, winners of the 2026 Sports Emmy for Outstanding Post Produced Audio have been announced as presenters for the 2026 SVG Advanced A...

10/06/2026

FOX One Set to Deliver World Cup in 4K; Personalization via AI Drives Experience

FOX One today unveiled a slate of new product features and enhancements designed to elevate the viewing experience for fans on the official streaming platform o...

10/06/2026

PWHL Scales Broadcast Operation in Season 3, Relying on World-Feed Model and Key Vendors

Primary production partners Dome Productions and Raycom Sports once again played...

10/06/2026

NFL Films Application Deadline for Women in Sports Filmmaking Experienceship, Augusts 26-29 in Mount Laurel, Closes June 18

The Women in Sports Filmmaking Experienceship is an immersive professional devel...

10/06/2026

NBAs In-House Broadcast Ops & Engineering Teams Power Global Finals Coverage From NYC, San Antonio

The league has expanded its HSAN architecture for the NBA Finals to manage more ...

10/06/2026

MoonPay X Games League Winter Draft Set for September 16 at Cosm Los Angeles

The inaugural MoonPay X Games League (XGL) Winter Draft will take place Wednesday, September 16, 2026 at Cosm Los Angeles from 7-9 p.m. PT. The event will strea...

10/06/2026

University of Oklahoma and Learfield Extend 30-Year Partnership, Announce Sooner Evolution Center

The University of Oklahoma (OU) Athletics Department and Learfield have announce...

10/06/2026

VSF Releases RIST Satellite-Hybrid Out-of-Band Specification

The Video Services Forum (VSF) has released TR-06-4 Part 8, a new specification for RIST Satellite-Hybrid: Out-of-Band Method. The specification creates a mecha...

10/06/2026

Riedel Artist Intercom Powers Live Neurovascular Conference in Lisbon

Riedel Communications provided the communications infrastructure for the 14th World Live Neurovascular Conference (WLNC) in Lisbon, supporting live medical proc...

10/06/2026

Sundance Film Festival 101: Films by LGBTQ+ Directors

A still from The Doom Generation by Gregg Araki (Courtesy of Sundance Institute) By Lucy Spicer Have you checked out our Sundance Film Festival 101 list yet...

10/06/2026

GearExpo UK: Interfaces & Mic Preamp Update

Get Hands-On with Interfaces & Mic Preamp Brands If youre after a new interface or preamp, then GearExpo UK is the place to be! Well have a whole host of au...

10/06/2026

MONO Music Conference 2026

November 13-14 2026, The Midway, San Francisco Following their recent rebranding, MONO Music Conference (formerly Music Expo) have officially announced thei...

10/06/2026

ebbandflow launch with deFORM

Debut instrument free for limited time deFORM is the debut release from newly founded developer ebbandflow, and it's being offered as a free download fo...

10/06/2026

Alone Australia Season 4: Meet the Cast

Alone Australia Season 4: Meet the Cast 10 June, 2026 Media releases WATCH THE TRAILER Smash-hit survival series Alone Australia drops its highly anticipa...

10/06/2026

DEADLY THEN, DEADLY NOW, DEADLY ALWAYS: SBS & NITV IGNITE NAIDOC WEEK 2026 WITH 50 YEARS OF DEADLY

DEADLY THEN, DEADLY NOW, DEADLY ALWAYS: SBS & NITV IGNITE NAIDOC WEEK 2026 WITH ...

10/06/2026

Rohde & Schwarz and TRUMPF advance laser-based drone defense with THORIS LCS

Rohde & Schwarz and TRUMPF advance laser-based drone defense with THORIS LCS Rohde & Schwarz is showcasing THORIS at ILA 2026: A sovereign, end to end counter...

10/06/2026

MAHLE and Rohde & Schwarz develop application for sensor testing of modern driver assistance systems

MAHLE and Rohde & Schwarz develop application for sensor testing of modern drive...

10/06/2026

NFVF CALL FOR FUNDING APPLICATIONS: PRODUCTION & DEVELOPMENT 2026/27

Production and Development Funding supports the creation of compelling, commercially viable, artistic and culturally relevant South African screen content. Deve...

10/06/2026

Nielsen launches Four-Screen Ad Deduplication measurement on YouTube campaigns in Japan

Media buyers and sellers can now compare YouTube reach from computer, mobile, an...

10/06/2026

Ecoflow X Launches as Experimentation Arm for Sustainabil...

Accedo, Humans not Robots, and the Institution of Engineering and Technology (IET) have announced the launch of Ecoflow X. Formerly an IBC Accelerator project, ...

10/06/2026

Frequency Appoints James Smith as General Manager - Monet...

Frequency, the engine powering the world's leading streaming television channels, today announced that James Smith has joined the company as General Manager...

10/06/2026

Riedel Artist at the Heart of the 14th World Live Neurova...

At the 14th World Live Neurovascular Conference (WLNC) in Lisbon, Riedel Communications provided the communications infrastructure for live medical procedures s...

10/06/2026

Globecast Unveils Content Exchange Platform Powered by Or...

Globecast, a leading provider of broadcast, media, and entertainment managed services, today announced the launch of its Content Exchange platform powered by Or...

10/06/2026

Venues and integrators shift toward professional recharge...

Klvr will showcase how venues, integrators and production teams are rethinking disposable battery usage at InfoComm 2026 (Las Vegas, June 17-19, booth #N6311). ...

10/06/2026

VSF Releases Specification for RIST Satellite Hybrid Out-...

The Video Services Forum (VSF) has further enhanced the Reliable Internet Streaming Transport (RIST) protocol by incorporating a new feature, RIST Satellite-Hyb...

10/06/2026

Microphone Maker Audix Adds Eric Reese as VP

Share Copy link Facebook X Linkedin Bluesky Email...

10/06/2026

NVIDIA Accelerates Google DeepMind's DiffusionGemma for Local AI

Today, Google DeepMind released DiffusionGemma - an experimental open model built for exceptionally fast text generation. NVIDIA has optimized DiffusionGemma to...

10/06/2026

For Robotaxis, Safety Must Be Built In, Not Bolted On

A car pulls up to the curb. The app says, Your ride is here. No one's in the driver's seat. For people who live in one of the dozens of cities now hos...

10/06/2026

VEON's Banglalink Brings Every World Cup 2026 Match to Football Fans in Bangladesh on Toffee

10 Jun 2026 VEON's Banglalink Brings Every World Cup 2026 Match to Football...

10/06/2026

How to watch every ICC Womens T20 World Cup 2026 match live on Sky Sports

Wednesday 10 June 2026 How to watch every ICC Women's T20 World Cup 2026 match live on Sky Sports Where is the ICC Women's T20 World Cup 2026 availabl...

10/06/2026

PRLA brings first-ever Beautifully Clean Oral Care TV campaign to screens nationwide with Sky

Wednesday 10 June 2026 P RLA brings first-ever Beautifully Clean Oral Care'...

10/06/2026

Sky reveals pulse-pounding first teaser trailer for upcoming crime drama Fightland

Wednesday 10 June 2026 Sky reveals pulse-pounding first teaser trailer for upco...

10/06/2026

Riedel Artist at the Heart of the 14th World Live Neurovascular Conference

Wuppertal June 10, 2026 Riedel Artist at the Heart of the 14th World Live Neurovascular ConferenceAt the 14th World Live Neurovascular Conference (WLNC) in Li...