Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

North America Stories

08/05/2025

What to Watch: 6 Sundance Institute-Supported Films by Filipino Directors

A sinister fairy infiltrates a desperate family in Kenneth Dagatan's In My Mother's Skin, which premiered at the 2023 Sundance Film Festival. Photo co...

08/05/2025

Managing the Mission: Teaching Technique to C3ISR Operators

For skyward-bound operators, training focuses on the unique aspects of flying ISR missions, including the management of onboard surveillance equipment and the e...

08/05/2025

Cable Industry Backs Broadcasters' Move to Software-Based EAS

The cable industry has told the Federal Communications Commission it supports the National Association of Broadcasters' proposal to allow broadcasters to us...

08/05/2025

CTA Tells FCC: Dont Mandate ATSC 3.0 Tuners

WASHINGTON The Consumer Technology Association has continued its opposition to mandates requiring that NextGen TV/ATSC 3.0 tuners be included in new TV sets, sa...

08/05/2025

TAG Video Systems Appoints Paul Maroni as Vice President...

TAG Video Systems, the leader in software-based IP end-to-end workflow monitoring, deep probing, and real time visualization, has named Paul Maroni as Vice Pres...

08/05/2025

BroadcastAsia 2025 Showcases Best of British Innovation

This year's UK Pavilion in hall 5, once again managed by Tradefair, will provide visitors with the unique opportunity to discuss and be involved in cutting ...

08/05/2025

Rohde & Schwarz to highlight innovative broadcast technol...

Rohde & Schwarz will showcase its latest energy-efficient transmitters and 5G Broadcast technologies, designed to support network operators and content provider...

08/05/2025

Nexstar Appoints Bill Nardi VP of Station Operations

IRVING, Texas Nexstar Media Group has tapped Bill Nardi as vice president of station operations, responsible for overseeing the day-to-day broadcast operations ...

08/05/2025

LumaTouch Partners With CNN Academy on Training

SEATTLE LumaTouch is partnering with CNN Academy to improve mobile storytelling techniques and support training across all of CNN Academy's training simulat...

08/05/2025

SBE Backs NAB Proposals to Change EAS Rules

WASHINGTON The Society of Broadcast Engineers has filed comments with the Federal Communications Commission that support a proposal by the National Association ...

08/05/2025

OAN to Provide News to VOA, USAGM Networks

Senior adviser to the United States Agency for Global Media Kari Lake has announced that One America News Network (OAN) will provide newsfeed services for fre...

08/05/2025

EdMon Expands as AI-Driven Post Production Workflows Gains Traction in Sweden and Beyond

EdMon Expands as AI-Driven Post Production Workflows Gains Traction in Sweden an...

08/05/2025

Using Luma Mattes in Adobe Premiere Pro

Using Luma Mattes in Adobe Premiere Pro Graham Quince May 7, 2025 0 Comments This very quick tutorial shows you how to take an RGB clip and apply its ...

08/05/2025

OpenDrives Unveils Free Your Data' Initiative with New Astraeus Cloud-Native Data Services Platform

OpenDrives Unveils Free Your Data' Initiative with New Astraeus Cloud-Nativ...

08/05/2025

Student Spotlight: Grigori Balasanyan

Student Spotlight: Grigori Balasanyan The Armenian composer, who was named Boston Conservatory at Berklees 2025 student commencement speaker, talks about his ...

08/05/2025

Tribeca Festival 2025 Unveils New Premieres Spanning Film and Music

May 8th, 2025 Press Materials Available Here Tribeca Festival 2025 Unveils New Premieres Spanning Film and Music Slick Rick's Victory with Idris Elba a...

08/05/2025

Tribeca Festival 2025 Announces Lineup for Inaugural Storytelling Summit

May 8th, 2025 Press Materials Available Here Tribeca Festival 2025 Announces Lineup for Inaugural Storytelling Summit 11-Day Industry Event Launches with Tal...

08/05/2025

SVG Sit-Down: Vizrt's Nicholas Jameson on AI in Workflows, Pushing Boundaries With XR/AR

SVG Sit-Down: Vizrt's Nicholas Jameson on AI in Workflows, Pushing Boundarie...

08/05/2025

Creating Alternative Brand Experiences: Live Sports in the Age of Fortnite, Meta Horizon, and Beyond

Creating Alternative Brand Experiences: Live Sports in the Age of Fortnite, Meta...

08/05/2025

PGA TOUR's David Piccolo: Advanced Graphics and Virtual Production Tools are Elevating Live Golf Coverage

PGA TOUR's David Piccolo: Advanced Graphics and Virtual Production Tools are...

08/05/2025

Tech Focus: Advancing Immersion in Sports Broadcasting with AR and Virtual Production

Tech Focus: Advancing Immersion in Sports Broadcasting with AR and Virtual Produ...

08/05/2025

Now in Production: Comedy Action Film Husbands in Action' Puts Unlikely Allies on a Rescue Mission

Back to All News Now in Production: Comedy Action Film Husbands in Action'...

08/05/2025

Wildfire Prevention: AI Startups Support Prescribed Burns, Early Alerts

Artificial intelligence is helping identify and treat diseases faster with better results for humankind. Natural disasters like wildfires are next. Fires in th...

08/05/2025

Join the Family: GeForce NOW Welcomes 2K's Acclaimed Mafia' Franchise to the Cloud

Calling all wiseguys - 2K's acclaimed Mafia franchise is available to stream...

08/05/2025

LM Studio Accelerates LLM Performance With NVIDIA GeForce RTX GPUs and CUDA 12.8

As AI use cases continue to expand - from document summarization to custom software agents - developers and enthusiasts are seeking faster, more flexible ways t...

07/05/2025

March 2025 Less Time Spent Watching Video

Warsaw, Poland - April 28, 2025 - Nielsen, a global leader in audience measurement, data and analytics, has released its latest March All Screens Video Landscap...

07/05/2025

Studios Delay Moving Films to Streaming to Protect Box Office

LONDON Movie fans hoping to save money by waiting until their favorite new films appear on streaming services will have to wait a bit longer now, according to a...

07/05/2025

Saudi Broadcasting Authority Turns to Grass Valley for Major Tech Upgrade

MECCA, Saudi Arabia Saudi Broadcasting Authority (SBA) has selected Grass Valley to provide a major technology upgrade of its broadcast facility here....

07/05/2025

Sony and Nevion provide guidance on IP network architecture options for live production in new whitepaper

Sony and Nevion provide guidance on IP network architecture options for live pro...

07/05/2025

Media Pioneer Publishing AG Expands Editorial Capacity

Media Pioneer Publishing AG Expands Editorial Capacity Brie Clayton May 7, 2025 0 Comments Pioneer 2 boat production environment powered by Blackmagic...

07/05/2025

Studios Delaying Moving Films to Streaming to Protect Box Office

LONDON Movie fans hoping to save money by waiting until their favorite new films appear on streaming services will have to wait a bit longer now, according to a...

07/05/2025

FCC Seeks Comments on LPTV Adoption of 5G Broadcasting

WASHINGTON The Federal Communications Commission's Media Bureau is seeking public comment on a Petition for Rulemaking from HC2 Broadcasting Holdings asking...

07/05/2025

U.S. Department of Education Terminates CPB's Ready to Learn Grant

WASHINGTON Following a decision by U.S. Department of Education to terminate its 2020-2025 Ready To Learn to the Corporation for Public Broadcasting, CPB has in...

07/05/2025

Tubi Announces New Interactive Ad Formats

NEW YORK Fox's ad-supported streaming Tubi made a series of product and partnership announcements during IAB NewFronts in New York, including the launch of ...

07/05/2025

The GFiber App Gets an Upgrade

MOUNTAIN VIEW, Calif. Google Fiber (GFiber) has announced a redesigned app that the company said will simplify how customers set up service, manage devices, and...

07/05/2025

NASA+ Launches FAST Channel on Prime Video

WASHINGTON NASAs on-demand streaming service, NASA+, has launched a FAST (Free Ad-Supported Television) channel on Prime Video....

07/05/2025

The WNET Group Names Randall T. Decker Senior Director, Technology

NEW YORK The WNET Group, parent company of the PBS station Thirteen, has announced the appointment of Randall T. Decker to senior director, technology, effectiv...

07/05/2025

Peter Barber appointed new Chief Executive Officer at Ato...

Atomos announced an executive leadership transition as the Company continues to evolve and expand its strategic focus. Peter Barber, currently serving as Chie...

07/05/2025

Steadicam Vol Honored with AMPAS 2025 Scientific and Tech...

Steve Wagner, Jerry Holway and Robert Orf at the 2024 Scientific and Technical Awards at the Academy Museum of Motion Pictures on Tuesday, April 29, 2025. The ...

07/05/2025

nxtedition and TASCAM bring precision audio control to TV...

nxtedition, the Swedish company behind the leading integrated platform for news and program production, and TASCAM, the iconic Japanese manufacturer of professi...

07/05/2025

Signiant Brings Real-Time Camera Raw-to-Cloud Innovation...

Signiant is bringing its Camera-Raw-to-Any-Cloud workflow to the UK for the first time at the Media Production & Technology Show 2025 (Booth# M69) with a live d...

07/05/2025

MNC Software extends its global reach with Gencom Technol...

MNC Software Inc., a global leader in software and network solutions tailored to the broadcast and media industry, has appointed Gencom Technology as an officia...

07/05/2025

Rise AV Announces Inaugural UK Cohort for 2025 Mentoring...

Rise AV, the award-winning advocacy group championing gender diversity and professional development in the AV sector, is proud to announce 31 mentor-mentee pair...

07/05/2025

Leader to show Test and Measurement solutions for workflo...

Test & measurement innovator, Leader Electronics of Europe, is to bring a selection of its leading products for IP, SDI and hybrid workflow requirements to this...

07/05/2025

World Skills Cafe Returns at IBC2025 with Expanded Talent...

The Global Media and Entertainment Talent Manifesto announces that the World Skills Caf will return at IBC2025 with an expanded skills and diversity programme,...

07/05/2025

Moments Lab and LucidLink Expand AI-Powered Workflow Inte...

Moments Lab, a leader in AI video discovery, and LucidLink, the pioneer in real-time cloud collaboration, are proud to announce the integration of Moments Lab&#...

07/05/2025

Obvious C Broadcasts Skiing World Cup with Blackmagic Design

Obvious C Broadcasts Skiing World Cup with Blackmagic Design Brie Clayton May 6, 2025 0 Comments Blackmagic Design cameras capture cinematic sports pr...

07/05/2025

Larry Jordan Interviews Signiant's Jon Finegold at NAB 2025

Larry Jordan Interviews Signiant's Jon Finegold at NAB 2025 Brie Clayton May 6, 2025 0 Comments Jon Finegold, Chief Marketing Officer at Signiant,...

07/05/2025

Cadence Taps NVIDIA Blackwell to Accelerate AI-Driven Engineering Design and Scientific Simulation

A new supercomputer offered by Cadence, a leading provider of technology for ele...