Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

04/02/2026

Vinten Extends VEGA Platform with VEGA Lite PTZ Control S...

New control solution applies broadcast robotics workflows to PTZ cameras with third-party integration and upgrade paths Vinten, a global leader in robotic cam...

04/02/2026

Vinten Launches Vega Lite PTZ Control System

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

Chyron to Provide Graphics, Virtual Sets for Winter Olympics Coverage

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

NBC Sports Taps Appear for 2026 Winter Olympics Production

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

Katie Vitolins Announced as Vice President of Alumni Products and Services

Katie Vitolins Announced as Vice President of Alumni Products and Services An alumna and former trustee, Vitolins will lead the relaunch of Berklee's alum...

04/02/2026

RT Statement on Home of the Year and The Great House Revival

Following the passing of our friend and colleague Hugh Wallace, and with the full support of his family, RT will proceed with the broadcast of the new series o...

04/02/2026

Nemotron Labs: How AI Agents Are Turning Documents Into Real-Time Business Intelligence

Editor's note: This post is part of the Nemotron Labs blog series, which exp...

03/02/2026

Tagboard's New Partner Development Kit Turns Complex Third-Party Integrations into Instant Graphics

Tagboard, a modern, interactive graphics system for news, sports, and entertainm...

03/02/2026

SNS Launches New S3-Compatible Cloud Storage Service

Studio Network Solutions (SNS) announces the launch of Trio, a new S3-compatible cloud storage service fully integrated with EVO for media backup, archival, and...

03/02/2026

NEP Group Running at Full-Scale This Month in Support of Major International Events

50 Production Trucks at center of 160 U.S.-based productions...

03/02/2026

Nielsen Launches Co-Viewing Pilot Program to Further Enhance TV Measurement

Nielsen, which specializes in audience measurement, data, and media intelligence, announces that it is piloting a new methodology enhancement to more accurately...

03/02/2026

NBC Sports Relies on Appear X Platform for Winter Games

Appear's X Platform will be used by NBC Sports to deliver video compression, satellite modulation and transport stream aggregation for its production of the...

03/02/2026

Behind The Mic: NBC Sports Unveils Presentation Cast for Super Bowl LX as Part of Legendary February

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

03/02/2026

NHL Network to Take Fans Rinkside as NHL Players Return to Winter Olympic Games

NHL Network announces its comprehensive programming schedule for the upcoming Winter Olympic Games, underscoring the highly anticipated return of NHL players to...

03/02/2026

Canon Lenses Power NBC Coverage of Milano Cortina Winter Games

NBC Sports has chosen Canon to deliver 115 Canon UHD broadcast lenses for its production of the 2026 Winter Olympics and Paralympics. Canon will also send suppo...

03/02/2026

NBC Sports Selects Audio-Technica for Vital Winter Games Role

Audio-Technica equipment will be used by NBC Sports to deliver much of its audio capture requirements across all sporting venues for its production of the 2026 ...

03/02/2026

Cisco VXLAN Makes Olympic Debut for NBC Sports

NBC Sports will use Cisco to deliver AI networking technology for its all-IP production of the 2026 Winter Olympics and Paralympics, including the deployment of...

03/02/2026

Chyron PRIME CG Drives NBC Sports Winter Games Graphics

NBC Sports will utilize Chyron PRIME CG to produce live broadcast graphics to display names, athlete information, scores, statistics, leaderboards, headshots an...

03/02/2026

NBC Sports Relies on Grass Valley for Winter Games Signal Conversion, Routing, Orchestration

NBC Sports will utilize Grass Valley to deliver advanced signal conversion, rout...

03/02/2026

Planar Supplies LED Wall Technology for NBC Sports Winter Games Coverage

NBC Sports will utilize Planar to deliver leading-edge fine pixel pitch LED video wall technology for its production of the 2026 Winter Olympics and Paralympics...

03/02/2026

FOX Sports Wraps Run Airing Westminster Kennel Club Dog Show

Taking place in two venues, the 2026 production celebrates the 150th Anniversary of the Super Bowl of Dogs' When the Westminster Kennel Club Dog Show conc...

03/02/2026

ESPN To Broadcast 2026 NFL Pro Bowl Games Live From San Francisco's Convention Center

For the indoor production, NEP Specialty Capture has mounted a 100-ft. overhead ...

03/02/2026

Inside League One Volleyball: Mobile TV Group Drives Broadcasts via Onsite Support, Remotely From Mountain Media Center

MTVG Edge, the production-services supplier's software-based production solu...

03/02/2026

Inside League One Volleyball: CMO Raquel Braun on Partnering With Omaha Productions, Mobile TV Group

In the league's second year, new partnerships enhance broadcast quality for ...

03/02/2026

NBC Sports Taps Ross Video's Rocket Surgery for Virtual and AR at 2026 Winter Games

Rocket Surgery will showcase the combined creative strength of Ross Video, with ...

03/02/2026

Sony Supplies 100+ Cameras, 500 Monitors, and More for NBC's Winter Olympics Coverage

NBC Sports will utilize Sony Electronics to deliver imaging, monitoring, and tec...

03/02/2026

NBC Sports Selects SMT to Handle Results, Timing, and Production Data Services for Milano-Cortina 2026

SMT will provide TVI (broadcast television interface) support services for figur...

03/02/2026

Spotify Celebrates the 2026 Best New Artist Nominees During a Star-Studded Party in LA

Spotify's annual Best New Artist celebration returned to Los Angeles last ni...

03/02/2026

The smash-hit series The Hospital: In the Deep End returns to shine a spotlight on the real-life heroes of our healthcare system

The smash-hit series The Hospital: In the Deep End returns to shine a spotlight ...

03/02/2026

Introducing: FoMa Antares

FoMa Antares: Redefining Camera Stabilization for Modern ProductionsIn high-end broadcast and cinematic environments, precision, reliability, and flexibility ar...

03/02/2026

Viper Shield Flight Tests Accelerates Delivery with New Digital Electronic Warfare Capability

Viper Shield's robust phase of flight-testing production representative hard...

03/02/2026

L3Harris: A Strategic Partner for National Vigilance and Regional Peace

Countries throughout the Indo-Pacific have made the ability to defend their people and sovereign borders a top priority, but no single nation can monitor every ...

03/02/2026

L3Harris Achieves Record-Breaking Fuzing Delivery Milestone

U.S. Marine loads a round in an 81 mm mortar system during an exercise. (Credit: U.S. Marines)...

03/02/2026

Cobalt Targets Need for Multiple Video Feeds with openGear 9905MPx Multi-Channel Processor

9905MPx Card features unprecedented level of density with four independent signa...

03/02/2026

Cobalt Wins Three 2021 NAB Show Product of the Year Awards

Champaign, IL - November 15, 2021 - Cobalt Digital, announced today that three of its products have received 2021 NAB Show Product of the Year Awards. Cobalt, ...

03/02/2026

LOG4J Does Not Affect Cobalt Digital Products

Cobalt is pleased to confirm that Log4j vulnerability does NOT affect ANY Cobalt products including the HPF-9000 frame. Given the recent concerns regarding wi...

03/02/2026

Cobalt Digital's Launch of +UDX-Dante-1616 Introduces First License-based 12G-SDI Bridge to Dante Audio

Software-based embedding and de-embedding solution will be highlighted at NAB 20...

03/02/2026

Cobalt to Show New Line-up of EO/OE 12G Mini-Convertor BBG Boxes at NAB

Series developed to fill an industry need. Champaign, IL April 13, 2022 Cobalt Digital will introduce a new series of BlueBox Group (BBG) EO/OE 12G mini...

03/02/2026

Cobalt Takes Home Four Awards from NAB 2022

Company receives Best in Shows from Next TV and TVB Europe, and two Product of the Year awards from NAB Champaign, IL - May 6, 2022 - Cobalt Digital Inc. head...

03/02/2026

Cobalt to Participate in Several Events in and Around IBC 2022

IBC Stand 10.B44Journalists: Click to visit Cobalt Dr. Ciro Noronha, CTO, tapped as a panelist and speaker by many - and then there's the party! Amsterda...

03/02/2026

Company Announces Changes in Sales and Engineering Team Just in Time for IBC

IBC Stand 10.B44Journalists: Click to visit Cobalt Growth Drives Cobalt Digital to Promote Dr. Ciro Noronha to CTO, and Appoint Anthony Tan as Director of Sale...

03/02/2026

Cobalt Targets Top Trends at NAB NY with Solutions that Support IP, 4K ST 2110, HDR, RIST, Dante and Internet Security.

IBC Stand 10.B44Journalists: Click to visit Cobalt Dr. Ciro Noronha, CTO, tappe...

03/02/2026

Cobalt is Bringing an Award-Winning Portfolio of New Products to CABSAT 2023

Manufacturer plans also include solutions that provide a path to the cloud and redundancy updatesCABSAT Stand S1-I42Journalists: Click to visit Cobalt Dubai - ...

03/02/2026

Cobalt Announces Presence at ANGA COM 2023

Award-Winning Product Portfolio is Being Presented on Smart Video Group Stand, and Company's Berend Blokzijl is Scheduled to Share RIST Insight on Innovatio...

03/02/2026

COBALT Announces an IBC Line-up that Features IP Products and Extends 12-G Support Options

NEW Location IBC Stand 10.F42 Same Hall, NEW StandJournalists: Click to visi...