Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

Most recent headlines

27/03/2025

Sundance Institute Announces Boulder, Colorado, as the New Home for the Sundance Film Festival Beginning in 2027

Boulders Vibrant Community of Artists, Enthusiastic Audiences, and Breathtaking ...

27/03/2025

The Spotify Partner Program Expands To Nine New Markets, Giving More Creators New Ways To Monetize Their Content

Since announcing the Spotify Partner Program last fall, we've heard from cre...

27/03/2025

Initial Modem Setup Guide

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

Incoming and Outgoing Data Socket Connection using a Siretta Modem

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

How to Replace CSD using SirettaLINK

Applicable Products Part number Description SL500-LTE1 (EU) SL500-LTE1 (EU) Low Power LTE Cat 1 Serial Gateway SL500-LTEM (GL) SL500-LTEM (GL) Low Powe...

27/03/2025

Request Time from the Network using GPRS

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

Circuit Switched vs Packet-Switched Networks

Circuit switched networks physically connect two endpoints using a communication channel through the network. This utilises a fixed bandwidth for the session wh...

27/03/2025

Modem Troubleshooting

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

Standard Modem COM Port Settings

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

Dial-up Networking using a Siretta Industrial Modem

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

GPS Latitude Longitude Conversion Guide for Google Maps

Applicable Products Part number Description ZETA-GEP-LTE4 (EU) Low Power LTE Cat 4 European Modem with GPIO and GNSS ZETA-G-GPRS Entry Level GPRS Modem...

27/03/2025

Ultra Low Power Operation with a Siretta NLP Modem

Applicable Products Part number Description ZETA-NLP-LTE1 (EU) Ultra Low Power European LTE Cat 1 Modem ZETA-NLP-LTEM (GL) Ultra Low Power Global LTE C...

27/03/2025

POST Magazine News: MTI Film adds automation improvements to dailies & restoration tools at NAB 2025 in Las Vegas

POST Magazine News: MTI Film adds automation improvements to dailies & restorati...

27/03/2025

postPerspective News: MTI Film Updates Cortex and DRS Nova With MTai Technology in v6.0

postPerspective News: MTI Film Updates Cortex and DRS Nova With MTai Technology ...

27/03/2025

Help Grow the California Production Coalition

The California Production Coalition (CPC) is bringing together businesses and organizations from across the entertainment industry to protect and strengthen fil...

27/03/2025

HPA Award Winner: Damian McDonnell finishing colourist on Time Bandits

Creating not just a whole new fantasy adventure world but one fantasy adventure world per episode was the herculean task set by Taika Waititi, Iain Morris and J...

27/03/2025

Clear-Com Enhances Abilene Christian University's Campus-Wide Production Communications

eds3_5_jq(document).ready(function($) { $(#eds_sliderM519).chameleonSlider_2_1({...

27/03/2025

Join Calrec at MPTS 2025

Join Calrec at MPTS 2025 | May 14 -15 | Stand A40 | Olympia, London We're looking forward to meeting up with customers and partners at this year's Media...

27/03/2025

Nielsen and Acxiom Collaborate to Bring Increased Connectivity to Advanced Audiences

First of its kind ID-based, advanced audience integration NEW YORK - March 27, ...

27/03/2025

NAB Show 2025 Exhibitor Insight: Lawo

TV Tech: What do you anticipate to be the most significant technology trends at the 2025 NAB Show?...

27/03/2025

Diversified introduces Atlas Orchestrate Cloud Deployment Platform

PLANO, Texas Diversified has announced the launch of Atlas Orchestrate, a new platform that is designed to eliminate the complexity of cloud deployment by enabl...

27/03/2025

Pliant Technologies' Latest Wireless Intercom Solutions Now Shipping

Pliant Technologies' Latest Wireless Intercom Solutions Now Shipping Brie Clayton March 27, 2025 0 Comments Brand to Highlight Variety of Newly Av...

27/03/2025

Golaem crowd tools added to Autodesk Media & Entertainment Collection

Golaem crowd tools added to Autodesk Media & Entertainment Collection Brie Clayton March 27, 2025 0 Comments Dune: Part Two made with Autodesk Maya. I...

27/03/2025

Marshall Announces RCP Plus Controller at NAB 2025

Marshall Announces RCP Plus Controller at NAB 2025 Brie Clayton March 27, 2025 0 Comments Mix and Match Different Cameras Without Shifting Modes, Usin...

27/03/2025

DHD to Highlight RX2, SX2 and TX2 Audio Mixers at the 2025 NAB Show, Las Vegas

DHD to Highlight RX2, SX2 and TX2 Audio Mixers at the 2025 NAB Show, Las Vegas Brie Clayton March 27, 2025 0 Comments Accompanying image shows a DHD S...

27/03/2025

DOGE Grills Public Broadcasters on Capitol Hill

Leaders of the Public Broadcasting Service and National Public Radio appeared before the DOGE subcommittee on Wednesday in a congressional hearing in which th...

27/03/2025

Sony unveils 70 per cent smaller Venice Extension System Mini

The Mini has a footprint equal to the size of an average smartphone and can be placed side by side for use in stereoscopic or volumetric video By Jenny Priestl...

27/03/2025

Mr Bates vs The Post Office, Baby Reindeer, The Traitors among BAFTA Craft Awards nominations

The teams behind House of the Dragon, Masters of the Air, Silo and The Lord of t...

27/03/2025

Rise Bursary to boost opportunity for next generation of women in media tech

The initiative aims to help young women pursue higher education in broadcast and media technology, offering financial assistance and networking opportunities B...

27/03/2025

MediaForEurope launches 1.3 billion bid for ProSieben

We believe that ProSiebenSat.1 needs a strong shareholder that can provide expertise and experience in the industry, making an active contribution to its growth...

27/03/2025

OWC Launches OWC Jellyfish B24 and OWC Jellyfish S24 Storage Solutions

OWC Launches OWC Jellyfish B24 and OWC Jellyfish S24 Storage Solutions Brie Clayton March 27, 2025 0 Comments With Blazing-Fast Performance, Massive S...

27/03/2025

ESPN Platforms Scores Strong Ratings for NCAA Women's Tournament

Women's college basketball continued to produce hefty audiences in the first and second rounds of the NCAA Women's Tournament, with ESPN platforms repor...

27/03/2025

Sony To Feature New VENICE Extension System Mini At 2025 NAB Show

SYDNEY Sony Electronics will showcase its newly announced VENICE Extension System Mini (CBK-3621XS), the latest addition to its CineAlta lineup, during the 2025...

27/03/2025

The WNET Group Names Dana Roberson GM, Thirteen and Production Operations

NEW YORK The WNET Group, the parent company of the PBS station Thirteen, has announced the appointment of Dana Roberson to general manager, Thirteen and product...

27/03/2025

NBCU to Launch 40 FAST Channels on LG Channels

NEW YORK NBCUniversal and LG Electronics (LG) have announced a deal that will make a wide variety of content available from NBCU on LG smart TVs and add e-comme...

27/03/2025

DirecTV Joins IBCAP Anti-Piracy Group

DENVER The International Broadcaster Coalition Against Piracy (IBCAP) has announced the addition of DirecTV, a leading video distribution company in the U.S., a...

27/03/2025

VEON Returns to Capital Markets with Successful Syndication of USD 210 Million Term Loan

27 Mar 2025 VEON Returns to Capital Markets with Successful Syndication of USD ...

27/03/2025

Telos Alliance & Syndicate Of Sounds Have You Surrounded at NAB 2025

Telos Alliance & Syndicate Of Sounds Have You Surrounded at NAB 2025 Search Cleveland, Ohio (March 27, 2025) - Telos Alliance and Syndicate of Sounds are...

27/03/2025

Grass Valley Technology Allows F&F GTX-21 Truck To Make Leap to ST 2110 IP

Grass Valley Technology Allows F&F GTX-21 Truck To Make Leap to ST 2110 IP The new mobile unit is scheduled for completion later this year By Ken Kerschbaumer,...

27/03/2025

Inside the MLB Media Center: How Baseball Amplifies Success and Charts Its Content Future

Inside the MLB Media Center: How Baseball Amplifies Success and Charts Its Conte...

27/03/2025

MLB Opening Day 2025: MLB Local Media Rolls Out WireCam, UmpCam, Shallow-Depth-of-Field RF for Five Teams

MLB Opening Day 2025: MLB Local Media Rolls Out WireCam, UmpCam, Shallow-Depth-o...

27/03/2025

MLB Opening Day 2025: YES Network's Matt Duarte on Populating Gotham Sports App With Yankees-Centric Content

MLB Opening Day 2025: YES Network's Matt Duarte on Populating Gotham Sports ...

27/03/2025

MLB Opening Day: ESPN Gets REMI, REMCO Models in a Groove for Sunday Night Baseball' Schedule

MLB Opening Day: ESPN Gets REMI, REMCO Models in a Groove for Sunday Night Base...

27/03/2025

Rohde & Schwarz pushes technological boundaries for Special Forces at SOFINS

Rohde & Schwarz pushes technological boundaries for Special Forces at SOFINS Rohde & Schwarz enhances Special Forces tactical supremacy with cutting-edge EMS ...

27/03/2025

Rohde & Schwarz to host High-Speed Digital Test Forum 2025

Rohde & Schwarz to host High-Speed Digital Test Forum 2025 With digitalization becoming omnipresent in multiple industry sectors, it is essential that design ...

27/03/2025

Haivision Unveils Falkon X2: Pushing the Boundaries of 5G Video Transmission for Live Broadcasting

Haivision Unveils Falkon X2: Pushing the Boundaries of 5G Video Transmission for...

27/03/2025

Live Microphone Kit tested at The Belonging Co.

According to Audio Director Brenton Miles, the kit delivered noticeable improvements in sound clarity and reduced the need for extensive EQ adjustments - stream...

27/03/2025

Dalet Focuses On Fast Deployment, Accelerated ROI, Setting Direction for Lasting Change at 2025 NAB Show

Dalet, a leading technology and service provider for media-rich organizations, t...

27/03/2025

RT Update: Voluntary Exit Programme

In an email to RT staff this evening, RT Director-General, Kevin Bakhurst confirmed as follows: Dear colleagues,...