Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

North America Stories

26/03/2025

Disguise to Showcase Immersive Sports Programming Technologies at NAB 2025

Disguise to Showcase Immersive Sports Programming Technologies at NAB 2025 Brie Clayton March 26, 2025 0 Comments See Demos Across Partner Booths; Wat...

26/03/2025

Archiware PresentsUpcoming P5 Version 7.4 at NAB Show 2025

Archiware PresentsUpcoming P5 Version 7.4 at NAB Show 2025 Brie Clayton March 26, 2025 0 Comments Archiware, a leading provider of data management sof...

26/03/2025

Avid Redefines Digital-First News Production at NAB Show 2025

Avid Redefines Digital-First News Production at NAB Show 2025 Brie Clayton March 26, 2025 0 Comments Avid and Wolftech Debuting Seamless, Digital-Firs...

26/03/2025

Cine Gear Expo to be Held at Universal Studios Lot Univer...

Los Angeles, California: The industry's premier filmmaking exposition has chosen the world-renowned Universal Studios Lot for Cine Gear Expo LA 2025. Regist...

26/03/2025

MovieLabs Announces Industry Forum Leadership Council and...

MovieLabs, the technology joint venture of the major Hollywood studios today announced the formation of a Leadership Council and the inaugural member companies ...

26/03/2025

Magewell to Showcase Newest Innovations at 2025 NAB Show

Magewell, developer of innovative, high-performance video I/O and IP workflow solutions, will be showcasing its latest innovations at the 2025 NAB Show, Las Veg...

26/03/2025

Breaking the Bottleneck - Building Time to Value Solution...

The media industry's rapid transformation presents organizations with both unprecedented opportunities and challenges. Over the years, Dalet has been at the...

26/03/2025

Pixel Power Announces Brad Rochon as Senior Business Deve...

Pixel Power, a Rohde & Schwarz company, is pleased to announce that Brad Rochon has recently joined the company in the newly created position of Senior Business...

26/03/2025

Avid Redefines Digital-First News Production at NAB Show...

Avid will showcase its most advanced news production solutions, designed to accelerate digital-first, story-driven journalism and broaden audience reach, at NA...

26/03/2025

Net Insight and Globecast power Premier Padels global exp...

Net Insight and Globecast are providing Premier Padel, the leading professional padel tour, with a cutting-edge IP and cloud-based distribution solution. Levera...

26/03/2025

Vizrt delivers modern broadcast MAM demands with Viz One...

Vizrt, the leader in real-time graphics and live production solutions for content creators, today announces Viz One 8, the biggest update to its Enterprise Medi...

26/03/2025

Limecraft Teams with DPG Media to Launch New Capabilities...

With the 2025 NAB Show approaching, Limecraft announces the release of the second in a series of eight major platform updates planned for this year. Building on...

26/03/2025

Kiloview to Showcase Its Innovative AV-over-IP Solutions...

Kiloview has announced to unveil its most complete and lightweight broadcast solutions at NAB 2025 in Las Vegas. Located at SL9413, the company will showcase it...

26/03/2025

Avid Discusses New Leadership, NAB Show Plans

BURLINGTON, Mass. At the 2025 NAB Show, April 6-9 in Las Vegas, Avid will showcase new features to its MediaCentral platform, featuring the latest AI-powered ne...

26/03/2025

YouTube Sees Record Viewing, Beats Disney in TV Viewing Share

NEW YORK YouTube hit record share of monthly TV viewing in February and had the largest share of TV viewing by the major media companies, according to Nielsen&#...

26/03/2025

MovieLabs Announces Industry Forum Leadership Council

SAN FRANCISCO MovieLabs, the technology joint venture of the major Hollywood studios today announced the formation of a Leadership Council and the inaugural mem...

26/03/2025

New Music USA and Berklee Institute of Jazz and Gender Justice Announce 2025 Next Jazz Legacy Cohort

New Music USA and Berklee Institute of Jazz and Gender Justice Announce 2025 Nex...

26/03/2025

Buzz Solutions Uses Vision AI to Supercharge the Electric Grid

The reliability of the electric grid is critical. From handling demand surges and evolving power needs to preventing infrastructure failures that can cause wil...

25/03/2025

Sencore Brings Next-gen IP and Hospitality Solutions to NAB2025

Alongside advancements in orchestration, monitoring and distribution, Sencore, a leading provider of broadcast and media technology solutions, will be showcasin...

25/03/2025

YouTube Achieves Best Monthly Performance to Date and Pulls Ahead in Nielsen's February Media Distributor Gauge

YouTube captures 11.6% of TV viewing in February. FOX climbs into the top three...

25/03/2025

Survey: Consumers Express Growing Dissatisfaction with Streaming Services

The honeymoon between the streaming industry and consumers is definitely over, with a new consumer survey showing deep dissatisfaction. Nearly half (47%) of tho...

25/03/2025

Stream7 Relied on Dejero Technology to Live Stream an International Event

WATERLOO, Canada Stream7, a U.K.-based live event broadcast and production company, relied on Dejero Smart Blending Technology to live stream a historic three-d...

25/03/2025

Vizrt Unveils Containerized Enterprise MAM System

BERGEN, Norway Vizrt has unveiled Viz One 8, the company's largest update to its enterprise Media Asset Management (MAM) system in more than 10 years. The u...

25/03/2025

Telemundo 51 Miami Names Liliet Heredero VP of News & Content

MIRAMAR, Fla. Telemundo 51 Miami/WSCV has named Liliet Heredero vice president of news and content. The multiplatform news veteran with 20 years of experience w...

25/03/2025

Dish Media Partners with Decentrix on New Ad Management Platform

ENGLEWOOD, Colo., Dish Media has launched a new, specially configured Order Management System that was created in collaboration with Decentrix's BIAnalytix ...

25/03/2025

Mediagenix Integrates Spideo Recommendation and Curation Capabilities into Its Title Management Solution

Mediagenix Integrates Spideo Recommendation and Curation Capabilities into Its T...

25/03/2025

Digital Nirvana and Avid Partner to Enhance Media Production Workflows Through Application of Intelligent Metadata

Digital Nirvana and Avid Partner to Enhance Media Production Workflows Through A...

25/03/2025

Chaos Launches Arena, a New Way to Create Virtual Productions Without a Game Engine

Chaos Launches Arena, a New Way to Create Virtual Productions Without a Game Eng...

25/03/2025

Lightware Unveils Taurus UCX-1x1-C40 Docking Station for...

New Taurus Smart Dock Redefines Workplace Connectivity and Enhances Meeting Room Efficiency Lightware, a leader in connectivity solutions for the professional ...

25/03/2025

Techex to Showcase Cutting-Edge Video Over IP Solutions a...

Innovative Technology at the Forefront of Live Broadcast and Cloud Integration Techex, a leading provider of live broadcast infrastructure solutions, will be s...

25/03/2025

Studio Technologies to Display its New Range of ST 2110 S...

Studio Technologies, manufacturer of high-quality audio, video, and fiber-optic solutions, will unveil three new announcer consoles that feature ST 2110 support...

25/03/2025

Professional Wireless Systems PWS Delivers Seamless Audio...

When Univision presented the 37th annual Premio Lo Nuestro at the Kaseya Center in Miami, Professional Wireless Systems (PWS) was onsite to handle frequency coo...

25/03/2025

FOR-A Introduces Software-Defined Live Production Platfor...

FOR-A IMPULSE Includes Essential Production Tasks as Software Nodes within Highly Flexible System as "Station in a Box" Concept In response to the global incre...

25/03/2025

Leader brings clarity to NDI troubleshooting

Test & measurement innovator Leader Instruments Corporation has unveiled a new tool for comprehensive NDI signal troubleshooting and monitoring. The software a...

25/03/2025

Chaos Launches Arena a New Way to Create Virtual Producti...

Today, Chaos launches Chaos Arena, a new tool that saves Hollywood millions of dollars by removing the biggest costs of virtual production. With Arena, artists ...

25/03/2025

LTN and Lumen transform premier sports events with live p...

Joint offering provides tier-one sports leagues and broadcasters including RTL Deutschland with low-latency, high-bandwidth connectivity and real-time content e...

25/03/2025

Overwatch Aero Enhances Drone Video Reliability with Zixi...

In a major advancement for drone-based aerial intelligence, Overwatch Aero has successfully integrated Zixi's cutting-edge technology to ensure reliable rea...

25/03/2025

Dejero powers seamless connectivity for global virtual bl...

EnGo and GateWay deliver uninterrupted two-way connectivity between Argentina and Kenya for historic three-day constitutional meeting The award-winning Dejero ...

25/03/2025

COW Job Listing: Creative Strategist

COW Job Listing: Creative Strategist Brie Clayton March 25, 2025 0 Comments Creative Strategist March 20, 2025COW Job Listing: Freelance BRU / ArGes...

25/03/2025

'IAB State of Data 2025' Report Shows AI Use is Accelerating

NEW YORK AI is set to transform the advertising industry, but the industry still has a ways to go in adoption, according to new research from the IAB....

25/03/2025

Future Of Remote Production Comes Into Focus During the TV Tech Summit

Ask someone involved with live, remote television production what motivates the adoption of a remote integration model (REMI) production, and they'll likely...

25/03/2025

Ookla: T-Mobile Leads in Median Fixed Wireless Internet Access Speeds

A new study of fixed wireless access (FWA) providers and their internet access speeds shows that operators continue to improve their speeds, with T-Mobile's...

25/03/2025

Florida Broadcasters Announce 2025 Hall of Fame Inductees

TALLAHASSEE, Fla. The Florida Association of Broadcasters has announced that it will recognize some of the state's most impactful, inspirational, and influe...

25/03/2025

Root Sports Launches New Seattle Mariners Streaming App

NEW YORK Streaming tech provider ViewLift has announced the launch of the new Root Sports Stream app that deliveres Seattle Mariners games and Root Sports progr...

25/03/2025

Gridiron Graphics: National Broadcasters Discuss the Latest in NFL Graphics Packages

Gridiron Graphics: National Broadcasters Discuss the Latest in NFL Graphics Pack...

25/03/2025

FanDuel Sports Network's Wade Nielsen on Monetizing Digital Content Through Creative Ad Insertion

FanDuel Sports Network's Wade Nielsen on Monetizing Digital Content Through ...

25/03/2025

How ESL FACEIT Group Produced Three Esports Events Across Two Continents in One Epic Weekend

How ESL FACEIT Group Produced Three Esports Events Across Two Continents in One ...

25/03/2025

How Generative AI Allowed the PGA TOUR to Add Written Descriptions to 30,000 Shots

How Generative AI Allowed the PGA TOUR to Add Written Descriptions to 30,000 Sho...

25/03/2025

David Kline to Depart as Chief Technology Officer of News Corp

David Kline to Depart as Chief Technology Officer of News Corp Kline's departure follows a successful five-year tenure that fostered innovation across the ...

25/03/2025

F&F Productions Partners with Grass Valley to Equip New Fully 2110 IP 4K OB Vehicle

The new GTX-21 IP 4K remote production truck will include a suite of Grass Valle...