
NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.
These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.
To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.
NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.
NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.
NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.
NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.
Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.
In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.
Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.
NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.
Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.
FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.
Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.
For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.
On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.
FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.
AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.
At CES, NVIDIA demonstrated PDF to Podcast
North America Stories
24/11/2025
ROCHESTER, N.Y. Sinclair said it has elevated Sean LaRose, director of sales at WUHF and partner station WHAM here, to vice president and general manager, effec...
24/11/2025
Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copi...
22/11/2025
The deadline for entries for the 2025 Best in Market Awards has been extended to 23:59 PST on November 28, 2025....
22/11/2025
Clear-Com announced the upcoming launch of its 4-Channel HelixNet beltpack, a next-generation advancement of its widely used 2-channel model. The new beltpack...
22/11/2025
Marshall Electronics is showcasing the latest additions to its CV600 Series of PTZ cameras, the CV625 and CV612, which both feature AI track and follow capabili...
22/11/2025
At this year's European Respiratory Society (ERS) Congress, held at the RAI Amsterdam, LiveConnect delivered an ambitious and technically complex live produ...
22/11/2025
Professional Wireless Systems (PWS), a leader in wireless frequency coordination and RF system design, provided a comprehensive wireless gear package and onsite...
22/11/2025
Telestream, a global leader in media workflow technologies, today announced the release of ARGUS v2.3, which introduces Live Look, a powerful new feature that e...
22/11/2025
Peer Software today announced significant advancements across its enterprise data orchestration and analytics platform with new releases of Peer Global File Ser...
22/11/2025
At InterBEE 2025, Atomos announces a major firmware update that brings integrated camera control to the Ninja TX GO and Ninja TX its new CFexpress-based monit...
22/11/2025
Today, AWS announces the general availability of AWS Elemental MediaConnect Router, a new capability that enables broadcasters and content providers to dynamica...
22/11/2025
Rise, the award-winning advocacy group for gender diversity in the broadcast media technology sector, is delighted to announce the winners for this year's R...
22/11/2025
Lightware, industry leader in signal management, is strengthening its Taurus UCX product family with the introduction of the new HC60 lineup. The new product li...
22/11/2025
CARSON, Calif. IDX has introduced the IDX CUE-J Series battery/charger kits, including the CUE-J98, CUE-J150 and CUE-J198....
22/11/2025
The NBA has released encouraging viewing and social media data that the beginnings of its $76 billion deal with NBC/Peacock, Prime Video and ESPN are paying off...
22/11/2025
WASHINGTON The Federal Communications Commission has set deadlines for comments on its newest proposals for NextGen TV, aka ATSC 3.0, with comments due on Jan. ...
22/11/2025
Seeking Advice for a New Opera, Laura Kaminsky Consulted the Experts: Her Studen...
21/11/2025
Platinum White Paper: Appear Shares Why Media Exchange Is the Missing Link in So...
21/11/2025
NWSL Championship 2025: CBS Sports To Deploy Two-Point FlyCam for Match Coverage...
21/11/2025
NWSL Caps 2025 Season With Awards Show, Skills Challenge ProductionsA team of 70 is on the ground in California to produce both eventsBy Mark J Burns, SVG Contr...
21/11/2025
USL and NEP Ready for Largest USL Championship Final Production EverThe broadcast from Tulsa, OK, will air CBS and TUDN on Saturday at 12 p.m. ETBy Jason Dachma...
21/11/2025
With Two New Teams, PWHL Boosts Production Workforce and Central Review for Seas...
21/11/2025
Jared Lank and his mother in the '90s...
21/11/2025
MELBOURNE, Fla., Nov. 21, 2025 - L3Harris Technologies (NYSE: LHX) has announced this year's LHX Excellence Awards, the company's most prestigious recog...
21/11/2025
WASHINGTON The Federal Communications Commission by a 3-0 vote opened a notice of proposed rulemaking (NPRM) to advance Congress's mandate to clear a minimu...
21/11/2025
WASHINGTON The Federal Communications Commission by a 3-0 vote adopted a Notice of Proposed Rulemaking (NPRM) to advance Congress's mandate to clear a minim...
21/11/2025
STAMFORD, Conn. Charter Communications' Spectrum brand has expanded the range of devices that can offer 4K content on the Spectrum TV app to compatible Appl...
21/11/2025
NAPERVILLE, Ill. Media industry employers are continuing their multiyear trend of increasing salaries for all worker segments but lag general industry raises, s...
21/11/2025
WASHINGTON The National Association of Broadcasters said it is accepting nominations for the 2026 NAB Technology Awards, honors that recognize excellence in bro...
21/11/2025
American Amplifier Technologies has released a vector network analysis module....
21/11/2025
The Best Movie Musicals on Every Streaming Platform From Wicked to The Sound of Music, heres where to stream all the classic movie musicals and recent hits on...
20/11/2025
MLB Media-Rights Shakeup: NBC's New Three-Year Deal Covers Sunday Night Bas...
20/11/2025
MLB Media-Rights Shakeup: New Deal Will Bring 30 National Games to ESPN's Li...
20/11/2025
MLB Media-Rights Shakeup: Netflix Lands Opening Night, Home Run Derby, Field of ...
20/11/2025
MLB Media-Rights Shakeup Overview: ESPN, NBCU, Netflix Ink Three-Year DealsESPN gets new 30-game package, MLB.TV; NBC extends Sunday nights; Netflix adds tentpo...
20/11/2025
SVG Students To Watch: Henry Thuss, Indiana UniversityThe Southern California product has his goals set on the front benchBy Brandon Costa, Director of Digital ...
20/11/2025
Done+Dusted's Guy Carrington on Creating the Spectacular League of Legends W...
20/11/2025
FIA Extreme H World Cup Host Broadcaster Aurora Goes Inside the Production of th...
20/11/2025
Platinum White Paper: Amagi Utilizes Cloud Production for Sports Events - Multi-...
20/11/2025
2025 Sports Broadcasting Hall of Fame: Marc Herklotz, Steady Hand Behind the Sce...
20/11/2025
NFL Deep Dive: How 32 Cameras at Each Stadium Drive Virtual Measurement, Boundar...
20/11/2025
Charlie Shackleton attends the 2025 Sundance Film Festival premiere of Zodiac K...
20/11/2025
L3Harris has achieved NSA Cybersecurity Directorate certification for its KSV-650 space hub end cryptographic unit, ensuring secure, adaptable communications fo...
20/11/2025
Left to Right: David Taubman, Regional Managing Director, Central and Eastern Europe; Arek Szalpuk, Poland Sr. Account Manager; Mr. Marcin Wi niewski, President...
20/11/2025
CINCINNATI Oklahoma Community Television (OCT) has selected GatesAir and Triveni Digital as key technology partners in an ATSC 3.0 deployment that establishes a...
20/11/2025
WASHINGTON The Federal Communications Commission has opened a wide-ranging inquiry into the relations between broadcast networks and their affiliates that could...
20/11/2025
NEW YORK Ad spend in the creator economy has more than doubled since 2021 from $13.9B to $29.5B in 2024 and that amount is projected to reach $37 billion in 202...
20/11/2025
Graduate Spotlight: Krysta Mirsik DePuy The educator from Hackettstown, New Jersey, shares how it took her 11 years to find the right graduate program for her...
20/11/2025
LONDON The winners of the Rise Awards 2025, which recognize women and companies whose achievements have stood out in the media technology industry, have been an...
20/11/2025
SANTA MONICA & NEW YORK Lionsgate's Worldwide Television Distribution Group and Debmar-Mercury have launched MovieSphere Gold in more than 30 million homes....