
NVIDIA today announced at Microsoft Build new AI performance optimizations and integrations for Windows that help deliver maximum performance on NVIDIA GeForce RTX AI PCs and NVIDIA RTX workstations.
Large language models (LLMs) power some of the most exciting new use cases in generative AI and now run up to 3x faster with ONNX Runtime (ORT) and DirectML using the new NVIDIA R555 Game Ready Driver. ORT and DirectML are high-performance tools used to run AI models locally on Windows PCs.
WebNN, an application programming interface for web developers to deploy AI models, is now accelerated with RTX via DirectML, enabling web apps to incorporate fast, AI-powered capabilities. And PyTorch will support DirectML execution backends, enabling Windows developers to train and infer complex AI models on Windows natively. NVIDIA and Microsoft are collaborating to scale performance on RTX GPUs.
These advancements build on NVIDIA's world-leading AI platform, which accelerates more than 500 applications and games on over 100 million RTX AI PCs and workstations worldwide.
RTX AI PCs - Enhanced AI for Gamers, Creators and Developers NVIDIA introduced the first PC GPUs with dedicated AI acceleration, the GeForce RTX 20 Series with Tensor Cores, along with the first widely adopted AI model to run on Windows, NVIDIA DLSS, in 2018. Its latest GPUs offer up to 1,300 trillion operations per second of dedicated AI performance.
In the coming months, Copilot+ PCs equipped with new power-efficient systems-on-a-chip and RTX GPUs will be released, giving gamers, creators, enthusiasts and developers increased performance to tackle demanding local AI workloads, along with Microsoft's new Copilot+ features.
For gamers on RTX AI PCs, NVIDIA DLSS boosts frame rates by up to 4x, while NVIDIA ACE brings game characters to life with AI-driven dialogue, animation and speech.
For content creators, RTX powers AI-assisted production workflows in apps like Adobe Premiere, Blackmagic Design DaVinci Resolve and Blender to automate tedious tasks and streamline workflows. From 3D denoising and accelerated rendering to text-to-image and video generation, these tools empower artists to bring their visions to life.
For game modders, NVIDIA RTX Remix, built on the NVIDIA Omniverse platform, provides AI-accelerated tools to create RTX remasters of classic PC games. It makes it easier than ever to capture game assets, enhance materials with generative AI tools and incorporate full ray tracing.
For livestreamers, the NVIDIA Broadcast application delivers high-quality AI-powered background subtraction and noise removal, while NVIDIA RTX Video provides AI-powered upscaling and auto-high-dynamic range to enhance streamed video quality.
Enhancing productivity, LLMs powered by RTX GPUs execute AI assistants and copilots faster, and can process multiple requests simultaneously.
And RTX AI PCs allow developers to build and fine-tune AI models directly on their devices using NVIDIA's AI developer tools, which include NVIDIA AI Workbench, NVIDIA cuDNN and CUDA on Windows Subsystem for Linux. Developers also have access to RTX-accelerated AI frameworks and software development kits like NVIDIA TensorRT, NVIDIA Maxine and RTX Video.
The combination of AI capabilities and performance deliver enhanced experiences for gamers, creators and developers.
Faster LLMs and New Capabilities for Web Developers Microsoft recently released the generative AI extension for ORT, a cross-platform library for AI inference. The extension adds support for optimization techniques like quantization for LLMs like Phi-3, Llama 3, Gemma and Mistral. ORT supports different execution providers for inferencing via various software and hardware stacks, including DirectML.
ORT with the DirectML backend offers Windows AI developers a quick path to develop AI capabilities, with stability and production-grade support for the broad Windows PC ecosystem. NVIDIA optimizations for the generative AI extension for ORT, available now in R555 Game Ready, Studio and NVIDIA RTX Enterprise Drivers, help developers get up to 3x faster performance on RTX compared to previous drivers.
Inference performance for three LLMs using ONNX Runtime and the DirectML execution provider with the latest R555 GeForce driver compared to the previous R550 driver. INSEQ=2000 representative of document summarization workloads. All data captured with GeForce RTX 4090 GPU using batch size 1. The generative AI extension support for int4 quantization, plus the NVIDIA optimizations, result in up to 3x faster performance for LLMs. Developers can unlock the full capabilities of RTX hardware with the new R555 driver, bringing better AI experiences to consumers, faster. It includes:
Support for DQ-GEMM metacommand to handle INT4 weight-only quantization for LLMs
New RMSNorm normalization methods for Llama 2, Llama 3, Mistral and Phi-3 models
Group and multi-query attention mechanisms, and sliding window attention to support Mistral
In-place KV updates to improve attention performance
Support for GEMM of non-multiple-of-8 tensors to improve context phase performance
Additionally, NVIDIA has optimized AI workflows within WebNN to deliver the powerful performance of RTX GPUs directly within browsers. The WebNN standard helps web app developers accelerate deep learning models with on-device AI accelerators, like Tensor Cores.
Now available in developer preview, WebNN uses DirectML and ORT Web, a Javascript library for in-browser model execution, to make AI applications more accessible across multiple platforms. With this acceleration, popular models like Stable Diffusion, SD Turbo and Whisper run up to 4x faster on WebNN compared to WebGPU and are now available for developers to use. Microsoft Build attendees can learn more about developing on RTX in the Accelerating development on Windows PCs w
North America Stories
06/02/2026
Appear, which specializes in live production technology, announces the appointme...
06/02/2026
Baller League US announces CBS Sports and its 24/7 soccer streaming channel CBS Sports Golazo Network will air the league's programming in the United States...
06/02/2026
Gravity Media, which concentrates in production, content, media services, and fa...
06/02/2026
The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA), and the European Broad...
06/02/2026
Bitmovin, a provider of video streaming solutions, announces that 1001, an OTT service in Iraq, has chosen the Bitmovin Player to improve its video streaming pe...
06/02/2026
Combate Global and content creator Shane Fazen announce a licensing agreement to distribute the Hispanic-focused franchise's first three live MMA events in ...
06/02/2026
Cisco is powering the invisible backbone of Super Bowl LX at Levi's Stadium as the technology giant delivers secure, high-capacity connectivity for over 70,...
06/02/2026
Over the past decade, the NFL and Amazon Web Services have changed how football analytics are analyzed and presented through Next Gen Stats. There's real-ti...
06/02/2026
In-venue and creative video staffers at the professional and collegiate level ha...
06/02/2026
In-venue and creative video staffers at the professional and collegiate level ha...
06/02/2026
Ratings Roundup is a rundown of recent rating news and is derived from press rel...
06/02/2026
How the podcast-turned-studio-show Boston Has Entered The Chat became an anchor ...
06/02/2026
ORF, the public service broadcaster for Austria, is in Italy for Milano Cortina 2026, ready to bring the country's most popular winter sports direct to view...
06/02/2026
Milano Cortina 2026 is now underway and Austrian public service broadcaster, ORF...
06/02/2026
Warner Bros. Discovery (WBD) has lifted the curtain on its studios in Italy that...
06/02/2026
Milano Cortina marks the first time since London 2012 that NRK has had the full ...
06/02/2026
Winter sports are wildly popular in Norway, with cross-country skiing and biathl...
06/02/2026
Norwegian broadcaster NRK has the free-to-air rights to the Olympics back for th...
06/02/2026
The production of the mega-esports event also leverages facilities at EA headqua...
06/02/2026
Here's a preview of NBC's massive game and pregame production operation as Super Bowl Sunday approaches....
06/02/2026
Despite most never having strapped on skis or skates, Aussies are keen for some ...
06/02/2026
MNC Software, a global leader in network management and operational support systems tailored to the broadcast and media industry, today announced the launch of ...
06/02/2026
The annual Junior Eurovision Song Contest arrived at Tbilisi's Gymnastic Hall in Olympic City, presenting an international stage for young talent with rich,...
06/02/2026
NAB Show 2026 | April 19 22 | Booth # N2471
At this year s NAB Show, Sonnet will showcase new Thunderbolt 5 products, including desktop and rackmount PCIe card...
06/02/2026
The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA), and the European Broad...
06/02/2026
Dalet, a leading technology and service provider for media-rich organizations, today announced a major update to Dalet Flex. Building on the workflow packages a...
06/02/2026
Getting closer to the business through highly respected technology partner
Stand 4P880, ISE 2026, Fira de Barcelona, 3 6 February 2026
Bitfocus is acceleratin...
06/02/2026
Bitmovin, a leading provider of video streaming solutions, has announced that 1001, a premier OTT service in Iraq, has chosen the Bitmovin Player to improve its...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Back to All News
Next on Netflix Thailand 2026: The Widest Variety of Thai Stor...
06/02/2026
How invisible vaccine scaffolding boosts HIV immune response Scripps Research scientists designed a DNA scaffold that carries HIV vaccine proteins into the bo...
05/02/2026
Three examples of how wireless microphones are deployed to bring fans in deep an...
05/02/2026
Broadcast coverage will include 25 cameras distributed around the venues, including to some athletes; Galaxy AI Interpreter will also be deployed
The Opening C...
05/02/2026
Kiswe has partnered with the Mountain West Conference to power the next iteratio...
05/02/2026
NBCUniversal and Roku announce the launch of the 2026 NBC Winter Olympics Experience, a destination delivering NBCUniversal's comprehensive CTV coverage of ...
05/02/2026
Vizrt, which specializes in live production technology as well as transforming v...
05/02/2026
Canon USA has launched the RF7-14mm F2.8-3.5 L fisheye STM zoom lens and the RF14mm F1.4 L VCM prime lens. Building on Canon's legacy of innovative optics, ...
05/02/2026
The Paul E. Tsongas Center at UMass Lowell in Massachusetts has chosen Ikegami cameras for incorporation into its broadcast-quality television production facili...
05/02/2026
Once again, service members and Veterans worldwide will enjoy free access to NBC...
05/02/2026
Advanced Systems Group, LLC (ASG), a technology and services provider for media ...
05/02/2026
Broadcast Management Group (BMG) is strengthening its leadership team to support...
05/02/2026
NBC Sports selects Comcast Technology Solutions (CTS) to provide multiscreen vid...
05/02/2026
AIM Sports Group, a sports enterprise dedicated to elevating youth athletics thr...
05/02/2026
Designed for efficient use of shared services and resources, the home of OBS pro...
05/02/2026
The Yankees fan from Connecticut is executive producer of BTN StudentU for the Wolverines
In the live-sports-video industry, the future is bright. Our series S...
05/02/2026
In an Olympic first, the ceremony will be held in four locations simultaneously...