
NVIDIA today announced at Microsoft Build new AI performance optimizations and integrations for Windows that help deliver maximum performance on NVIDIA GeForce RTX AI PCs and NVIDIA RTX workstations.
Large language models (LLMs) power some of the most exciting new use cases in generative AI and now run up to 3x faster with ONNX Runtime (ORT) and DirectML using the new NVIDIA R555 Game Ready Driver. ORT and DirectML are high-performance tools used to run AI models locally on Windows PCs.
WebNN, an application programming interface for web developers to deploy AI models, is now accelerated with RTX via DirectML, enabling web apps to incorporate fast, AI-powered capabilities. And PyTorch will support DirectML execution backends, enabling Windows developers to train and infer complex AI models on Windows natively. NVIDIA and Microsoft are collaborating to scale performance on RTX GPUs.
These advancements build on NVIDIA's world-leading AI platform, which accelerates more than 500 applications and games on over 100 million RTX AI PCs and workstations worldwide.
RTX AI PCs - Enhanced AI for Gamers, Creators and Developers NVIDIA introduced the first PC GPUs with dedicated AI acceleration, the GeForce RTX 20 Series with Tensor Cores, along with the first widely adopted AI model to run on Windows, NVIDIA DLSS, in 2018. Its latest GPUs offer up to 1,300 trillion operations per second of dedicated AI performance.
In the coming months, Copilot+ PCs equipped with new power-efficient systems-on-a-chip and RTX GPUs will be released, giving gamers, creators, enthusiasts and developers increased performance to tackle demanding local AI workloads, along with Microsoft's new Copilot+ features.
For gamers on RTX AI PCs, NVIDIA DLSS boosts frame rates by up to 4x, while NVIDIA ACE brings game characters to life with AI-driven dialogue, animation and speech.
For content creators, RTX powers AI-assisted production workflows in apps like Adobe Premiere, Blackmagic Design DaVinci Resolve and Blender to automate tedious tasks and streamline workflows. From 3D denoising and accelerated rendering to text-to-image and video generation, these tools empower artists to bring their visions to life.
For game modders, NVIDIA RTX Remix, built on the NVIDIA Omniverse platform, provides AI-accelerated tools to create RTX remasters of classic PC games. It makes it easier than ever to capture game assets, enhance materials with generative AI tools and incorporate full ray tracing.
For livestreamers, the NVIDIA Broadcast application delivers high-quality AI-powered background subtraction and noise removal, while NVIDIA RTX Video provides AI-powered upscaling and auto-high-dynamic range to enhance streamed video quality.
Enhancing productivity, LLMs powered by RTX GPUs execute AI assistants and copilots faster, and can process multiple requests simultaneously.
And RTX AI PCs allow developers to build and fine-tune AI models directly on their devices using NVIDIA's AI developer tools, which include NVIDIA AI Workbench, NVIDIA cuDNN and CUDA on Windows Subsystem for Linux. Developers also have access to RTX-accelerated AI frameworks and software development kits like NVIDIA TensorRT, NVIDIA Maxine and RTX Video.
The combination of AI capabilities and performance deliver enhanced experiences for gamers, creators and developers.
Faster LLMs and New Capabilities for Web Developers Microsoft recently released the generative AI extension for ORT, a cross-platform library for AI inference. The extension adds support for optimization techniques like quantization for LLMs like Phi-3, Llama 3, Gemma and Mistral. ORT supports different execution providers for inferencing via various software and hardware stacks, including DirectML.
ORT with the DirectML backend offers Windows AI developers a quick path to develop AI capabilities, with stability and production-grade support for the broad Windows PC ecosystem. NVIDIA optimizations for the generative AI extension for ORT, available now in R555 Game Ready, Studio and NVIDIA RTX Enterprise Drivers, help developers get up to 3x faster performance on RTX compared to previous drivers.
Inference performance for three LLMs using ONNX Runtime and the DirectML execution provider with the latest R555 GeForce driver compared to the previous R550 driver. INSEQ=2000 representative of document summarization workloads. All data captured with GeForce RTX 4090 GPU using batch size 1. The generative AI extension support for int4 quantization, plus the NVIDIA optimizations, result in up to 3x faster performance for LLMs. Developers can unlock the full capabilities of RTX hardware with the new R555 driver, bringing better AI experiences to consumers, faster. It includes:
Support for DQ-GEMM metacommand to handle INT4 weight-only quantization for LLMs
New RMSNorm normalization methods for Llama 2, Llama 3, Mistral and Phi-3 models
Group and multi-query attention mechanisms, and sliding window attention to support Mistral
In-place KV updates to improve attention performance
Support for GEMM of non-multiple-of-8 tensors to improve context phase performance
Additionally, NVIDIA has optimized AI workflows within WebNN to deliver the powerful performance of RTX GPUs directly within browsers. The WebNN standard helps web app developers accelerate deep learning models with on-device AI accelerators, like Tensor Cores.
Now available in developer preview, WebNN uses DirectML and ORT Web, a Javascript library for in-browser model execution, to make AI applications more accessible across multiple platforms. With this acceleration, popular models like Stable Diffusion, SD Turbo and Whisper run up to 4x faster on WebNN compared to WebGPU and are now available for developers to use. Microsoft Build attendees can learn more about developing on RTX in the Accelerating development on Windows PCs w
North America Stories
01/07/2025
L3Harris Technologies has delivered its second Bombardier Global 6500 missionize...
01/07/2025
MELBOURNE, Fla., July 1, 2025 - L3Harris Technologies (NYSE: LHX) will release its second quarter 2025 financial results before the market opens on Thursday, Ju...
01/07/2025
WASHINGTON The Federal Communications Commission has rejected license challenges to three full-power Baltimore TV stations, renewing the licenses for Chesapeake...
01/07/2025
Dalet has released a new update to its Dalet Flex media workflow platform, introducing powerful new capabilities to its flagship platform....
01/07/2025
NEWARK, N.J. With the FIFA Club World Cup final rapidly approaching and the start of the 2026 FIFA World Cup now less than a year away, interest in soccer in th...
01/07/2025
COW Job Listing: Laser Engraved Logo
Brie Clayton July 1, 2025
0 Comments
Laser engraved logo
July 1, 2025Steffie Beltt's Mi Amor Music Video Cr...
01/07/2025
LeGrow.Studio Builds Broadcast Hub for ePro League
Brie Clayton July 1, 2025
0 Comments
TEM Constellation HD and BlackmagicURSA Broadcast G2 drive liv...
01/07/2025
Beeble launches SwitchLight 2.0, bringing AI-powered relighting to any footage
Brie Clayton July 1, 2025
0 Comments
New AI model delivers full-scene p...
01/07/2025
Steffie Beltt's Mi Amor Music Video Created with Blackmagic Design
Brie Clayton July 1, 2025
0 Comments
Blackmagic PYXIS 6K digital film camera...
01/07/2025
WASHINGTON The Federal Communications Commission's Enforcement and Media Bureaus have entered into a consent decree with Sinclair to resolve a variety of in...
01/07/2025
DENVER Low-power television (LPTV) station owners looking to navigate the complexities of selling their assets in todays dynamic media environment are invited t...
01/07/2025
NASA announced today that live programming from its NASA+ channel will be available on Netflix starting sometime this summer....
01/07/2025
WASHINGTON Federal Communications Commission Chair Brendan Carr has appointed Katie McAuliffe to serve as policy advisor in his office....
01/07/2025
MOUNTAIN VIEW, Calif. Alphabet's GFiber, the broadband provider formerly known as Google Fiber, has announced that it recently worked with Nokia to demonstr...
01/07/2025
NEW YORK, N.Y. DoubleVerify (DV) has announced the launch of DV Authentic Attention for Social. The product will first launch with Snap, the owner of Snapchat....
01/07/2025
WASHINGTON The Federal Communications Commission has rejected license challenges to three full-power Baltimore TV stations and agreed to renew the license for C...
01/07/2025
Compact new converter lets users capture live NDI and streaming sources into software over a USB interface
Video interface and IP workflow innovator Magewell ...
01/07/2025
Disguise, the award-winning tech company driving visuals for Broadway and West End hits including Redwood, Stranger Things: The First Shadow and Disney's Fr...
01/07/2025
Historic appointment ushers in unified leadership for WRAL-TV, New Media, and Digital Solutions
RALEIGH, N.C. - 6-27-25 - Capitol Broadcasting Company is prou...
01/07/2025
SVG New Sponsor Spotlight: Spyrosoft's Jonathan Witty on Futher Business Exp...
01/07/2025
The NIL Effect: How the Shifting Business of College Athletics Impacts On-Campus...
01/07/2025
Cherry on the cake: Inside the technical production of the Tour de France 2025 w...
01/07/2025
NESN's Josh Jun on Keeping Alternative Broadcasts Fresh for Boston Red Sox, ...
01/07/2025
TNT Sports Races Back Into NASCAR With Fresh Look, Familiar Edge Production approach emphasizes creative freedom, bold storytelling, casual-fan-friendly tone B...
01/07/2025
Back to All News
Watch: A Line Crossed: Netflix Drops Explosive Trailer for New...
01/07/2025
Tyngsboro, Mass. - June 30, 2025 - The 2025 ACM Annual Conference at Boston University was an energizing and inspiring gathering of community media professional...
01/07/2025
In many parts of the world, including major technology hubs in the U.S., there's a yearslong wait for AI factories to come online, pending the buildout of n...
30/06/2025
The Artemis II Space Launch System core stage is integrated with the solid rocket boosters inside High Bay 3 of the Vehicle Assembly Building at NASAs Kennedy S...
30/06/2025
RALEIGH, N.C. Capitol Broadcasting Co. has named Heather Gray vice president and general manager of WRAL-TV and WRAZ-TV here....
30/06/2025
The Virginia Association of Broadcasters has recognized Bill Sewell, Director of Engineering at WTKR & WGNT in Norfolk, Va. as the recipient of the 2025 J.J. Fr...
30/06/2025
The Society of Broadcast Engineers said its annual member drive resulted in the recruitment of 49 individual members....
30/06/2025
BURLINGTON, Mass. Avid today released its fully integrated news platform, uniting MediaCentral and Wolftech News in a single newsroom solution, and will demonst...
30/06/2025
WASHINGTON The Federal Communication's Enforcement and Media Bureaus have entered into a Consent Decree with Sinclair Broadcast Group to resolve a variety o...
30/06/2025
Berklee at Umbria Jazz Clinics to Host 40th Anniversary Concert The celebration will be held on July 10 in Perugia, Italy.
By
Colette Greenstein
June 30, 202...
30/06/2025
PremiumBeat Tips and Tricks
Brie Clayton June 30, 2025
0 Comments
When editing to impress, you'll need quality music, and if your studio happens t...
30/06/2025
Back to All News
Bel n Cuesta and Karra Elejalde Star in El ni o, the New Film ...
30/06/2025
Back to All News
A New Dangerous Troll Awakens: Netflix Unleashes Teaser for Troll 2Play Video
Play Video
Entertainment
30 June 2025
GlobalNorwayDenmarkSwe...
29/06/2025
Back to All News
A Secret Society, Ritualistic Killings, and a Century-Old Curs...
28/06/2025
WASHINGTON In a press conference following the Federal Communications Commission's May Open Meeting, Chair Brendan Carr promised the agency would move rapid...
28/06/2025
STAMFORD, Conn. Charter Communications has awarded $1.1 million in Spectrum Digital Education grants to 55 nonprofit organizations that work to expand access to...
28/06/2025
LAKE FOREST, Calif. June 19, 2025
What's New:
Sonnet Technologies today announced the certification of its Echo 20 Thunderbolt 4 SuperDock as an Engin...
28/06/2025
MASV (massive.io), the fastest and most reliable large file transfer platform for media professionals, has been named an IDC Innovator in the IDC Innovators: Me...
28/06/2025
Grass Valley today announced that TV SKYLINE GmbH, one of Europe's top mobile production providers, has expanded its camera inventory with 30 LDX 135 UHD/HD...
28/06/2025
AgileTV, a European leader in TV and video technology solutions, signed an agreement with Austrian telco LIWEST to develop and implement its TV service in Austr...
28/06/2025
The 48th Annual Indian National Finals Rodeo Shot with Blackmagic PYXIS 6K
Brie Clayton June 27, 2025
0 Comments
Filmmaker Cameron Mackey relied on Bl...
28/06/2025
Social, Streaming Don't Compete, They Compliment
Andy Marken June 27, 2025
0 Comments
I think we've all arrived at a very special place. Spir...
28/06/2025
Blackmagic Design Captures Filipino Rock Band Drama Singtala
Brie Clayton June 27, 2025
0 Comments
Blackmagic URSA Mini Pro 12K and DaVinci Resolve St...
28/06/2025
Enhance Videos Faster with Aiarty Video Enhancer - Offline, Sharp, and Natural
Brie Clayton July 1, 2025
0 Comments
If you've used AI video tools ...
27/06/2025
By Jessica Herndon
One of the most exciting things about the Sundance Film Fest...
27/06/2025
WASHINGTON The Federal Communications Commission has set deadlines for comments to a notice of proposed rulemaking (NPRM) to codify certain foreign ownership re...