
If optimized AI workflows are like a perfectly tuned orchestra - where each component, from hardware infrastructure to software libraries, hits exactly the right note - then the long-standing harmony between NVIDIA and Microsoft is music to developers' ears.
The latest AI models developed by Microsoft, including the Phi-3 family of small language models, are being optimized to run on NVIDIA GPUs and made available as NVIDIA NIM inference microservices. Other microservices developed by NVIDIA, such as the cuOpt route optimization AI, are regularly added to Microsoft Azure Marketplace as part of the NVIDIA AI Enterprise software platform.
In addition to these AI technologies, NVIDIA and Microsoft are delivering a growing set of optimizations and integrations for developers creating high-performance AI apps for PCs powered by NVIDIA GeForce RTX and NVIDIA RTX GPUs.
Building on the progress shared at NVIDIA GTC, the two companies are furthering this ongoing collaboration at Microsoft Build, an annual developer event, taking place this year in Seattle through May 23.
Accelerating Microsoft's Phi-3 Models Microsoft is expanding its family of Phi-3 open small language models, adding small (7-billion-parameter) and medium (14-billion-parameter) models similar to its Phi-3-mini, which has 3.8 billion parameters. It's also introducing a new 4.2-billion-parameter multimodal model, Phi-3-vision, that supports images and text.
All of these models are GPU-optimized with NVIDIA TensorRT-LLM and available as NVIDIA NIMs, which are accelerated inference microservices with a standard application programming interface (API) that can be deployed anywhere.
APIs for the NIM-powered Phi-3 models are available at ai.nvidia.com and through NVIDIA AI Enterprise on the Azure Marketplace.
NVIDIA cuOpt Now Available on Azure Marketplace NVIDIA cuOpt, a GPU-accelerated AI microservice for route optimization, is now available in Azure Marketplace via NVIDIA AI Enterprise. cuOpt features massively parallel algorithms that enable real-time logistics management for shipping services, railway systems, warehouses and factories.
The model has set two dozen world records on major routing benchmarks, demonstrating the best accuracy and fastest times. It could save billions of dollars for the logistics and supply chain industries by optimizing vehicle routes, saving travel time and minimizing idle periods.
Through Azure Marketplace, developers can easily integrate the cuOpt microservice with Azure Maps to support teal-time logistics management and other cloud-based workflows, backed by enterprise-grade management tools and security.
Optimizing AI Performance on PCs With NVIDIA RTX The NVIDIA accelerated computing platform is the backbone of modern AI - helping developers build solutions for over 100 million Windows GeForce RTX-powered PCs and NVIDIA RTX-powered workstations worldwide.
NVIDIA and Microsoft are delivering new optimizations and integrations to Windows developers to accelerate AI in next-generation PC and workstation applications. These include:
Faster inference performance for large language models via the NVIDIA DirectX driver, the Generative AI ONNX Runtime extension and DirectML. These optimizations, available now in the GeForce Game Ready, NVIDIA Studio and NVIDIA RTX Enterprise Drivers, deliver up to 3x faster performance on NVIDIA and GeForce RTX GPUs.
Optimized performance on RTX GPUs for AI models like Stable Diffusion and Whisper via WebNN, an API that enables developers to accelerate AI models in web applications using on-device hardware.
With Windows set to support PyTorch through DirectML, thousands of Hugging Face models will work in Windows natively. NVIDIA and Microsoft are collaborating to scale performance on more than 100 million RTX GPUs.
Join NVIDIA at Microsoft Build Conference attendees can visit NVIDIA booth FP28 to meet developer experts and experience live demos of NVIDIA NIM, NVIDIA cuOpt, NVIDIA Omniverse and the NVIDIA RTX AI platform. The booth also highlights the NVIDIA MONAI platform for medical imaging workflows and NVIDIA BioNeMo generative AI platform for drug discovery - both available on Azure as part of NVIDIA AI Enterprise.
Attend sessions with NVIDIA speakers to dive into the capabilities of the NVIDIA RTX AI platform on Windows PCs and discover how to deploy generative AI and digital twin tools on Microsoft Azure.
And sign up for the Developer Showcase, taking place Wednesday, to discover how developers are building innovative generative AI using NVIDIA AI software on Azure.
North America Stories
23/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/04/2026
Partnership between ARRI and SmallHD brings new Hi-5 license
Configurable monitor overlays adapt to individual working styles
Supported by SmallHD monitors ru...
23/04/2026
Lighting Master Cronenweth ASC brings a unique look to each grid world with the help of Astera
Jeff Cronenweth on the set of Disney's TRON: ARES. Photo by...
23/04/2026
DP Chloe Smolkin ( The Late Show, Kidz Bop ) joins director Danielle Beckmann and writer/actor Raji Ahsan behind the camera for the heartfelt short comedy Dr...
23/04/2026
GeForce NOW is doubling down on what matters most: gamers. This week's upgra...
22/04/2026
Solid State Logic is advancing its System T platform with a stronger focus on IP...
22/04/2026
From immersive audio to live streaming, Dolby Laboratories is focused on the fut...
22/04/2026
Shallow depth-of-field cameras have taken the industry by storm. Its debut a han...
22/04/2026
Riedel Communications (Booth C4908) announced that Eastern Kentucky University (...
22/04/2026
The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...
22/04/2026
Blackmagic Design has announced the URSA Cine 12K LF 100G, a new model in the URSA Cine family adding 100G Ethernet for SMPTE 2110 live production output up to ...
22/04/2026
Celebrating its 40th anniversary, NEP is leaning into hybrid production with the...
22/04/2026
NEP VP, Platform Dan Murphy sits down at the 2026 NAB Show to unpack what NEP P...
22/04/2026
Why Low Band Electronic Warfare Matters...
22/04/2026
The nation unites around football team's World Cup dream
Warsaw, Poland, 20.04.26: Nielsen, a global leader in audience measurement, data, and media intell...
22/04/2026
Warsaw, Poland, 22.04.26: Nielsen, a global leader in audience measurement, data...
22/04/2026
New market intelligence offering gives businesses a clearer view of local consum...
22/04/2026
Glookast Unveils New UX, YouTube and Social Media Connectors, Premiere Panel, Ci...
22/04/2026
Lightcraft Technology to Preview Spark Story at NAB 2026 with Interactive Previs...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/04/2026
Back to All News
This Earth Day, Discover the Sustainable Productions Behind Our Films and Series
Emma Stewart, Ph.D.
Netflix Sustainability Officer
Enterta...
22/04/2026
NVIDIA and Google Cloud have collaborated for more than a decade, co engineering a full stack AI platform that spans every technology layer - from performance o...
21/04/2026
Cloud-based production isnt going anywhere, and BitFire is doubling down by prov...
21/04/2026
The topic of artificial intelligence has a stranglehold on the sports-video-prod...
21/04/2026
5G is still a hot topic in live event production, and this workflow continues to...
21/04/2026
At the 2026 NAB Show, Ed McGivern, GM and President of Appear US, discusses the ...
21/04/2026
Studio Network Solutions (SNS) has announced an on-premise AI suite designed for...
21/04/2026
Suite Studios has integrated its file-streaming technology into the newly announced Frame.io Drive, a desktop application from Adobe company Frame.io. The colla...
21/04/2026
Net Insight has integrated InSync Technology's FrameFormer into the Nimbra E...
21/04/2026
Fox Sports has selected Appear as a technology partner to support the next phase...
21/04/2026
Diversified has appointed Tyler Affolter as Chief Revenue Officer (CRO) to lead the company's commercial organisation. The appointment follows the firm'...
21/04/2026
Layercake has formalised the integration of Bitmovin's video streaming infra...
21/04/2026
The International Judo Federation (IJF) has extended its distribution partnershi...
21/04/2026
Glookast has launched the Cinnafilm Tachyon plugin for its Media Producer and Me...
21/04/2026
Eutelsat has entered into an agreement with Cadena Tres, a division of Grupo Ima...
21/04/2026
Dolby Laboratories and TV Azteca have partnered to introduce Dolby Atmos immersive audio to free-to-air television broadcasts. The implementation utilises the A...
21/04/2026
FOX Entertainment partnered with Verizon to overcome significant production hurd...
21/04/2026
Osprey Video has announced its technology showcase for the NAB Show 2026, highli...
21/04/2026
Riedel Communications (Booth C4908) introduced a range of new solutions at NAB S...
21/04/2026
The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...
21/04/2026
Blackmagic Design has announced the URSA Cine Immersive 100G, an immersive cinem...
21/04/2026
Clark Wire & Cable is continuing its evolution from cable supplier to full-scale solutions partner for broadcast and live production. At the 2026 NAB Show, we s...