
In collaboration with OpenAI, NVIDIA has optimized the company's new open-source gpt-oss models for NVIDIA GPUs, delivering smart, fast inference from the cloud to the PC. These new reasoning models enable agentic AI applications such as web search, in-depth research and many more.
With the launch of gpt-oss-20b and gpt-oss-120b, OpenAI has opened cutting-edge models to millions of users. AI enthusiasts and developers can use the optimized models on NVIDIA RTX AI PCs and workstations through popular tools and frameworks like Ollama, llama.cpp and Microsoft AI Foundry Local, and expect performance of up to 256 tokens per second on the NVIDIA GeForce RTX 5090 GPU.
OpenAI showed the world what could be built on NVIDIA AI - and now they're advancing innovation in open-source software, said Jensen Huang, founder and CEO of NVIDIA. The gpt-oss models let developers everywhere build on that state-of-the-art open-source foundation, strengthening U.S. technology leadership in AI - all on the world's largest AI compute infrastructure.
The models' release highlights NVIDIA's AI leadership from training to inference and from cloud to AI PC.
Open for All Both gpt-oss-20b and gpt-oss-120b are flexible, open-weight reasoning models with chain-of-thought capabilities and adjustable reasoning effort levels using the popular mixture-of-experts architecture. The models are designed to support features like instruction-following and tool use, and were trained on NVIDIA H100 GPUs. AI developers can learn more and get started using instructions from the NVIDIA Technical Blog.
These models can support up to 131,072 context lengths, among the longest available in local inference. This means the models can reason through context problems, ideal for tasks such as web search, coding assistance, document comprehension and in-depth research.
The OpenAI open models are the first MXFP4 models supported on NVIDIA RTX. MXFP4 allows for high model quality, offering fast, efficient performance while requiring fewer resources compared with other precision types.
Run the OpenAI Models on NVIDIA RTX With Ollama The easiest way to test these models on RTX AI PCs, on GPUs with at least 24GB of VRAM, is using the new Ollama app. Ollama is popular with AI enthusiasts and developers for its ease of integration, and the new user interface (UI) includes out-of-the-box support for OpenAI's open-weight models. Ollama is fully optimized for RTX, making it ideal for consumers looking to experience the power of personal AI on their PC or workstation.
Once installed, Ollama enables quick, easy chatting with the models. Simply select the model from the dropdown menu and send a message. Because Ollama is optimized for RTX, there are no additional configurations or commands required to ensure top performance on supported GPUs.
Testing OpenAI's open models in Ollama is easy. Ollama's new app includes other new features, like easy support for PDF or text files within chats, multimodal support on applicable models so users can include images in their prompts, and easily customizable context lengths when working with large documents or chats.
Developers can also use Ollama via command line interface or the app's software development kit (SDK) to power their applications and workflows.
Other Ways to Use the New OpenAI Models on RTX Enthusiasts and developers can also try the gpt-oss models on RTX AI PCs through various other applications and frameworks, all powered by RTX, on GPUs that have at least 16GB of VRAM.
NVIDIA continues to collaborate with the open-source community on both llama.cpp and the GGML tensor library to optimize performance on RTX GPUs. Recent contributions include implementing CUDA Graphs to reduce overhead and adding algorithms that reduce CPU overheads. Check out the llama.cpp GitHub repository to get started.
Overall performance of the gpt-oss-20b model on various RTX AI PCs. Windows developers can also access OpenAI's new models via Microsoft AI Foundry Local, currently in public preview. Foundry Local is an on-device AI inferencing solution that integrates into workflows via the command line, SDK or application programming interfaces. Foundry Local uses ONNX Runtime, optimized through CUDA, with support for NVIDIA TensorRT for RTX coming soon. Getting started is easy: install Foundry Local and invoke Foundry model run gpt-oss-20b in a terminal.
The release of these open-source models kicks off the next wave of AI innovation from enthusiasts and developers looking to add reasoning to their AI-accelerated Windows applications.
Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NVIDIA NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, productivity apps and more on AI PCs and workstations.
Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter. Join NVIDIA's Discord server to connect with community developers and AI enthusiasts for discussions on what's possible with RTX AI.
Follow NVIDIA Workstation on LinkedIn and X.
See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
30/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
30/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
30/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
30/05/2026
Zero in on one that says yes (and no)
Andy Marken May 29, 2026
0 Comments
Hero image courtesy of Deposit Photos
For content creators the most difficu...
29/05/2026
With InfoComm 2026 just weeks away, NDI is giving attendees plenty of reasons to...
29/05/2026
Reaffirming a partnership that has defined Canadian sports broadcasting since 19...
29/05/2026
Mobile/tablet is No. 2 device for watching TV, suggesting that the sports-production industry needs to take another look at the format
Ring Digital's Sprin...
29/05/2026
Berliner Ensemble, one of Berlin's five major theater companies, has expande...
29/05/2026
Solid State Logic will showcase its new compact, fly-away TCA Tour audio product...
29/05/2026
Gerald (Jerry) Pierce, a pioneering technologist who helped shape the digital transformation of the motion picture industry, passed away last month on April 12 ...
29/05/2026
Paramount+ will be the English-language U.S. home for Barclays Women's Super...
29/05/2026
Further strengthening its virtualisation strategy to fully support broadcasters ...
29/05/2026
Swiss broadcaster Canal Alpha has deployed Harmonic's award-winning, software-based XOS Advanced Media Processor to modernize playout operations across cant...
29/05/2026
PTZOptics will showcase a new generation of intelligent video workflows at InfoComm 2026, June 17-19, Las Vegas. Visitors to booth N8227 will see how PTZOptics ...
29/05/2026
Arizona's Family has launched the Arizona's Family Sports (AZFS) streaming app, a new direct-to-consumer destination for live, local sports. The app is ...
29/05/2026
Starting in 2027, DAZN will be the exclusive home of The Canadian Football Leagu...
29/05/2026
Comcast Business has detailed the advanced network infrastructure it has deploye...
29/05/2026
In two-day event, leaders from academia and industry explored solutions to chall...
29/05/2026
The Basketball Tournament (TBT), now entering their 13th year of competition, ha...
29/05/2026
Roku has launched FOX One as a Premium Subscription on The Roku Channel in the U.S. Roku customers can now subscribe to FOX One using their Roku account for liv...
29/05/2026
In its sixth year, the broadcaster's coverage has become a global brand and ...
29/05/2026
Ratings Roundup is a rundown of recent ratings news and is derived from press re...
29/05/2026
The days are getting longer, the temperatures are rising, and playlists are filling up for the season. With summer around the corner, Spotify's global edito...
29/05/2026
New retro-inspired MPC announced
There are few devices that have gained the status held by Akai Pro's MPC range, and in recent years, the company have s...
29/05/2026
Save up to 30 on acclaimed titles
Following a successful launch at Superbooth 2026, Bjooks have revealed that they will be continuing the Kickstarter campa...
29/05/2026
Binaural monitoring application improved
Genelec have just released an update that brings some powerful new features to their HRTF-based binaural headphone ...
29/05/2026
6 June 2026 at SAE Institute, London, UK
IMSTA FESTA 2026 is almost upon us, with some of the biggest names in pro-audio set to descend upon SAE Institute i...
29/05/2026
Gerald (Jerry) Pierce, a pioneering technologist who helped shape the digital transformation of the motion picture industry, died April 12, 2026, at his home in...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Comparing 5 AI Video Enhancers for Restoring Old Video Quality
Kate Luvis May 29, 2026
0 Comments
Digitizing VHS, MiniDV, and other legacy formats doe...
29/05/2026
Studio Hamburg Builds New Post Pipeline with DaVinci Resolve Studio
Brie Clayton May 29, 2026
0 Comments
Workflow replaces a patchwork of legacy tools...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
29/05/2026
At the Intersection of Music and Dance, an Epic Collaboration Boston Conservatory musicians and dancers found creative parallels in their recent performance o...
29/05/2026
Back to All News
Latino Film Institute Fellows Screen Their Short Films at the Egyptian Theatre
The Inclusion Fellowship and Spark Animation Grant uplifts 15 ...
29/05/2026
GLOOKAST LAUNCHES CINNAFILM TACHYON PLUGIN FOR MEDIA PRODUCER AND MEDIA SERVICES
This release first appeared here.
Visit our Tachyon product page or contact...
29/05/2026
May 29 2026, 09:00 (PDT) Dolby and rednote Bring More Immersive Storytelling to...
29/05/2026
Something fundamental has shifted in how people consume media. Audiences aren't abandoning television or radio content; they're just expanding how, wher...
29/05/2026
Youtube exclusive special drops today
Watch now
UKTV today announces another e...
29/05/2026
Back to All News
The Official Trailer of Physical 100 Italy, on Netflix From Se...