
In collaboration with OpenAI, NVIDIA has optimized the company's new open-source gpt-oss models for NVIDIA GPUs, delivering smart, fast inference from the cloud to the PC. These new reasoning models enable agentic AI applications such as web search, in-depth research and many more.
With the launch of gpt-oss-20b and gpt-oss-120b, OpenAI has opened cutting-edge models to millions of users. AI enthusiasts and developers can use the optimized models on NVIDIA RTX AI PCs and workstations through popular tools and frameworks like Ollama, llama.cpp and Microsoft AI Foundry Local, and expect performance of up to 256 tokens per second on the NVIDIA GeForce RTX 5090 GPU.
OpenAI showed the world what could be built on NVIDIA AI - and now they're advancing innovation in open-source software, said Jensen Huang, founder and CEO of NVIDIA. The gpt-oss models let developers everywhere build on that state-of-the-art open-source foundation, strengthening U.S. technology leadership in AI - all on the world's largest AI compute infrastructure.
The models' release highlights NVIDIA's AI leadership from training to inference and from cloud to AI PC.
Open for All Both gpt-oss-20b and gpt-oss-120b are flexible, open-weight reasoning models with chain-of-thought capabilities and adjustable reasoning effort levels using the popular mixture-of-experts architecture. The models are designed to support features like instruction-following and tool use, and were trained on NVIDIA H100 GPUs. AI developers can learn more and get started using instructions from the NVIDIA Technical Blog.
These models can support up to 131,072 context lengths, among the longest available in local inference. This means the models can reason through context problems, ideal for tasks such as web search, coding assistance, document comprehension and in-depth research.
The OpenAI open models are the first MXFP4 models supported on NVIDIA RTX. MXFP4 allows for high model quality, offering fast, efficient performance while requiring fewer resources compared with other precision types.
Run the OpenAI Models on NVIDIA RTX With Ollama The easiest way to test these models on RTX AI PCs, on GPUs with at least 24GB of VRAM, is using the new Ollama app. Ollama is popular with AI enthusiasts and developers for its ease of integration, and the new user interface (UI) includes out-of-the-box support for OpenAI's open-weight models. Ollama is fully optimized for RTX, making it ideal for consumers looking to experience the power of personal AI on their PC or workstation.
Once installed, Ollama enables quick, easy chatting with the models. Simply select the model from the dropdown menu and send a message. Because Ollama is optimized for RTX, there are no additional configurations or commands required to ensure top performance on supported GPUs.
Testing OpenAI's open models in Ollama is easy. Ollama's new app includes other new features, like easy support for PDF or text files within chats, multimodal support on applicable models so users can include images in their prompts, and easily customizable context lengths when working with large documents or chats.
Developers can also use Ollama via command line interface or the app's software development kit (SDK) to power their applications and workflows.
Other Ways to Use the New OpenAI Models on RTX Enthusiasts and developers can also try the gpt-oss models on RTX AI PCs through various other applications and frameworks, all powered by RTX, on GPUs that have at least 16GB of VRAM.
NVIDIA continues to collaborate with the open-source community on both llama.cpp and the GGML tensor library to optimize performance on RTX GPUs. Recent contributions include implementing CUDA Graphs to reduce overhead and adding algorithms that reduce CPU overheads. Check out the llama.cpp GitHub repository to get started.
Overall performance of the gpt-oss-20b model on various RTX AI PCs. Windows developers can also access OpenAI's new models via Microsoft AI Foundry Local, currently in public preview. Foundry Local is an on-device AI inferencing solution that integrates into workflows via the command line, SDK or application programming interfaces. Foundry Local uses ONNX Runtime, optimized through CUDA, with support for NVIDIA TensorRT for RTX coming soon. Getting started is easy: install Foundry Local and invoke Foundry model run gpt-oss-20b in a terminal.
The release of these open-source models kicks off the next wave of AI innovation from enthusiasts and developers looking to add reasoning to their AI-accelerated Windows applications.
Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NVIDIA NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, productivity apps and more on AI PCs and workstations.
Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter. Join NVIDIA's Discord server to connect with community developers and AI enthusiasts for discussions on what's possible with RTX AI.
Follow NVIDIA Workstation on LinkedIn and X.
See notice regarding software product information.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
15/10/2025
NEW YORK The NBA is making major changes to the NBA App and NBA TV as it takes control of them from TNT Sports, which has long managed the league's digital ...
15/10/2025
SAN MATEO, Calif. In what promises to be a major expansion of interactive features and personalized content on the DirecTV platform, the operator and Glance hav...
15/10/2025
SAN JOSE, Calif. Roku has launched changes to its user interface (UI) that the streaming platform says will better showcase original programming on the platform...
15/10/2025
LOS ANGELES Software-defined data storage and data services provider OpenDrives has elevated Alex Dunfey to chief technology officer, responsible for driving th...
15/10/2025
Series coming in 2026 stars Tom Vaughan-Lawlor, Justine Mitchell and Jason O'Mara released today
RT today released first look images of new comedy-drama ...
14/10/2025
SVG Europe Summit 2025: All Sessions Now Available to Watch on SVG PLAYNetworking event that preceded IBC2025 shone a light on elite live sports innovation acro...
14/10/2025
SVG Sit-Down: Author Rich Podolsky on Writing Madden & Summerall: How They Revo...
14/10/2025
SVG All-Stars: Michael Reiners, Coordinating Producer, FloRacingThe Illinois State grad steers a vast schedule of motorsports events at tracks across the countr...
14/10/2025
Content protection: Getting the right management for your DRM By Neal Romanek
Friday, October 10, 2025 - 10:11
Print This Story
Eluvio power the EPCR'...
14/10/2025
As League Takes Over Ops, NBA TV and NBA App Add 60 Games, Weekday Studio Show, ...
14/10/2025
Time and effort: World's largest student-led broadcast prepares to go On Air...
14/10/2025
(L-R) Guest, Kimberly Robinson Jones, Geeta Gandbhir, Pamela Dias, and Takema Ro...
14/10/2025
Lossless ist jetzt mit Spotify Premium verf gbar.
Verlustfreies Audio war eine...
14/10/2025
La qualit Lossless est disponible sur Spotify Premium.
Le format sans perte de...
14/10/2025
For the seventh edition of Spotify and FC Barcelona's artist jersey series, ...
14/10/2025
Spotify is committed to bringing the best listening experience to all our users, and that includes parents and families. That's why we're expanding mana...
14/10/2025
Since its debut, the Spotify Original podcast Caso 63 has been more than just a story; it's been a cultural sensation. The science fiction thriller captivat...
14/10/2025
Desde su debut, el podcast original de Spotify Caso 63 ha sido mucho m s que una historia: se ha convertido en un fen meno cultural. Este thriller de ciencia fi...
14/10/2025
Lossless p Spotify Premium r h r.
Lossless-ljud har varit en av de mest efterl ngtade funktionerna p Spotify och nu, ntligen, har den b rjat rullas ut til...
14/10/2025
Early next year, your favorite video podcasts are getting a bigger stage. Spotify and Netflix are teaming up to bring sports, culture, lifestyle, and true crime...
14/10/2025
Last week, the 4th global Safety Day took place at all SGL Carbon sites.
This years Safety Day focused on hazardous substances. Various information events, wor...
14/10/2025
From bowser to basket, 9 in 10 Aussies are feeling the impact of rising prices
26% of households earn over $160k, but are still concerned about rising prices...
14/10/2025
New players take a bite out of big bank share as consumers increasingly value tr...
14/10/2025
56% of Aussies are looking for a coastal holiday, while 40% are planning a road ...
14/10/2025
51% of Aussies want a hybrid car and 36% want a full EV
Toyota leads the market
75% research online before a new car purchase
Sydney - October 14, 2025 - Aus...
14/10/2025
Unilever leads the market
Beverages, smartphones, and food dominate category sp...
14/10/2025
Top insurance advertisers
Biggest growth categories
Sector ad spend up 4.7...
14/10/2025
WAYNE, Pa. Private-equity firm Saothair Capital Partners said it has completed the acquisition of GatesAir through a newly-formed affiliate....
14/10/2025
Media Excel, a leading provider of encoding and transcoding solutions, today announced that Space Norway, a leading provider of satellite services and operator ...
14/10/2025
Jason Tyler has joined ZTransform, a leader in media environment innovation, as Inside Sales and Procurement Manager bringing commercial and operational focus t...
14/10/2025
14 10 2025 - Media release Tiny toys, big missions: Knee High Spies launches on ABC this November
Knee High Spies
Kids, assemble! The ABC and Screen Australi...
14/10/2025
Abu Dhabi, UAE October 14, 2025: Space42 (ADX: SPACE42), the AI-powered SpaceT...
14/10/2025
Abu Dhabi, UAE October 14, 2025: Space42 (ADX: SPACE42), the UAE-based AI-powered SpaceTech company with a global reach, has signed a Memorandum of Understand...
14/10/2025
Joe Wilkinson and David Earl will explore their favourite sitcoms together with help from stars such as Ricky Gervais
14th October, London: Comedians, writers,...
14/10/2025
October 14th, 2025 ANNA SARGENT, VICTOR SLEZAK, ALI AHN, MARCELINE HUGOT, AND S...
14/10/2025
The Sky Original event series - a symphony of genius, rivalry and vengeance - al...
14/10/2025
ESA awards Rohde & Schwarz for contributions to 30 years European Satellite Navi...
14/10/2025
The Hollywood Professional Association (HPA) today unveiled key highlights of the 2026 HPA Tech Retreat, scheduled for Feb. 15-19 at the Westin Rancho Mirage Go...
14/10/2025
Rena Ayer Joins Red Seat Ventures as Senior Vice President, Content & Talent Par...
14/10/2025
Imelda May explores her relationship with the Irish language through songs and sean-n s singing
Friday 17 October, 8.30pm on RT One and RT Player
Watch tr...
14/10/2025
AI is transforming the way enterprises build, deploy and scale intelligent applications. As demand surges for enterprise-grade AI applications that offer speed,...
14/10/2025
At Oracle AI World, NVIDIA and Oracle announced they are deepening their collabo...
13/10/2025
Spectrum Brings Selected L.A. Lakers Games to Apple Vision Pro With New Immersiv...
13/10/2025
Media Climate Accord aims to offer united approach to M&E industry sustainabilit...
13/10/2025
Riot Games streamlines production of Valorant Champions Paris with ST 2110 flypa...
13/10/2025
Feeling the NRG: Riot Games puts on a show for Valorant Champions Paris final By Jo Ruddock
Monday, October 13, 2025 - 09:17
Print This Story
After more t...
13/10/2025
FOX Sports MLB Postseason Audio Aims To Make Officials' Calls More AccurateA1 Joe Carpenter hopes to bring some baseball CSI' to the ABS ump-cam system...
13/10/2025
By Katie Arthurs
Whether told through dance, ceremony, spoken word, or visual a...
13/10/2025
New SBS and NITV Original RECKLESS a Deadly Funny Thriller Straight Out of Fre...