
In collaboration with OpenAI, NVIDIA has optimized the company's new open-source gpt-oss models for NVIDIA GPUs, delivering smart, fast inference from the cloud to the PC. These new reasoning models enable agentic AI applications such as web search, in-depth research and many more.
With the launch of gpt-oss-20b and gpt-oss-120b, OpenAI has opened cutting-edge models to millions of users. AI enthusiasts and developers can use the optimized models on NVIDIA RTX AI PCs and workstations through popular tools and frameworks like Ollama, llama.cpp and Microsoft AI Foundry Local, and expect performance of up to 256 tokens per second on the NVIDIA GeForce RTX 5090 GPU.
OpenAI showed the world what could be built on NVIDIA AI - and now they're advancing innovation in open-source software, said Jensen Huang, founder and CEO of NVIDIA. The gpt-oss models let developers everywhere build on that state-of-the-art open-source foundation, strengthening U.S. technology leadership in AI - all on the world's largest AI compute infrastructure.
The models' release highlights NVIDIA's AI leadership from training to inference and from cloud to AI PC.
Open for All Both gpt-oss-20b and gpt-oss-120b are flexible, open-weight reasoning models with chain-of-thought capabilities and adjustable reasoning effort levels using the popular mixture-of-experts architecture. The models are designed to support features like instruction-following and tool use, and were trained on NVIDIA H100 GPUs. AI developers can learn more and get started using instructions from the NVIDIA Technical Blog.
These models can support up to 131,072 context lengths, among the longest available in local inference. This means the models can reason through context problems, ideal for tasks such as web search, coding assistance, document comprehension and in-depth research.
The OpenAI open models are the first MXFP4 models supported on NVIDIA RTX. MXFP4 allows for high model quality, offering fast, efficient performance while requiring fewer resources compared with other precision types.
Run the OpenAI Models on NVIDIA RTX With Ollama The easiest way to test these models on RTX AI PCs, on GPUs with at least 24GB of VRAM, is using the new Ollama app. Ollama is popular with AI enthusiasts and developers for its ease of integration, and the new user interface (UI) includes out-of-the-box support for OpenAI's open-weight models. Ollama is fully optimized for RTX, making it ideal for consumers looking to experience the power of personal AI on their PC or workstation.
Once installed, Ollama enables quick, easy chatting with the models. Simply select the model from the dropdown menu and send a message. Because Ollama is optimized for RTX, there are no additional configurations or commands required to ensure top performance on supported GPUs.
Testing OpenAI's open models in Ollama is easy. Ollama's new app includes other new features, like easy support for PDF or text files within chats, multimodal support on applicable models so users can include images in their prompts, and easily customizable context lengths when working with large documents or chats.
Developers can also use Ollama via command line interface or the app's software development kit (SDK) to power their applications and workflows.
Other Ways to Use the New OpenAI Models on RTX Enthusiasts and developers can also try the gpt-oss models on RTX AI PCs through various other applications and frameworks, all powered by RTX, on GPUs that have at least 16GB of VRAM.
NVIDIA continues to collaborate with the open-source community on both llama.cpp and the GGML tensor library to optimize performance on RTX GPUs. Recent contributions include implementing CUDA Graphs to reduce overhead and adding algorithms that reduce CPU overheads. Check out the llama.cpp GitHub repository to get started.
Overall performance of the gpt-oss-20b model on various RTX AI PCs. Windows developers can also access OpenAI's new models via Microsoft AI Foundry Local, currently in public preview. Foundry Local is an on-device AI inferencing solution that integrates into workflows via the command line, SDK or application programming interfaces. Foundry Local uses ONNX Runtime, optimized through CUDA, with support for NVIDIA TensorRT for RTX coming soon. Getting started is easy: install Foundry Local and invoke Foundry model run gpt-oss-20b in a terminal.
The release of these open-source models kicks off the next wave of AI innovation from enthusiasts and developers looking to add reasoning to their AI-accelerated Windows applications.
Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NVIDIA NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, productivity apps and more on AI PCs and workstations.
Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter. Join NVIDIA's Discord server to connect with community developers and AI enthusiasts for discussions on what's possible with RTX AI.
Follow NVIDIA Workstation on LinkedIn and X.
See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
NVIDIA kicked off the year at CES, where the crowd buzzed about the latest gaming announcements - including the native GeForce NOW app for Linux and Amazon Fire...
14/01/2026
Staines-upon-Thames, UK, 13th January, 2026 ITV, one of the UKs leading broadcasters, has selected Yospace, the global leader in Dynamic Ad Insertion (DAI), to ...
14/01/2026
Tech Focus: Audio Consoles, Part 2 - New Options for Virtual MixingA variety of solutions offer both technical and economic benefitsBy Dan Daley, Audio Editor
...
14/01/2026
Tech Focus: Audio Consoles, Part 1 - Key Component Evolves Toward the Totally Vi...
14/01/2026
SVG Summit 2025: Audio from Monday Workshops Now AvailableListen to sessions from Live Production Innovation, AI Production Tools, Cloud Production, Content Wor...
14/01/2026
The L3Harris large T7 robotic systems will provide U.S. Navy and U.S. Marines wi...
14/01/2026
Steiger Media's adoption of Calrec's compact Argo M console not only makes its innovative new hybrid truck faster, more efficient, and agile, but also e...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
January 14th, 2026
TRIBECA ANNOUNCES BEST NEW YORK SHORT AWARD FOR 25TH ANNIVERSARY FESTIVAL
In Celebration of Its 25th Anniversary, Tribeca Introduces a N...
14/01/2026
Wednesday 14 January 2026
Sky News announces Cathy Newman to lead flagship new political programme
Sky News today announces that award-winning journalist and ...
14/01/2026
Back to All News
State of Fear, The First Spin-Off of a Netflix Brazil Producti...
14/01/2026
The first stamp of An Post's 2026 Stamp Programme, marking 100 Years of Broadcasting, was unveiled at the GPO by Patrick O'Donovan TD, Minister for Cult...
14/01/2026
It's official! Beverley Callard has landed in Carrigstown. The beloved actor, known for her unforgettable roles and iconic screen presence, is joining the c...
13/01/2026
Independent media in Brazil and Colombia is facing an urgent crisis of traditional business models alongside a deteriorating security environment, according to ...
13/01/2026
NHL Situation Room 2.0: How Sony Hawk-Eye Powers Centralized Officiating, Player...
13/01/2026
NBC Sports Ices the Audio for the 2026 Prevagen U.S. Figure Skating Championship...
13/01/2026
DMF and MXL in practice: Which vendors are adopting it, and how fast is the ecos...
13/01/2026
CES 2026: Five Important Sports-Tech BuzzwordsThe terms highlight innovations for sports production at the showBy Daniel Frankel, SVG Contributor
Tuesday, Jan...
13/01/2026
For TGL Season 2, Unity 6 Boosts Virtual-Graphic Quality; COSM 360 Cameras Impro...
13/01/2026
Resetting Expectations? The State of the Sports Industry with Devoncroft's J...
13/01/2026
Top Row L-R: Ana Katz, Natalia Almada, Bao Nguyen, Tatiana Maslany, A.V. Rockwell, Dr. Heather Berlin
Second Row L-R: Sophie Barthes, Azazel Jacobs, Janicza Br...
13/01/2026
DoW to invest $1B in planned independently traded Missile Solutions business...
13/01/2026
L3Harris Chairman and CEO Christopher Kubasik and Under Secretary of War for Acq...
13/01/2026
April 10, 2025
First Gulf has taken a significant step in its U.S. expansion with the launch of its first industrial development in the country.
First Westla...
13/01/2026
April 11, 2025
Canadian footwear retailer SoftMoc has signed a lease for 145,600 square feet at 901 Hopkins Street in Whitby, where the space will serve as a w...
13/01/2026
April 14, 2025
First Gulf is proud to announce that 25 Ontario has officially received its occupancy permit, marking the transition from an active construction...
13/01/2026
April 28, 2025
First Gulf has been awarded a design-build lease for a new 350,000 square foot office and warehouse facility for Sherwin-Williams. This project ...
13/01/2026
August 13, 2025
First Gulf Expands U.S. Industrial Footprint with First Savanna...
13/01/2026
August 13, 2025
First Gulf is proud to partner with Toromont Industries Ltd. to...
13/01/2026
October 10, 2025
First Gulf is pleased to announce that PPFD, a leading third-party logistics company, has leased 146,536 square feet at 901 Hopkins Street in ...
13/01/2026
Singapore - January 13, 2026 - Nielsen today announced the appointment of Matty Lin to its Commercial Organization as APAC regional sales leader.
Based in Sing...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Nine-week performance series brings music, dance, theatre, and storytelling to downtown Durham, January - March 2026 (Durham, NC) The Chamber Orchestra of the T...
13/01/2026
Berklee Launches AIMS, an Artist-Centered Summit on Music and AI Hosted by the Berklee Emerging Artistic Technology Lab (BEATL), the event will focus on the i...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
13/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...