
Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.
By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.
We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software, said Guillaume Lample, cofounder and chief scientist of Mistral AI. Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment.
Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.
NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.
This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.
Delivering Unprecedented Accuracy, Flexibility and Efficiency
Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.
With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.
Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.
That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.
Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.
This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.
As a result, models can be deployed anywhere in minutes, rather than several days.
NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.
It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.
The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.
Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.
Advanced Model Development and Customization
The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.
Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.
It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.
The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.
Availability and Deployment
With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.
Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.
See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
21/05/2026
Game Creek Video Columbia and Celtic, NEP Supershooter 8 will house onsite produ...
21/05/2026
Freshly graduated, this upstart producer, director, and camera operator is already working as an AP on videoboard shows for the Philadelphia Phillies
In the li...
21/05/2026
Media Links has announced a channel partnership with Clearcast Asia, a broadcast...
21/05/2026
SiriusXM and NASCAR have announced a multi-year renewal of their broadcasting agreement. SiriusXM will continue to carry live broadcasts of every NASCAR Cup Ser...
21/05/2026
Audio-Technica held a demonstration event at its Technica House location in New ...
21/05/2026
Ateme has announced that RTL Deutschland has selected Ateme's software-based...
21/05/2026
LiveU has announced that BCC Live deployed the LU900Q intelligent production unit for the first time during the 2026 Memorial Hermann IRONMAN Texas North Americ...
21/05/2026
ATSC has announced that Mark Aitken, President of ONE Media and Senior VP of Advanced Technology at Sinclair Broadcast Group, will receive the 2026 Mark Richer ...
21/05/2026
BBright has published a technical analysis of the Media eXchange Layer (MXL), de...
21/05/2026
The Esports Foundation has announced that the 2026 Esports World Cup (EWC) will be hosted in Paris, France, from July 6 through August 23. The event marks the f...
21/05/2026
Chyron has announced PRIME Scorebug, a scorebug solution built on the PRIME Platform for on-premises sports production, and has expanded Chyron LIVE with purpos...
21/05/2026
Media Links has announced the integration of its Xscend IP transport platform with Skyline Communications' DataMiner xOps platform. The integration will be ...
21/05/2026
As live sports productions continue to demand more flexible, scalable, and cost-...
21/05/2026
In advance of this year's Sports Emmy Awards, SVG is taking a deep dive into...
21/05/2026
Hey Miami & Atlanta post-production folks!
Shade is hosting a free private suite at a Braves game (6/2) and Marlins game (6/5) and have about a dozen extra tic...
21/05/2026
The Suns and Mercury become the first NBA and WNBA teams to make games available under a single broadcast partner across both over-the-air and streaming....
21/05/2026
iPhones are part of the the regular production rotation for Friday Night Baseba...
21/05/2026
In advance of this year's Sports Emmy Awards, SVG is taking a deep dive into...
21/05/2026
Heather Matarazzo as Dawn Wiener in Todd Solondz's Welcome to the Dollhouse...
21/05/2026
Spotify has always been about helping you find something you want to listen to. And over the years, we've learned your taste and the moments that matter to ...
21/05/2026
Getting concert tickets today can feel like a race you're set up to lose.
You show up at the right time, refresh endlessly, and still miss out. Too often, ...
21/05/2026
In 2022, Spotify entered a new chapter by introducing audiobooks to our platform. Since then, we've grown our catalog to include more than 700,000 titles, e...
21/05/2026
Opening remarks
ALEX
Good morning everyone, I'm Alex [Norstr m].
GUSTAV
And I'm Gustav [S derstr m].
ALEX
Whether you've been following our j...
21/05/2026
Today, Spotify hosted our third Investor Day in New York City, offering the fina...
21/05/2026
Spotify hat heute seinen dritten Investor Day in New York City veranstaltet und der Finanzwelt tiefere Einblicke in das Gesch ft, die Produktstrategie und die l...
21/05/2026
Aujourd'hui, Spotify a organis son troisi me Investor Day New York. En pl...
21/05/2026
Oggi, a New York City, Spotify ha presentato il suo terzo Investor Day, offrendo...
21/05/2026
Hoy Spotify celebr su tercer Investor Day en Nueva York, donde ofrecimos a la c...
21/05/2026
Hari ini, Spotify menyelenggarakan Investor Day yang ketiga di New York City, me...
21/05/2026
2026 : (Investor Day) , , . ...
21/05/2026
Hoje, o Spotify realizou seu terceiro Investor Day em Nova York, oferecendo co...
21/05/2026
Spotify Investor Day ...
21/05/2026
Spotify bug n, 20'nci y l d n m m z kutlad m z bu y lda, finans camias na, i modelimiz, r n stratejimiz ve uzun vadeli vizyonumuz hakk nda daha detayl ...
21/05/2026
Two new Story Packs join orchestral instrument line-up
Sonuscore have just introduced two new additions to The Score, marking the instrument's first maj...
21/05/2026
30,000 samples, 99 presets & 504 loops
Heavyocity are well known for their hard-hitting cinematic instruments, and their latest release is no exception to t...
21/05/2026
Rohde & Schwarz AI powered voice to data: The future of air traffic control take...
21/05/2026
SKY RAIDER II INTERNATIONAL's modular open systems architecture delivers expanded operational reach and mission flexibility....
21/05/2026
ASO-enabled WESCAM MX-10 systems conduct systematic wide-area maritime search patterns, autonomously managing sensor scan operations to expand coverage, reduce ...
21/05/2026
HBO Max, a new addition to Gracenote Data Hub, is home to the most sports programming among major streamers
NEW YORK May 21, 2026 New analysis by Gracenote...
21/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
21/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
21/05/2026
The UK's leading event for the creative industries united thousands of professionals for two days of networking, debate, industry insight and getting hands-...
21/05/2026
Recreating Doug Trumbull's Slitscan VFX - After Effects Mastery
Graham Quince May 21, 2026
0 Comments
In this After Effects tutorial, I'm divi...
21/05/2026
Cavalry: An Array of Fun Stuff
Simon Ubsdell May 21, 2026
0 Comments
Arrays are a really powerful feature of Cavalry and here we'll go over some o...
21/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...