
Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.
By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.
We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software, said Guillaume Lample, cofounder and chief scientist of Mistral AI. Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment.
Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.
NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.
This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.
Delivering Unprecedented Accuracy, Flexibility and Efficiency
Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.
With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.
Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.
That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.
Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.
This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.
As a result, models can be deployed anywhere in minutes, rather than several days.
NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.
It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.
The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.
Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.
Advanced Model Development and Customization
The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.
Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.
It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.
The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.
Availability and Deployment
With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.
Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.
See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
06/09/2026
June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
23/06/2026
Multifaceted Growth Executive Brings 20+ Years of Experience Leading Organizations Across Tech and M&E
Imagine Communications today announced the appointment ...
23/06/2026
Australians in Film and Screen Australia's talent development initiative UNT...
23/06/2026
Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026
Brie Clayton June 22, 2026
0 Comments
New Relay Solution Combines DMX, ...
23/06/2026
SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Gl...
23/06/2026
Newly identified molecule strengthens the eye's response to damage in retinal disease Scripps Research discovery finds that restoring the naturally occurrin...
22/06/2026
Behind The Mic provides a roundup of recent news regarding on-air talent, includ...
22/06/2026
Cosm has announced the appointment of David Ho as Chief Legal Officer, a newly created executive role reporting to President and CEO Jeb Terry. Ho will oversee ...
22/06/2026
Warner Bros. Discovery and Amazon Web Services (AWS) have announced the developm...
22/06/2026
Daktronics has completed an audio control system upgrade at Petco Park in San Di...
22/06/2026
Accelerate Media has named John Willi as President and announced the launch of the Accelerate Sports Network (ASN), a prep sports media and streaming platform c...
22/06/2026
All Women's Sports Network (AWSN) and 3XBA (3 3 Basketball Association) have announced live television coverage of the annual 3XBA tournament on Friday, Jun...
22/06/2026
OWL AI has announced the appointment of Jay Prasad as Chief Executive Officer and member of the Board of Directors. Prasad succeeds Josh Gwyther, who has served...
22/06/2026
CP Communications delivered RF video and audio support for TNT's Inside the NBA at the 2026 NBA Finals, providing main show coverage in San Antonio and ea...
22/06/2026
Polymarket has announced a partnership with GRID, an official esports data platf...
22/06/2026
As sports venues continue to evolve into more video-centric, fan-engagement-driv...
22/06/2026
As the regional sports production scene shifts toward streaming, this Texan helps lead the engineering behind Victory+'s growing live platform...
22/06/2026
By Kristin Feeley, Director, Documentary Film & Artist Programs
the memories of your elders [are] a scaffolding for you to build your identity on - and t...
22/06/2026
New hyper-resolution analyser EQ revealed
CEDAR Audio's all-new Icons plug-in series has just gained its newest member, Blade. Described by the compan...
22/06/2026
Turn any live input into a cinematic soundscape
Designed for use in the studio and on stage, Sampleson's latest creation is capable of taking any audio ...
22/06/2026
Adds guitar strings to Eurorack rigs
ADDAC System are renowned for their weird and wonderful synth designs, and their line-up includes plenty of gear that...
22/06/2026
FIFA World Cup 2026 fever grows, as more than one third of Australians tune in ...
22/06/2026
In our latest blog, Tim Pearson explores NAGRA Venturi, the new streaming security solution for the AI era from NAGRAVISION. Designed to aggregate and analyze ...
22/06/2026
Expanded integrations give advertisers access to distinct contextual signals acr...
22/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/06/2026
Xilica today announced the release of Dynamic Voice Lift, a new feature in Xilica Designer v4.12 that brings adaptive speech reinforcement to large meeting spac...
22/06/2026
Telecom operators have seen remarkable returns from using generative AI to automate network management, customer care and back-office operations. Most of that i...
22/06/2026
Monday 22 June 2026
Official trailer released for Katie Price: Nothing to Hide,...
22/06/2026
The next era of AI will not be defined by compute alone. Its growth will be dete...
22/06/2026
Mission, Vision and Veritas - new Los Alamos National Laboratory (LANL) supercom...
22/06/2026
At the ISC conference running in Hamburg this week, NVIDIA is introducing new so...
22/06/2026
For the past two years, the U.S. National Science Foundation's National Arti...
22/06/2026
JUPITER, Europe's first exascale supercomputer at Germany's Forschungszentrum J lich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 Inf...
21/06/2026
To call the 2026 FIFA World Cup a big undertaking would be a big understatement....
21/06/2026
New series now live on Udemy
Regular SOS contributor and Cubase workshop columnist John Walden has just released a new Cubase video course that is now avail...
21/06/2026
Hot tubs sit at about 38 to 40 degrees Celsius, warm enough that most people can only soak for about 15 minutes. NVIDIA's newest AI servers can run their co...
21/06/2026
Sunday 21 June 2026
Sky announces immersive documentary series The Wargame
The Wargame first looks
ZIP (2MB)
Sky today confirms the commission of The Wargam...
20/06/2026
New add-on creates doubles & vocal stacks
IK Multimedia's latest ReSing add-on kits the innovative software out with the ability to automatically genera...
20/06/2026
What exactly is Apogee Control V3?
Control V3 is a new mixer application that controls Apogee interfaces. The new hit feature is that V3 finally allows for...
19/06/2026
Split compound eases operational challenges at Shinnecock Hills Golf Club...
19/06/2026
North Carolina, Oklahoma meet in the best-of-three Finals as ESPN leans into spe...