
Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.
By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.
We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software, said Guillaume Lample, cofounder and chief scientist of Mistral AI. Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment.
Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.
NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.
This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.
Delivering Unprecedented Accuracy, Flexibility and Efficiency
Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.
With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.
Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.
That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.
Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.
This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.
As a result, models can be deployed anywhere in minutes, rather than several days.
NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.
It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.
The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.
Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.
Advanced Model Development and Customization
The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.
Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.
It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.
The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.
Availability and Deployment
With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.
Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.
See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
06/05/2026
How changes to proteins can alter drug interactions for new precision therapies Scripps Research team maps how chemical modifications to proteins affect drug bi...
05/05/2026
Experts from the world of academia, tech, business, politics and media convened for a Thomson Talks at the Cambridge Disinformation Summit in April. It's th...
05/05/2026
Three phones were hardwired for power and transmission to the truck; camera feat...
05/05/2026
The creative studio behind campaigns for the NBA, Fanatics Sportsbook & Casino, ...
05/05/2026
Nielsen has announced results from a co-viewing pilot program covering February&...
05/05/2026
viztrick AiDi, an on-device AI solution developed by Nippon TV, delivered global...
05/05/2026
ARRI has announced Omnibar, a battery-powered, IP65-rated multi-color LED linear...
05/05/2026
Imagine Communications has announced that France T l visions is the first broadc...
05/05/2026
The Women's National Basketball Association (WNBA) and Bell Media today announced a multiyear agreement to broadcast and stream WNBA games in Canada beginni...
05/05/2026
SVG is proud to announce Warner Bros. Discovery's Techwood Studios in Atlant...
05/05/2026
With no operator required, AutoMic workflow automates talent identification on U...
05/05/2026
A crash in 2015 set the industry back, but this winter proved that drones are he...
05/05/2026
Another year, and more proof that Asia continues to shape some of the world's most exciting new sounds. This year's RADAR artists draw from deep local r...
05/05/2026
The Austin City Limits Music Fest 2026 lineup just dropped, and this year, Spoti...
05/05/2026
New drum machine book campaign incoming
Bjooks have announced that during Superbooth 2026, they will be launching a Kickstarter campaign to fund the product...
05/05/2026
Flagship all-in-one production bundle updated
The latest version of Native Instruments' flagship virtual instrument and plug-in bundle has just been ann...
05/05/2026
Rohde & Schwarz to host RF Testing Innovations Forum 2026, helping design engine...
05/05/2026
L3Harris provides communications, electronic warfare, sensors and mission systems that enable Virginia-class submarine crews to operate with confidence in conte...
05/05/2026
The company grew by 7.6% in net revenue and 16.3% in EBITDA, achieving a 33% inc...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Project Marks First Major Broadcast Deployment of Latest Addition to SNP Lineup
Imagine Communications today announced that France T l visions is the first br...
05/05/2026
Shotoku Broadcast Systems Wins 2026 NAB Show Product of the Year Award
Shotoku Broadcast Systems announced today that its Swoop range of robotic cranes has be...
05/05/2026
DigitalGlue's creative.space Intelligence Wins Future's Best of Show Award, Presented by TV Tech
creative.space Intelligence (CSI), part of the creativ...
05/05/2026
Zixi, a leader in live video delivery and workflow orchestration, will showcase next-generation broadcast workflows at the Media Production and Technology Show ...
05/05/2026
Stingr marks its launch with a new approach to second-screen interactivity
Brie Clayton May 5, 2026
0 Comments
Huge leap forward in revenues and engag...
05/05/2026
Shotoku Broadcast Systems Wins 2026 NAB Show Product of the Year Award
Brie Clayton May 5, 2026
0 Comments
Shotoku Broadcast Systems announced today tha...
05/05/2026
Following a successful NAB Show in Las Vegas, DHD will promote examples from its wide range of broadcast-quality audio production equipment at the May 13th-14th...
05/05/2026
LucidLink today announced its programme for MPTS 2026, where it will exhibit at Stand M59 at Olympia London, 13 to 14 May. The company will showcase its latest ...
05/05/2026
Limecraft today announces the release of Limecraft 2026.3, the third platform update in its 2026 release cycle. Limecraft is an AI-powered production platform t...
05/05/2026
Huge leap forward in revenues and engagement...
05/05/2026
Broadcast Solutions, a leading system integrator and provider of innovative solutions for the broadcast media industry, has taken another significant step in st...
05/05/2026
Operative today announced the appointment of Dang Ly as Chief Product Officer, signaling the company's accelerating commitment to delivering the next genera...
05/05/2026
The Media Talent Manifesto (MTM) today announces the return of the World Skills Caf at IBC2026, positioning the event as a critical industry forum to confront ...
05/05/2026
ARRI unveils Omnibar: compact, modular, battery-powered IP65 LED bars with preci...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
05/05/2026
Digital Domain Welcomes Award-Nominated VFX Supervisor Jelmer Boskma
Brie Clayton May 4, 2026
0 Comments
Digital Domain, a global leader in visual eff...
05/05/2026
Enterprise AI has learned to generate. It has learned to reason. Now companies are asking the next question: How should AI act?
Early agent systems have shown ...
05/05/2026
May 5th, 2026 Press Materials Available Here
2026 TRIBECA FESTIVAL UNVEILS EXP...
05/05/2026
Back to All News
Limited Series About The Greatest Soccer Team Of All Time: Net...
05/05/2026
FOX Sports, FOX One and Indeed Launch Nationwide Search for FOX One Chief World...
05/05/2026
GoVertical! Technology Recognized for Ability to Provide Real-Time 9:16 Autocrop...