
Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.
By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.
"We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software," said Guillaume Lample, cofounder and chief scientist of Mistral AI. "Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment."
Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.
NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.
This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.
Delivering Unprecedented Accuracy, Flexibility and Efficiency
Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.
With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.
Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.
That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.
Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.
This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.
As a result, models can be deployed anywhere in minutes, rather than several days.
NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.
It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.
The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.
Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.
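The single-GPU claim can be sanity-checked with rough arithmetic. The sketch below is illustrative only: it counts weight storage alone (no KV cache, activations or runtime overhead), the function names are hypothetical, and the memory capacities are the published figures for each card, with the "RTX 4500" entry assumed to be the 24 GB Ada Generation model.

```python
# Back-of-the-envelope weight-memory estimate for a 12-billion-parameter model.
# Assumption: only weights are counted; KV cache, activations and runtime
# overhead are ignored, so real deployments need more memory than shown here.

PARAMS = 12e9  # 12 billion parameters, as stated in the announcement

# Bytes used to store one parameter in each data format.
BYTES_PER_PARAM = {"fp16": 2, "fp8": 1}

# Published GPU memory capacities in GB.
# Assumption: "RTX 4500" refers to the RTX 4500 Ada Generation (24 GB).
GPU_MEMORY_GB = {"L40S": 48, "GeForce RTX 4090": 24, "RTX 4500": 24}


def weight_memory_gb(params: float, fmt: str) -> float:
    """Return the memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[fmt] / 1e9


def headroom_gb(gpu: str, fmt: str, params: float = PARAMS) -> float:
    """Return GPU memory left after loading the weights, in GB."""
    return GPU_MEMORY_GB[gpu] - weight_memory_gb(params, fmt)


if __name__ == "__main__":
    for gpu in GPU_MEMORY_GB:
        for fmt in BYTES_PER_PARAM:
            print(f"{gpu:>18} {fmt}: {headroom_gb(gpu, fmt):5.1f} GB headroom")
```

Under these assumptions, FP8 halves the weight footprint relative to FP16 (about 12 GB instead of 24 GB), which is what leaves room for the KV cache on a 24 GB card.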
Advanced Model Development and Customization
The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.
Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.
It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.
The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.
Availability and Deployment
With the flexibility to run anywhere, whether cloud, data center or RTX workstation, Mistral NeMo is ready to revolutionize AI applications across various platforms.
Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.
See notice regarding software product information.