
Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.
By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.
"We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software," said Guillaume Lample, cofounder and chief scientist of Mistral AI. "Together, we have developed a model with unprecedented accuracy, flexibility, high efficiency, and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment."
Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.
NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.
This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.
Delivering Unprecedented Accuracy, Flexibility and Efficiency
Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.
With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.
Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.
That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.
Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.
This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.
As a result, models can be deployed anywhere in minutes, rather than several days.
NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.
It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.
The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.
Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.
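As a rough illustration of why the data format matters for single-GPU deployment, the sketch below estimates the memory needed for the model weights alone at different precisions and compares it with GPU memory capacity. The capacities used (48 GB for the L40S, 24 GB for the GeForce RTX 4090 and 24 GB for the RTX 4500) are assumptions not stated in this announcement, the parameter count is rounded to exactly 12 billion, and the helper names are hypothetical. The estimate ignores the KV cache, activations and runtime overhead, so real requirements are higher.

```python
# Back-of-the-envelope, weights-only memory estimate (illustrative sketch).
# Assumption: exactly 12e9 parameters ("12-billion-parameter" in the text).
PARAMS = 12_000_000_000

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"FP32": 4, "FP16": 2, "FP8": 1}

# Assumption: GPU memory capacities in GB; these are not given in the text.
GPU_MEMORY_GB = {"L40S": 48, "GeForce RTX 4090": 24, "RTX 4500": 24}


def weights_gb(precision: str) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return PARAMS * BYTES_PER_PARAM[precision] / 1e9


def headroom_gb(precision: str, gpu: str) -> float:
    """Memory left on the GPU after loading the weights (may be <= 0)."""
    return GPU_MEMORY_GB[gpu] - weights_gb(precision)


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        for gpu in GPU_MEMORY_GB:
            print(f"{precision:>4} on {gpu:<16}: "
                  f"weights {weights_gb(precision):5.1f} GB, "
                  f"headroom {headroom_gb(precision, gpu):6.1f} GB")
```

Under these assumptions, FP8 weights take about 12 GB and leave roughly half of a 24 GB card free for the KV cache and activations, while FP16 weights alone would fill such a card.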
Advanced Model Development and Customization
The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.
Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.
It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.
The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.
Availability and Deployment
With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.
Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.
See notice regarding software product information.