Sony Pixel Power calrec Sony

Mistral AI and NVIDIA Unveil Mistral NeMo 12B, a Cutting-Edge Enterprise AI Model

18/07/2024

Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.

By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.

We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software, said Guillaume Lample, cofounder and chief scientist of Mistral AI. Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment.

Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.

NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.

This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.

Delivering Unprecedented Accuracy, Flexibility and Efficiency

Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.

With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.

Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.

That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.

Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.

This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.

As a result, models can be deployed anywhere in minutes, rather than several days.

NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.

It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.

The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.

Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.

Advanced Model Development and Customization

The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.

Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.

It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.

The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.

Availability and Deployment

With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.

Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.

See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/mistral-nvidia-ai-model/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

06/04/2026

Fab Five' Reunion Drives TNT and CBS's Experimental Final Four Altcast Built on REMI Workflow

Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...

06/04/2026

SVG New Sponsor Spotlight: Optikka CEO Daniel Evans on Scaling Sports Content with Programmatic Graphics

From high school sports all the way up to the major leagues, building high-quali...

06/04/2026

Quickplay and TwelveLabs Join AWS Business Outcomes Xcelerator Program

Quickplay, an AI company for the media and entertainment industry, has been accepted into the Advanced tier of the TwelveLabs Ecosystem Partner Program. Quickpl...

06/04/2026

Grass Valley Launches Future Playmakers Program for Students in Sports Production and Media Technology

Grass Valley has announced the Future Playmakers Program, a global initiative to...

06/04/2026

SVG All-Stars: Raasean Robinson, Gerente de Posproduccin y Operaciones de Estudio, FOX Deportes

El l der de operaciones impulsa la producci n en estudio mientras encuentra insp...

06/04/2026

SVG All-Stars: Raasean Robinson, Manager, Post Production and Studio Operations, FOX Deportes

The ops leader helps lead the charge in studio for the Spanish-language broadcas...

06/04/2026

Behind The Mic: SiriusXM Shares 2026 Masters Broadcast Team; ESPN to Produce Over 140+ Hours of Masters Live Coverage

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

06/04/2026

NHL Opens Innovation Lab in Partnership with Verizon, New Jersey Devils

The National Hockey League (NHL), in partnership with Verizon and the New Jersey Devils, today announced the opening of the NHL Innovation Lab powered by Verizo...

06/04/2026

ESPN+ To Stream Inaugural Rock League Curling Season

Rock League, a new professional curling league, has announced that ESPN+ will stream its inaugural 2026 season for fans in the United States. The first Rock Lea...

06/04/2026

ASG Appoints Andrea Cummis as VP of Systems Design and Engineering

Advanced Systems Group has announced the appointment of Andrea (Andy) Cummis as Vice President of Systems Design and Engineering. In this role, she will lead de...

06/04/2026

Source Media Group Launches Source Golf, a Creator-Driven YouTube Network Targeting Next-Gen Fans

Backed by Bolt Ventures, the venture brings Bryson DeChambeau, Grant Horvat, and...

06/04/2026

How the NHL's Innovation Lab Will Take Broadcast, Fan, and Team Tech to New Heights

With this environment we can start that collaboration even earlier because we ca...

06/04/2026

K-Pop Artist ENHYPEN Host The Blood Diary,' a New Video Podcast Series From HYBE

Like the immortal lives of vampires, some stories never really end. That's t...

06/04/2026

From Audio to IRL: How Let's Get Haunted' Is Building Community With Spotify RADAR

As podcasting continues to evolve, growth increasingly means building beyond aud...

06/04/2026

FSK Audio update Bark24 Dyn

Multiband dynamics plug-in enhanced California-based developer FSK Audio have released a significant update for their innovative multiband dynamics processo...

06/04/2026

IK Multimedia introduce ToneNET Preset Sharing

Share official & user-created full-rig presets IK Multimedia's latest TONEX update makes it possible for users of the popular amp and effects modelling ...

06/04/2026

Baseball 2026: More AI, Better Viewing Choices

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

JB&A Announces Details for its Pre-NAB 2026 Event

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026 Brie Clayton April 6, 2026 0 Comments Dalet, a leading technology and...

06/04/2026

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show's West Hall

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show...

06/04/2026

Proton to Demonstrate 3D Application at NAB 2026

Proton to Demonstrate 3D Application at NAB 2026 Brie Clayton April 6, 2026 0 Comments Yet further creative potential unleashed through innovation in ...

06/04/2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026 Brie Clayton April 6, 2026 0 Comments Experience Autoscript Voice, PTZ prom...

06/04/2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Show 2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Sho...

06/04/2026

Re-Architectured PCC Software Streamlines and Enhances the Full High-Speed Imaging Workflow

Wayne, N.J., April 6th, 2026 Phantom High-Speed announces the release of PCC 4...

06/04/2026

Tribeca Studios And Lilly Announce Winners Of Inaugural Vital Stories Filmmaker Program

April 6th, 2026 TRIBECA STUDIOS AND LILLY ANNOUNCE WINNERS OF INAUGURAL VITAL...

06/04/2026

Netflix Expands Kids Entertainment Lineup With Playground App for Games, New Shows & Returning Favorites

Back to All News Netflix Expands Kids Entertainment Lineup With Playground App ...

05/04/2026

Latest SoundBridge update now live

Tackles all reported bugs! SoundBridge have just announced the launch of a new update that introduces a couple of minor changes to their remote collaboratio...

04/04/2026

Don't Be Lame: Arizona Men's Basketball Social Team Aims To Catch the Attention of Wildcats Fans

The University of Arizona's Men's Basketball team has only loss twice th...

04/04/2026

HDR Makes Its Men's Final Four Debut as CBS Sports and TNT Sports Collaborate on New Camera Tools and an IP-Powered Compound

1080p HDR arrives, a new generation of storytelling tools takes center stage, an...

04/04/2026

Fab Five Reunion Drives TNT and CBS's Experimental Final Four Altcast Built on REMI Workflow

Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...

04/04/2026

Flock Audio's latest Patch App DX update

Faster, cleaner and more intuitive than ever The control software for Flock Audio's digitally controlled patchbay systems has just been treated to an up...

04/04/2026

Sinclair to FCC: Broadcast Sports Drives Investment in Local News

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

Study: Worldwide Telecom Capex to Decline in 2026,

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

Ateme Delivers Full End-to-End Streaming Platform to Moldtelecom

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

FCC Plans Spending, Regulatory Fee Revenue Reductions in FY 2027

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

DHD Introduces AI-Based Audio Noise Reduction to XD3 IP Core

DHD Introduces AI-Based Audio Noise Reduction to XD3 IP Core Brie Clayton April 3, 2026 0 Comments The accompanying image shows the rear panel of the ...

04/04/2026

Macnica Redefines ST 2110 Flexibility with Two Speeds on One Card

Macnica Redefines ST 2110 Flexibility with Two Speeds on One Card Brie Clayton April 3, 2026 0 Comments New for NAB Show 2026, MEP100 SmartNIC now sup...

04/04/2026

Unified Media Workflows for Story-Centric Production

Unified Media Workflows for Story-Centric Production Brie Clayton April 3, 2026 0 Comments Framelight X unifies field capture, editing and publishing ...

03/04/2026

TNT Sports and CBS Sports To Reunite Michigan's Iconic Fab Five' for Special NCAA Men's Final Four Altcast on truTV and HBO Max

Michigan's Fab Five will reunite for an alternate presentation of the Mich...

03/04/2026

NAB 2026: Avid To Showcase Content Core and AI Workflow Innovations

Avid will exhibit at NAB Show 2026 (April 18-22, Booth N2226, Las Vegas Convention Center), demonstrating its Content Core platform and new AI-driven workflow c...

03/04/2026

MRMC Appoints Nick Barthee as Chief Operating Officer

Mark Roberts Motion Control (MRMC) has announced the appointment of Nick Barthee as Chief Operating Officer. The announcement follows MRMC's transition fro...

03/04/2026

Elite Media Technologies Selects Interra Systems' BATON for File-Based QC

Interra Systems has announced that Elite Media Technologies has selected its BATON file-based QC solution for media workflows. Elite Media Technologies speciali...

03/04/2026

Moldtelecom Deploys Ateme Technologies Across Full Streaming Workflow

Ateme has announced that Moldtelecom has deployed Ateme technologies across its streaming workflow, covering encoding, delivery, operations, and analytics. Mol...

03/04/2026

NAB 2026: Grass Valley To Demonstrate Framelight X Content Management

Grass Valley will demonstrate Framelight X, its content management platform, at NAB Show 2026. The platform connects capture, ingest, editing, and publishing in...

03/04/2026

NAB 2026: Encompass Digital Media and Techex Launch Cloud-Based Master Control Service for Live Events

Encompass Digital Media and Techex have announced a cloud-native Master Control ...