Sony Pixel Power calrec Sony

Mistral AI and NVIDIA Unveil Mistral NeMo 12B, a Cutting-Edge Enterprise AI Model

18/07/2024

Mistral AI and NVIDIA today released a new state-of-the-art language model, Mistral NeMo 12B, that developers can easily customize and deploy for enterprise applications supporting chatbots, multilingual tasks, coding and summarization.

By combining Mistral AI's expertise in training data with NVIDIA's optimized hardware and software ecosystem, the Mistral NeMo model offers high performance for diverse applications.

We are fortunate to collaborate with the NVIDIA team, leveraging their top-tier hardware and software, said Guillaume Lample, cofounder and chief scientist of Mistral AI. Together, we have developed a model with unprecedented accuracy, flexibility, high-efficiency and enterprise-grade support and security thanks to NVIDIA AI Enterprise deployment.

Mistral NeMo was trained on the NVIDIA DGX Cloud AI platform, which offers dedicated, scalable access to the latest NVIDIA architecture.

NVIDIA TensorRT-LLM for accelerated inference performance on large language models and the NVIDIA NeMo development platform for building custom generative AI models were also used to advance and optimize the process.

This collaboration underscores NVIDIA's commitment to supporting the model-builder ecosystem.

Delivering Unprecedented Accuracy, Flexibility and Efficiency

Excelling in multi-turn conversations, math, common sense reasoning, world knowledge and coding, this enterprise-grade AI model delivers precise, reliable performance across diverse tasks.

With a 128K context length, Mistral NeMo processes extensive and complex information more coherently and accurately, ensuring contextually relevant outputs.

Released under the Apache 2.0 license, which fosters innovation and supports the broader AI community, Mistral NeMo is a 12-billion-parameter model. Additionally, the model uses the FP8 data format for model inference, which reduces memory size and speeds deployment without any degradation to accuracy.

That means the model learns tasks better and handles diverse scenarios more effectively, making it ideal for enterprise use cases.

Mistral NeMo comes packaged as an NVIDIA NIM inference microservice, offering performance-optimized inference with NVIDIA TensorRT-LLM engines.

This containerized format allows for easy deployment anywhere, providing enhanced flexibility for various applications.

As a result, models can be deployed anywhere in minutes, rather than several days.

NIM features enterprise-grade software that's part of NVIDIA AI Enterprise, with dedicated feature branches, rigorous validation processes, and enterprise-grade security and support.

It includes comprehensive support, direct access to an NVIDIA AI expert and defined service-level agreements, delivering reliable and consistent performance.

The open model license allows enterprises to integrate Mistral NeMo into commercial applications seamlessly.

Designed to fit on the memory of a single NVIDIA L40S, NVIDIA GeForce RTX 4090 or NVIDIA RTX 4500 GPU, the Mistral NeMo NIM offers high efficiency, low compute cost, and enhanced security and privacy.

Advanced Model Development and Customization

The combined expertise of Mistral AI and NVIDIA engineers has optimized training and inference for Mistral NeMo.

Trained with Mistral AI's expertise, especially on multilinguality, code and multi-turn content, the model benefits from accelerated training on NVIDIA's full stack.

It's designed for optimal performance, utilizing efficient model parallelism techniques, scalability and mixed precision with Megatron-LM.

The model was trained using Megatron-LM, part of NVIDIA NeMo, with 3,072 H100 80GB Tensor Core GPUs on DGX Cloud, composed of NVIDIA AI architecture, including accelerated computing, network fabric and software to increase training efficiency.

Availability and Deployment

With the flexibility to run anywhere - cloud, data center or RTX workstation - Mistral NeMo is ready to revolutionize AI applications across various platforms.

Experience Mistral NeMo as an NVIDIA NIM today via ai.nvidia.com, with a downloadable NIM coming soon.

See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/mistral-nvidia-ai-model/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

26/02/2026

AWS Launches New Tool for Vertical Video Conversion

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Broadpeak Launches Multiview For Live Sports at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Griffin Media Rolls Out Bitcentral Core News At KWTV, KOTV

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Adobes New Firefly QuickCut Gives Video Editors a Starting Point

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Disney Gains But YouTube Continues to Dominate Screentime

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

BCNEXXT Adds HLG-Based HDR To Vipe Platform

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

FCC Launches Inquiry Into Broadcast Sports Rights

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Avids New CPO Discusses AI, NAB Show and Newsroom Tech

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Samsung Taps Gracenote for AI-Powered Discovery

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Less Tools, More Visibility: TAG Video Systems at NAB 2026

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Studio Technologies Dante-Based Solutions Power High-Prof...

With more than four decades of experience in radio broadcasting and live sports production, Daryl Doss, owner of Doss Technical Services and a contract engineer...

26/02/2026

BCNEXXT Deploys Live HLG-Based HDR Playout Within Vipe Pl...

BCNEXXT has deployed live HLG-based HDR playout capabilities within its Vipe platform, enabling broadcasters to integrate High Dynamic Range into live productio...

26/02/2026

TAG Video Systems at NAB 2026

TAG Video Systems (Booth W2323) will unveil new capabilities across its IP-native Realtime Media Platform at NAB 2026. New releases include visual service healt...

26/02/2026

IBC2026 unveils strategic partnership with EIT Culture an...

IBC today announced a new strategic partnership with EIT Culture & Creativity the institutional partnership for culture and creativity, supported by the Europ...

26/02/2026

Clear-Coms Arcadia Central Station and FreeSpeak Icon Bel...

Clear-Com kept the action on track at Red Bull Shay'iMoto, an adrenaline-fueled motorsport spinning event that transformed the streets of Durban, South Afr...

26/02/2026

Alcom Elevates Headend Video Service with Harmonic to Dri...

Harmonic (NASDAQ: HLIT) today announced that Alcom, a leading telco operator in Finland, is powering its next-generation white-label headend video service with ...

26/02/2026

Big Blue Marble named as Launch Partner for AWS Elemental...

Big Blue Marble, a provider of broadcast-grade, cloud-native video solutions for broadcasters, service providers, and content owners, today announced that it ha...

26/02/2026

Broadpeak launches Multiview solution to simplify multi-s...

New approach enables video service providers to deliver multiple live feeds on the same screen with lower costs and improved device compatibility Broadpeak, a ...

26/02/2026

Space42 Reports Full-Year 2025 Earnings and returns to Quarterly Growth

Final quarter revenues increase 7% year-on-year, with accelerating momentum in the second half Space Services grows revenues by 6% year-on-year and records hig...

25/02/2026

Record Global Audiences Announced as Olympic Baton Passes to French Alps 2030

With the Olympic Flag officially handed over to the organisers of the next Winter Games and the baton passed from Milano Cortina 2026 to French Alps 2030, the I...

25/02/2026

BBC Shrinks IBC Footprint as Remote Production Takes Center Stage

From a studio overlooking the Dolomites to workflows routed through Milan and into Salford, the BBC delivered a lean and mean operation for its Winter Games c...

25/02/2026

Making the Warner Bros. Discovery Sports Winter Olympics Production Work, From Monitoring to Transmission to Comms

Warner Bros. Discovery (WBD) Sports is managing a huge network of channels acros...

25/02/2026

How Warner Bros. Discovery Sports Used XR to Bring the Peaks of the Dolomites Into View

From its base in the northern Italian town of Cortina, Warner Bros. Discovery (W...

25/02/2026

New AWS Elemental Inference' Offers AI-Powered, Real-Time Vertical-Video Conversion

In addition to 16:9-to-9:16 intelligent cropping for live video, Inference autom...

25/02/2026

Netflix To Livestream Floyd Mayweather Jr. vs. Manny Pacquiao Rematch on Sept. 19 From Sphere

Longtime rivals Floyd Money Mayweather Jr. (50-0, 27 KOs) and Manny PacMan P...

25/02/2026

Portland Fire, Thorns Announce Landmark Broadcast Partnership with Gray Media's FOX 12 Plus

The WNBA's Portland Fire and NWSL's Portland Thorns announce a groundbre...

25/02/2026

NHL, Cosm Install C360 10.5K Capture Systems at the League's Arenas

Multi-angle coverage, on-demand access to ultra-high-resolution video are provided for replays and clips across multiple distribution channels The NHL and Cosm...

25/02/2026

SVG Sit-Down: Cosm's Devin Poolman and Evan Wimer on Installing 10.5K C360 Cameras at All 32 NHL Venues

The implementation standardizes an integrated workflow connecting ultra-high-res...

25/02/2026

Hangin' With the Hornets: Enjoy Basketball Goes Behind the Scenes at NBA Charlotte Franchise

Targeting a younger audience, creator-led network's Access Granted series hi...

25/02/2026

Orlando Magic Jump Into ST 2110 With New Production-Control Room at Kia Center

Alpha, the project's systems integrator, assisted in the workflow transformation Tipping off the second half of the 2025-26 home schedule against the Houst...

25/02/2026

OCVIBE and Global Digital Display Firm Daktronics Announce Technology Partnership

OCVIBE, the 100-acre mixed-use development transforming the area surrounding Hon...

25/02/2026

Level Up Your Playlists' Transitions With Smart Reorder

It's never been easier to customize your Spotify listening experience. Last year, we introduced more control over the way your playlist sounds, giving Premi...

25/02/2026

Who's Going to Lead Hip-Hop's Next Generation? Vote Now on Spotify

Hip-hop thrives on constant reinvention, with bold voices and fearless experimentation continually pushing the genre's boundaries. Every era brings new lead...

25/02/2026

Keeping America's Space Watchtower Sharp: US and Australia Work to Advance Critical Telescope Capacity

L3Harris technicians recently completed a major mirror refurbishment for the U.S...

25/02/2026

Nielsen Utilizes Scarborough To Introduce 200+ New, Advanced Audience Segments Via Nielsen ONE

This new offering helps solve for the need to move beyond traditional audience d...

25/02/2026

Samsung taps Gracenote to supercharge range of AI initiatives

Gold-standard Gracenote content metadata will power Samsung's LLM-enabled entertainment search discovery experiences and more NEW YORK February 25, 202...

25/02/2026

Afrobeats Icon Tiwa Savage Joins Forces with Berklee to Empower African Talent

Afrobeats Icon Tiwa Savage Joins Forces with Berklee to Empower African Talent In collaboration with Berklee Global, the Tiwa Savage Music Foundation will hos...

25/02/2026

Wowza Names Jon Corley as Chief Innovation Officer

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Milan Cortina Winter Olympics U.S. Viewing Best Since 2014

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

When It Comes to the Upper C-Band, Wireless Carriers Want More

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

ASG Elevates Jody Boatwright to Chief Strategy Officer

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Study: Premium Video Ads Outperform YouTube

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

TelevisaUnivision's ViX Streamer Achieves Profitability

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Arch Platform Technologies and Wacom Announce Strategic I...

Arch Platform Technologies, a leading platform for creating and managing cloud workstation infrastructure, and Wacom, the world's leading manufacturer of in...

25/02/2026

Transform live video for mobile audiences with AWS Elemen...

Today, AWS is announcing AWS Elemental Inference, a fully managed AI service that automatically transforms and maximizes live and on-demand video broadcasts to ...