Sony Pixel Power calrec Sony

SLMming Down Latency: How NVIDIA's First On-Device Small Language Model Makes Digital Humans More Lifelike

21/08/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC and workstation users.

At Gamescom this week, NVIDIA announced that NVIDIA ACE - a suite of technologies for bringing digital humans to life with generative AI - now includes the company's first on-device small language model (SLM), powered locally by RTX AI.

The model, called Nemotron-4 4B Instruct, provides better role-play, retrieval-augmented generation and function-calling capabilities, so game characters can more intuitively comprehend player instructions, respond to gamers, and perform more accurate and relevant actions.

Available as an NVIDIA NIM microservice for cloud and on-device deployment by game developers, the model is optimized for low memory usage, offering faster response times and providing developers a way to take advantage of over 100 million GeForce RTX-powered PCs and laptops and NVIDIA RTX-powered workstations.

The SLM Advantage An AI model's accuracy and performance depends on the size and quality of the dataset used for training. Large language models are trained on vast amounts of data, but are typically general-purpose and contain excess information for most uses.

SLMs, on the other hand, focus on specific use cases. So even with less data, they're capable of delivering more accurate responses, more quickly - critical elements for conversing naturally with digital humans.

Nemotron-4 4B was first distilled from the larger Nemotron-4 15B LLM. This process requires the smaller model, called a student, to mimic the outputs of the larger model, appropriately called a teacher. During this process, noncritical outputs of the student model are pruned or removed to reduce the parameter size of the model. Then, the SLM is quantized, which reduces the precision of the model's weights.

With fewer parameters and less precision, Nemotron-4 4B has a lower memory footprint and faster time to first token - how quickly a response begins - than the larger Nemotron-4 LLM while still maintaining a high level of accuracy due to distillation. Its smaller memory footprint also means games and apps that integrate the NIM microservice can run locally on more of the GeForce RTX AI PCs and laptops and NVIDIA RTX AI workstations that consumers own today.

This new, optimized SLM is also purpose-built with instruction tuning, a technique for fine-tuning models on instructional prompts to better perform specific tasks. This can be seen in Mecha BREAK, a video game in which players can converse with a mechanic game character and instruct it to switch and customize mechs.

ACEs Up ACE NIM microservices allow developers to deploy state-of-the-art generative AI models through the cloud or on RTX AI PCs and workstations to bring AI to their games and applications. With ACE NIM microservices, non-playable characters (NPCs) can dynamically interact and converse with players in the game in real time.

ACE consists of key AI models for speech-to-text, language, text-to-speech and facial animation. It's also modular, allowing developers to choose the NIM microservice needed for each element in their particular process.

NVIDIA Riva automatic speech recognition (ASR) processes a user's spoken language and uses AI to deliver a highly accurate transcription in real time. The technology builds fully customizable conversational AI pipelines using GPU-accelerated multilingual speech and translation microservices. Other supported ASRs include OpenAI's Whisper, a open-source neural net that approaches human-level robustness and accuracy on English speech recognition.

Once translated to digital text, the transcription goes into an LLM - such as Google's Gemma, Meta's Llama 3 or now NVIDIA Nemotron-4 4B - to start generating a response to the user's original voice input.

Next, another piece of Riva technology - text-to-speech - generates an audio response. ElevenLabs' proprietary AI speech and voice technology is also supported and has been demoed as part of ACE, as seen in the above demo.

Finally, NVIDIA Audio2Face (A2F) generates facial expressions that can be synced to dialogue in many languages. With the microservice, digital avatars can display dynamic, realistic emotions streamed live or baked in during post-processing.

The AI network automatically animates face, eyes, mouth, tongue and head motions to match the selected emotional range and level of intensity. And A2F can automatically infer emotion directly from an audio clip.

Finally, the full character or digital human is animated in a renderer, like Unreal Engine or the NVIDIA Omniverse platform.

AI That's NIMble In addition to its modular support for various NVIDIA-powered and third-party AI models, ACE allows developers to run inference for each model in the cloud or locally on RTX AI PCs and workstations.

The NVIDIA AI Inference Manager software development kit allows for hybrid inference based on various needs such as experience, workload and costs. It streamlines AI model deployment and integration for PC application developers by preconfiguring the PC with the necessary AI models, engines and dependencies. Apps and games can then orchestrate inference seamlessly across a PC or workstation to the cloud.

ACE NIM microservices run locally on RTX AI PCs and workstations, as well as in the cloud. Current microservices running locally include Audio2Face, in the Covert Protocol tech demo, and the new Nemotron-4 4B Instruct and Whisper ASR in Mecha BREAK.

To Infinity and Beyond Digital humans go far beyond NPCs in games. At last month's SIGGRAPH conference, NVIDIA previewed James, an interactive digital human that can connect with people using emotions, humor and more. James is based on
LINK: https://blogs.nvidia.com/blog/ai-decoded-gamescom-ace-nemotron-instruc...
See more stories from nvidia

Most recent headlines

13/11/2025

SES, Relativity Space Expand Multi-Launch Agreement for Terran R

Luxembourg and Long Beach, CA, 12 November 2025 - SES, a leading space solutions company, announced today an extended multi-year, multi-launch services agreemen...

13/11/2025

Field & Stream, Outdoor America Launch Field & Stream TV

NASHVILLE, Tenn. Field & Stream and Outdoor America have formed a strategic partnership to launch Field & Stream TV, rebranding Outdoor America's free ad-su...

13/11/2025

Silicondust Becomes An ATSC 3.0 Certificate Authority

PHOENIX, Ariz. Silicondust has announced it is now an ATSC 3.0 Certificate Authority for NextGen TV and said that it is offering an Online Certificate Status Pr...

13/11/2025

Nielsen Names Peter Naylor Its First Chief Client Officer

NEW YORK Nielsen has announced that Peter Naylor, an ad sales executive who has worked at some of the largest media companies in the world, will be its first ch...

13/11/2025

CBS Philadelphia's Jim Donovan to Retire in December

PHILADELPHIA After more than 20 years of at CBS Philadelphia and an award-winning career spanning nearly four decades, Jim Donovan, anchor of CBS News Philadelp...

13/11/2025

Frontline Announces 2025-26 Local Journalism Initiative Partners

BOSTON Frontline, PBS's investigative documentary series produced at GBH in Boston, has announced the newest class of partners for its Local Journalism Init...

13/11/2025

Major Study Finds High Levels of Mistakes in AI-Generated News Summaries

A groundbreaking new study by the BBC and the European Broadcasting Union (EBU) has found serious problems with news summaries generated by AI assistants....

13/11/2025

Gabriel Byrne, Carrie Crowley and Russell Howard among the guests on this week's Late Late Show

Legendary actor and proud Irishman Gabriel Byrne will be in studio this week to ...

13/11/2025

International Soccer takes centre stage on a jam-packed four days of live, free-to-air Sport across RT

Tonight's crucial Republic of Ireland World Cup qualifier v Portugal at the ...

13/11/2025

Karen Byrne, Andrew Ryan and Roddy Collins drop in for episode four of The 2 Johnnies Late Night Lock In

In the fourth episode of The 2 Johnnies Late Night Lock In the lads are joined b...

13/11/2025

GeForce NOW Enlists Call of Duty: Black Ops 7' for the Cloud

Chaos has entered the chat. It's GFN Thursday, and things are getting intense with the launch of Call of Duty: Black Ops 7, streaming at launch this week on...

12/11/2025

Wangu Kanuri: Finalist Young Journalist of the Year 2025

For me, no story is too small if it speaks to the ordinary Kenyan, says Wangu Kanuri, a multimedia journalist and contributor to the Nation Media Group working...

12/11/2025

Tracy Bonareri Onchoke: Finalist Young Journalist of the Year 2025

Tracy Bonareri Onchoke is an investigative journalist from Kenya who strives to tell stories that amplify voices pushed to the margins' in her reports for ...

12/11/2025

Godwin Asediba: Finalist Young Journalist of the Year 2025

Godwin Asediba who is an investigative journalist, producer and news anchor with TV3 and 3FM in Ghana, has received death threats for his work exposing injustic...

12/11/2025

SVG TranSPORT 2025: All Sessions Now Available to Watch on SVG PLAY

SVG TranSPORT 2025: All Sessions Now Available to Watch on SVG PLAYEvent addressed the latest in live sports video contribution and distribution technologyBy SV...

12/11/2025

2026 Sundance Film Festival Annual Event Celebrating Sundance Institute: A Tribute to Founder Robert Redford

L-R: Ed Harris, Gyula Gazdag Inaugural Robert Redford Luminary Award to Honor E...

12/11/2025

Give Me the Backstory: Get to Know Alireza Khatami, the Director of The Things You Kill

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

12/11/2025

Morgan Wallen Reflects on His Biggest Hits in New Billions Club: The Series' Episode

In 2023, Morgan Wallen made history when Last Night became the first solo coun...

12/11/2025

Calrec delivers future-focused production for Whisper Cymru

Calrec delivers future-focused production for Whisper Cymru at Wales's first-ever dedicated remote production hub Supporting a growing roster of live sports...

12/11/2025

Blue Lucy Renews Multi-Year Partnership with VSI Group

LONDON, England November 11, 2025 - Blue Lucy, a leading provider of media management and workflow automation solutions, is pleased to announce the renewal o...

12/11/2025

Clear-Com Deployed for Record-Breaking Live Broadcast

ALAMEDA, Calif. Clear-Com says its communications gear was recently deployed for the ADAC RAVENOL 24h Race at Germany's N rburgring circuit, which set a rec...

12/11/2025

Mediagenix Joins AWS ISV Accelerate Program

BRUSSELS Mediagenix has announced that it has joined the Amazon Web Services (AWS) Independent Software Vendor (ISV) Accelerate Program (ISV). This acceptance f...

12/11/2025

Alfalite Partners with Adistec to Expand Presence in the Americas

HUELVA, Spain Alfalite, Europe's only LED screen manufacturer, has announced a strategic partnership with Adistec Corp, a leading distributor of infrastruct...

12/11/2025

Stingray to Acquire TuneIn for up to $175 Million

MONTREAL Stingray Group Inc. has announced that it has entered into a definitive agreement to acquire TuneIn Holdings, Inc. ( 'TuneIn''), a pioneer ...

12/11/2025

Vubiquity Earns AWS Media and Entertainment Competency St...

Vubiquity, an Amdocs company and global leader in technology-led media services, today announced it has achieved the Amazon Web Services (AWS) Media & Entertain...

12/11/2025

SES and AMN Expand Rural Connectivity across Cte d'Ivoire with Major Network Upgrade

Over 200 upgraded sites now delivering 2G and 3G mobile data services to more th...

12/11/2025

DirecTV Launches New CTV Political Ad Platform

NEW YORK and WASHINGTON DirecTV Advertising has launched DirecTV Elect, a new digital platform powered by AI that is specifically designed for political adverti...

12/11/2025

Carr Weighs in on Disney, YouTube Dispute

WASHINGTON Federal Communications Commission Chair Brendan Carr has weighed in on the blackout of ABC, ESPN and other Disney programming on YouTube TV with a po...

12/11/2025

VEON Wins Corporate Governance Awards for Kyivstar Listing and Technology Leadership in Corporate Governance

12 Nov 2025 VEON Wins Corporate Governance Awards for Kyivstar Listing and Tech...

12/11/2025

Sky unveils first of its kind clean power system for film and TV production

Wednesday 12 November 2025 Sky unveils first of its kind clean power system for film and TV production Sky has today unveiled a major new clean energy system ...

12/11/2025

'The Accident 2' Welcomes Brbara de Regil to the Cast and Premieres Official Trailer

Back to All News The Accident 2 Welcomes B rbara de Regil to the Cast and Premi...

12/11/2025

International standards bodies release climate action policy paper at COP30

Wednesday 12th November - Bel m, Brazil - Today, leading organizations IEC, ISO and ULSE, initiators of the Standards Pavilion at UNFCCC COP30, published a join...

12/11/2025

Preferred Business Partner of the German Bundesverband E-Commerce und Versandhandel Deutschland e.V. (bevh)

Arvato Systems Becomes Preferred Business Partner of the German Bundesverband E-...

12/11/2025

Celebrating 21 Years of the RT Choice Music Prize

RT Choice Music Prize In association with IMRO and IRMA 2 0 2 6 K E Y D A T E S Irish Album of the Year 2025 Shortlist 19th January Irish Song of the ...

12/11/2025

NVIDIA Wins Every MLPerf Training v5.1 Benchmark

In the age of AI reasoning, training smarter, more capable models is critical to scaling intelligence. Delivering the massive performance to meet this new age r...

12/11/2025

RT Investigates reveals Court Interpreter in overturned FGM case worked on over 240 other Irish court cases

Parents jailed for over two years after bringing their daughter to hospital for ...

12/11/2025

Faster Than a Click: Hyperlink Agent Search Now Available on NVIDIA RTX PCs

Large language model (LLM)-based AI assistants are powerful productivity tools, but without the right context and information, they can struggle to provide nuan...

11/11/2025

SVG Sit-Down: How Pixellot's Automated-Production-Tech Stack Is Evolving in the AI Era

SVG Sit-Down: How Pixellot's Automated-Production-Tech Stack Is Evolving in ...

11/11/2025

Introducing SVG's New Platinum White Papers' Platform

Introducing SVG's New Platinum White Papers' PlatformTop technology providers detail how they are innovating in sports productionBy SVG Staff Tuesday...

11/11/2025

SVG All-Stars: Vanessa Lindsey, Senior Director, Technical and Remote Operations and Crewing, TNT Sports

SVG All-Stars: Vanessa Lindsey, Senior Director, Technical and Remote Operations...

11/11/2025

Lesson Plan: How Big Ten Network's StudentU Produces Broadcast Pros - and 2,000+ Live Games a Year

Lesson Plan: How Big Ten Network's StudentU Produces Broadcast Pros - and 2,...

11/11/2025

Peacock Performance View Feature Now Available for All NBA Games on Peacock

Peacock Performance View Feature Now Available for All NBA Games on PeacockBy Jason Dachman, Editorial Director, U.S. Tuesday, November 11, 2025 - 2:10 pm P...

11/11/2025

Spotify and NMPA Announce Agreement to Expand Direct-Licensing Audiovisual Opportunities for Independent Publishers

Today, Spotify and the National Music Publishers' Association (NMPA) launche...

11/11/2025

SGL Carbon site in Willich celebrates 30 years of Expertise in High-Tech Prepregs

This year, SGL Carbons Willich site is celebrating a special anniversary. For 30...

11/11/2025

New Nielsen Rural Survey reveals the changing media habits of Filipinos outside major cities

Rural connectivity rising fast Traditional media still matters Rural Filipinos...

11/11/2025

Wohler Bows 3 New Features for iVAM2-MPEG SRT Monitor

Wohler has said it has added three Secure Reliable Transport (SRT) connections to its new iVAM2-MPEG monitor....

11/11/2025

OpenDrives Transforms into a Data Services Company with A...

OpenDrives, Inc., a leader in software-defined data storage and data services, recently hosted an exclusive event in Los Angeles to celebrate the soft launch of...

11/11/2025

NAKIVO Reports 29 Percent Revenue Growth in EMEA and Stro...

NAKIVO Inc., a fast-growing software company specialising in data protection and disaster recovery solutions for virtual, physical, cloud, and SaaS environments...