
An AI agent is only as accurate, relevant and timely as the data that powers it.
Now generally available, NVIDIA NeMo microservices are helping enterprise IT quickly build AI teammates that tap into data flywheels to scale employee productivity. The microservices provide an end-to-end developer platform for creating state-of-the-art agentic AI systems and continually optimizing them with data flywheels informed by inference and business data, as well as user preferences.
With a data flywheel, enterprise IT can onboard AI agents as digital teammates. These agents can tap into user interactions and data generated during AI inference to continuously improve model performance - turning usage into insight and insight into action.
Building Powerful Data Flywheels for Agentic AI Without a constant stream of high-quality inputs - from databases, user interactions or real-world signals - an agent's understanding can weaken, making responses less reliable and agents less productive.
Maintaining and improving the models that power AI agents in production requires three types of data: inference data to gather insights and adapt to evolving data patterns, up-to-date business data to provide intelligence, and user feedback data to advise if the model and application are performing as expected. NeMo microservices help developers tap into these three data types.
NeMo microservices speed AI agent development with end-to-end tools for curating, customizing, evaluating and guardrailing the models that drive their agents.
NVIDIA NeMo microservices - including NeMo Customizer, NeMo Evaluator and NeMo Guardrails - can be used alongside NeMo Retriever and NeMo Curator to ease enterprises' experiences building, optimizing and scaling AI agents through custom enterprise data flywheels. For example:
NeMo Customizer accelerates large language model fine-tuning, delivering up to 1.8x higher training throughput. This high-performance, scalable microservice uses popular post-training techniques including supervised fine-tuning and low-rank adaptation.
NeMo Evaluator simplifies the evaluation of AI models and workflows on custom and industry benchmarks with just five application programming interface (API) calls.
NeMo Guardrails improves compliance protection by up to 1.4x with only half a second of additional latency, helping organizations implement robust safety and security measures that align with organizational policies and guidelines.
With NeMo microservices, developers can build data flywheels that boost AI agent accuracy and efficiency. Deployed through the NVIDIA AI Enterprise software platform, NeMo microservices are easy to operate and can run on any accelerated computing infrastructure, on premises or in the cloud, with enterprise-grade security, stability and support.
The microservices have become generally available at a time when enterprises are building large-scale multi-agent systems, where hundreds of specialized agents - with distinct goals and workflows - collaborate to tackle complex tasks as digital teammates, working alongside employees to assist, augment and accelerate work across functions.
This enterprise-wide impact positions AI agents as a trillion-dollar opportunity - with applications spanning automated fraud detection, shopping assistants, predictive machine maintenance and document review - and underscores the critical role data flywheels play in transforming business data into actionable insights.
Data flywheels built with NVIDIA NeMo microservices constantly curate data, retrain models and evaluate their performance, all with minimal human interactions and maximum autonomy. Industry Pioneers Boost AI Agent Accuracy With NeMo Microservices NVIDIA partners and industry pioneers are using NeMo microservices to build responsive AI agent platforms so that digital teammates can help get more done.
Working with Arize and Quantiphi, AT&T has built an advanced AI-powered agent using NVIDIA NeMo, designed to process a knowledge base of nearly 10,000 documents, refreshed weekly. The scalable, high-performance AI agent is fine-tuned for three key business priorities: speed, cost efficiency and accuracy - all increasingly critical as adoption scales.
AT&T boosted AI agent accuracy by up to 40% using NeMo Customizer and Evaluator by fine-tuning a Mistral 7B model to help deliver personalized services, prevent fraud and optimize network performance.
BlackRock is working with NeMo microservices for agentic AI capabilities in its Aladdin tech platform, which unifies the investment management process through a common data language.
Teaming with Galileo, Cisco's Outshift team is using NVIDIA NeMo microservices to power a coding assistant that delivers 40% fewer tool selection errors and achieves up to 10x faster response times.
Nasdaq is accelerating its Nasdaq Gen AI Platform with NeMo Retriever microservices and NVIDIA NIM microservices. NeMo Retriever enhanced the platform's search capabilities, leading to up to 30% improved accuracy and response times, in addition to cost savings.
Broad Model and Partner Ecosystem Support for NeMo Microservices NeMo microservices support a broad range of popular open models, including Llama, the Microsoft Phi family of small language models, Google Gemma, Mistral and Llama Nemotron Ultra, currently the top open model on scientific reasoning, coding and complex math benchmarks.
Meta has tapped NVIDIA NeMo microservices through new connectors for Meta Llamastack. Users can access the same capabilities - including Customizer, Evaluator and Guardrails - via APIs, enabling them to run the full suite of agent-building workflows within their environment.
With Llamastack integration, agent builders can implement data flywheels powered by NeMo microservices, said Raghotham Murthy, software engineer, GenAI, at Meta. This allows them to co
More from Nvidia
03/07/2025
The forecast this month is showing a 100% chance of epic gaming. Catch the scorching lineup of 20 titles coming to the cloud, which gamers can play whether indo...
02/07/2025
Black Forest Labs, one of the world's leading AI research labs, just changed the game for image generation.
The lab's FLUX.1 image models have earned g...
01/07/2025
In many parts of the world, including major technology hubs in the U.S., there's a yearslong wait for AI factories to come online, pending the buildout of n...
26/06/2025
As of today, NVIDIA now supports the general availability of Gemma 3n on NVIDIA RTX and Jetson. Gemma, previewed by Google DeepMind at Google I/O last month, in...
26/06/2025
Editor's note: This blog is a part of Into the Omniverse, a series focused o...
26/06/2025
Mark Theriault founded the startup FITY envisioning a line of clever cooling products: cold drink holders that come with freezable pucks to keep beverages cold ...
26/06/2025
This GFN Thursday rolls out a new reward and games for GeForce NOW members. Whether hunting for hot new releases or rediscovering timeless classics, members can...
24/06/2025
To get the most out of AI, optimizations are critical. When developers think about optimizing AI models for inference, model compression techniques-such as quan...
24/06/2025
To speed up AI adoption across industries, HPE and NVIDIA today launched new AI factory offerings at HPE Discover in Las Vegas.
The new lineup includes everyth...
24/06/2025
From the heart of Germany's automotive sector to manufacturing hubs across F...
19/06/2025
GeForce NOW is throwing open the vault doors to welcome the legendary Borderland series to the cloud.
Whether a seasoned Vault Hunter or new to the mayhem of P...
18/06/2025
Project G-Assist - available through the NVIDIA App - is an experimental AI assistant that helps tune, control and optimize NVIDIA GeForce RTX systems.
NVIDIA&...
17/06/2025
As a global labor shortage leaves 50 million positions unfilled across industrie...
13/06/2025
Industrial AI isn't slowing down. Germany is ready.
Following London Tech Week and GTC Paris at VivaTech, NVIDIA founder and CEO Jensen Huang's Europea...
12/06/2025
Generative AI has reshaped how people create, imagine and interact with digital ...
12/06/2025
Level up GeForce NOW experiences this summer with 40% off Performance Day Passes. Enjoy 24 hours of premium cloud gaming with RTX ON, delivering low latency and...
11/06/2025
NVIDIA is launching a comprehensive, industry-defining autonomous vehicle (AV) software platform to accelerate large-scale deployment of safe, intelligent trans...
11/06/2025
NVIDIA Research has developed an AI light switch for videos that can turn daytim...
11/06/2025
Using NVIDIA platforms, tools and libraries, European telecommunications institu...
11/06/2025
NVIDIA was today named an Autonomous Grand Challenge winner at the Computer Visi...
11/06/2025
In the face of growing labor shortages and need for sustainability, European man...
11/06/2025
AI is packing and shipping efficiency for the retail and consumer packaged goods (CPG) industries, with a majority of surveyed companies in the space reporting ...
11/06/2025
Urban populations are expected to double by 2050, which means around 2.5 billion...
11/06/2025
Telecom companies last year spent nearly $295 billion in capital expenditures an...
11/06/2025
In a new effort to advance sovereign AI for European public service media, NVIDI...
11/06/2025
At GTC Paris - held alongside VivaTech, Europe's largest tech event - NVIDIA founder and CEO Jensen Huang delivered a clear message: Europe isn't just a...
10/06/2025
Germany's Leibniz Supercomputing Centre, LRZ, is gaining a new supercomputer...
10/06/2025
With a more detailed simulation of the Earth's climate, scientists and resea...
10/06/2025
Cisco and NVIDIA are helping set a new standard for secure, scalable and high-performance enterprise AI.
Announced today at the Cisco Live conference in San Di...
09/06/2025
AI isn't waiting. And this week, neither is Europe.
At London's Olympia, under a ceiling of steel beams and enveloped by the thrum of startup pitches, ...
08/06/2025
U.K. Prime Minister Keir Starmer's ambition for Britain to be an AI maker, not an AI taker, is becoming a reality at London Tech Week.
With NVIDIA's ...
05/06/2025
GeForce NOW is a gamer's ticket to an unforgettable summer of gaming. With 25 titles coming this month and endless ways to play, the summer is going to be e...
04/06/2025
NVIDIA is working with companies worldwide to build out AI factories - speeding ...
04/06/2025
Humans learn the norms, values and behaviors of society from each other - and Bernt B rnich, founder and CEO of 1X Technologies, thinks robots should learn like...
04/06/2025
4:2:2 cameras - capable of capturing double the color information compared with most standard cameras - are becoming widely available for consumers. At the same...
02/06/2025
Editor's note: This blog, originally published on October 28, 2024, has been...
02/06/2025
Since a 7.8-magnitude earthquake hit Syria and T rkiye two years ago - leaving 5...
29/05/2025
Ready for a front-row seat to the next scientific revolution?
That's the idea behind Doudna - a groundbreaking supercomputer announced today at Lawrence Be...
29/05/2025
Large language models (LLMs), trained on datasets with billions of tokens, can generate high-quality content. They're the backbone for many of the most popu...
29/05/2025
GeForce NOW is supercharging Valve's Steam Deck with a new native app - delivering the high-quality GeForce RTX-powered gameplay members are used to on a po...
28/05/2025
Building effective agentic AI systems requires rethinking how technology interac...
27/05/2025
Over a century ago, Henry Ford pioneered the mass production of cars and engines...
27/05/2025
NVIDIA and Google share a long-standing relationship rooted in advancing AI inno...
22/05/2025
GeForce NOW is turning up the heat this summer with a hot new deal. For a limited time, save 40% on six-month Performance memberships and enjoy premium GeForce ...
21/05/2025
As robots increasingly make their way to the largest enterprises' manufacturing plants and warehouses, the need for access to critical business and operatio...
20/05/2025
Industrial AI is transforming how factories operate, innovate and scale.
The convergence of AI, simulation and digital twins is poised to unlock new levels of ...
19/05/2025
Agentic AI is redefining scientific discovery and unlocking research breakthroughs and innovations across industries. Through deepened collaboration, NVIDIA and...
19/05/2025
Across robot training and development, NVIDIA Research is uncovering breakthroughs in areas such as multimodal generative AI and synthetic data generation.
The...
19/05/2025
Generative AI is transforming PC software into breakthrough experiences - from digital humans to writing assistants, intelligent agents and creative tools.
NVI...
18/05/2025
Electricity. The Internet. Now it's time for another major technology, AI, to sweep the globe.
NVIDIA founder and CEO Jensen Huang took the stage at a pack...