Exploring the Revenue-Generating Potential of AI Factories

15/05/2025

AI is creating value for everyone - from researchers in drug discovery to quantitative analysts navigating financial market changes.

The faster an AI system can produce tokens, a unit of data used to string together outputs, the greater its impact. That's why AI factories are key, providing the most efficient path from time to first token to time to first value.

AI factories are redefining the economics of modern infrastructure. They produce intelligence by transforming data into valuable outputs - whether tokens, predictions, images, proteins or other forms - at massive scale.

They help enhance three key aspects of the AI journey - data ingestion, model training and high-volume inference. AI factories are being built to generate tokens faster and more accurately, using three critical technology stacks: AI models, accelerated computing infrastructure and enterprise-grade software.

Read on to learn how AI factories are helping enterprises and organizations around the world convert the most valuable digital commodity - data - into revenue potential.

From Inference Economics to Value Creation

Before building an AI factory, it's important to understand the economics of inference - how to balance costs, energy efficiency and an increasing demand for AI.

Throughput refers to the volume of tokens that a model can produce in a given amount of time. Latency is how long the model takes to respond, which is often measured in time to first token - how long it takes before the first output appears - and time per output token, or how fast each additional token comes out. Goodput is a newer metric, measuring how much useful output a system can deliver while hitting key latency targets.
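As a rough illustration of how these metrics relate, the sketch below computes them from per-token arrival times. The `RequestTrace` structure, the function names and the latency thresholds are hypothetical, chosen only for this example, and goodput is simplified here to the throughput of requests that meet both latency targets.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RequestTrace:
    """Timing for one inference request (hypothetical structure, times in seconds)."""
    sent_at: float            # when the prompt was submitted
    token_times: List[float]  # arrival time of each output token (at least one)


def time_to_first_token(trace: RequestTrace) -> float:
    # Delay between sending the prompt and seeing the first output token.
    return trace.token_times[0] - trace.sent_at


def time_per_output_token(trace: RequestTrace) -> float:
    # Average gap between consecutive tokens after the first one.
    n = len(trace.token_times)
    if n < 2:
        return 0.0
    return (trace.token_times[-1] - trace.token_times[0]) / (n - 1)


def throughput(traces: List[RequestTrace], window_seconds: float) -> float:
    # Total output tokens produced per second over the observation window.
    total_tokens = sum(len(t.token_times) for t in traces)
    return total_tokens / window_seconds


def goodput(traces: List[RequestTrace], window_seconds: float,
            max_ttft: float, max_tpot: float) -> float:
    # Simplified goodput: count only tokens from requests that met both targets.
    on_target = [
        t for t in traces
        if time_to_first_token(t) <= max_ttft
        and time_per_output_token(t) <= max_tpot
    ]
    return throughput(on_target, window_seconds)


# Two made-up requests observed over a 4-second window.
fast = RequestTrace(sent_at=0.0, token_times=[0.25, 0.5, 0.75, 1.0, 1.25])
slow = RequestTrace(sent_at=0.0, token_times=[2.0, 2.5, 3.0, 3.5, 4.0])

print(time_to_first_token(fast))        # 0.25
print(time_per_output_token(fast))      # 0.25
print(throughput([fast, slow], 4.0))    # 2.5
print(goodput([fast, slow], 4.0, max_ttft=0.5, max_tpot=0.3))  # 1.25
```

In this made-up data both requests produce five tokens, so they contribute equally to throughput, but only the fast one counts toward goodput.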

User experience is key for any software application, and the same goes for AI factories. Higher throughput lets a system produce more tokens, which can translate into smarter AI, and lower latency ensures timely responses. When both of these measures are balanced properly, AI factories can provide engaging user experiences by quickly delivering helpful outputs.

For example, an AI-powered customer service agent that responds in half a second is far more engaging and valuable than one that responds in five seconds, even if both ultimately generate the same number of tokens in the answer.

Companies can take the opportunity to place competitive prices on their inference output, resulting in more revenue potential per token.

Measuring and visualizing this balance can be difficult - which is where the concept of a Pareto frontier comes in.

AI Factory Output: The Value of Efficient Tokens

The Pareto frontier, represented in the figure below, helps visualize the best ways to balance trade-offs between competing goals - like faster responses vs. serving more users simultaneously - when deploying AI at scale.

The vertical axis represents throughput efficiency, measured in tokens per second (TPS), for a given amount of energy used. The higher this number, the more requests an AI factory can handle concurrently.

The horizontal axis represents the TPS for a single user, which reflects how quickly the model delivers an answer to that user's prompt. The higher the value, the better the expected user experience. Lower latency and faster response times are generally desirable for interactive applications like chatbots and real-time analysis tools.

The Pareto frontier's maximum value - shown as the top value of the curve - represents the best output for given sets of operating configurations. The goal is to find the optimal balance between throughput and user experience for different AI workloads and applications.
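To make the idea concrete, here is a minimal sketch of how a Pareto frontier could be extracted from a set of measured operating configurations. The `Config` type, its field names and all the numbers are invented for illustration and do not come from the figure. A configuration is kept only if no other configuration is at least as good on both axes.

```python
from typing import List, NamedTuple


class Config(NamedTuple):
    """One operating configuration (hypothetical fields and values)."""
    name: str
    tps_per_user: float  # horizontal axis: tokens per second for a single user
    factory_tps: float   # vertical axis: throughput for a fixed energy budget


def pareto_frontier(configs: List[Config]) -> List[Config]:
    """Return the configurations that no other configuration dominates."""
    frontier = []
    best_factory_tps = float("-inf")
    # Walk from the fastest per-user setting to the slowest. A configuration
    # belongs on the frontier only if it beats the factory throughput of
    # every configuration that serves each user at least as fast.
    ordered = sorted(configs, key=lambda c: (-c.tps_per_user, -c.factory_tps))
    for c in ordered:
        if c.factory_tps > best_factory_tps:
            frontier.append(c)
            best_factory_tps = c.factory_tps
    return frontier


# Made-up measurements for six configurations.
measured = [
    Config("A", tps_per_user=20, factory_tps=900),
    Config("B", tps_per_user=50, factory_tps=700),
    Config("C", tps_per_user=50, factory_tps=400),   # dominated by B
    Config("D", tps_per_user=100, factory_tps=300),
    Config("E", tps_per_user=80, factory_tps=250),   # dominated by D
    Config("F", tps_per_user=200, factory_tps=100),
]

print([c.name for c in pareto_frontier(measured)])  # ['F', 'D', 'B', 'A']
```

Each surviving point represents a different trade-off. Which one is best depends on the workload, for example an interactive chatbot versus a batch summarization job.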

The best AI factories use accelerated computing to increase tokens per watt - optimizing AI performance while dramatically increasing energy efficiency across AI factories and applications.

The animation above compares user experience when running on NVIDIA H100 GPUs configured to run at 32 tokens per second per user, versus NVIDIA B300 (Blackwell Ultra) GPUs running at 344 tokens per second per user. At the configured user experience, Blackwell Ultra delivers over a 10x better experience and almost 5x higher throughput, enabling up to 50x higher revenue potential.
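The arithmetic behind these figures can be checked in a few lines. The per-user rates are taken from the text. Treating the revenue figure as the product of the two rounded gains is one plausible reading of the numbers, not something the article states outright.

```python
# Per-user token rates quoted in the text.
H100_TPS_PER_USER = 32
B300_TPS_PER_USER = 344

# Ratio of per-user speeds: the "over 10x better experience" figure.
per_user_gain = B300_TPS_PER_USER / H100_TPS_PER_USER
print(per_user_gain)  # 10.75

# Rounded gains as quoted ("over 10x" and "almost 5x").
quoted_experience_gain = 10
quoted_throughput_gain = 5

# Assumption: revenue potential scales with the product of both gains.
combined_gain = quoted_experience_gain * quoted_throughput_gain
print(combined_gain)  # 50
```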

How an AI Factory Works in Practice

An AI factory is a system of components that come together to turn data into intelligence. It doesn't necessarily take the form of a high-end, on-premises data center, but could be an AI-dedicated cloud or hybrid model running on accelerated compute infrastructure. Or it could be a telecom infrastructure that can both optimize the network and perform inference at the edge.

Any dedicated accelerated computing infrastructure paired with software turning data into intelligence through AI is, in practice, an AI factory.

The components include accelerated computing, networking, software, storage, systems, and tools and services.

When a person prompts an AI system, the full stack of the AI factory goes to work. The factory tokenizes the prompt, turning data into small units of meaning - like fragments of images, sounds and words.
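For text, the tokenization step can be pictured with a toy example like the one below. This is a deliberately simplified sketch: it splits on words and punctuation and assigns integer IDs on the fly, whereas production models typically rely on learned subword vocabularies. The function names are hypothetical.

```python
import re
from typing import Dict, List


def split_into_tokens(text: str) -> List[str]:
    # Lowercase the prompt, then treat each word and each punctuation
    # mark as a separate token.
    return re.findall(r"\w+|[^\w\s]", text.lower())


def encode(text: str, vocab: Dict[str, int]) -> List[int]:
    # Map each token to an integer ID, growing the vocabulary as new
    # tokens appear (a real tokenizer would use a fixed, pretrained one).
    ids = []
    for token in split_into_tokens(text):
        if token not in vocab:
            vocab[token] = len(vocab)
        ids.append(vocab[token])
    return ids


vocab: Dict[str, int] = {}
prompt = "How fast is the factory? The factory is fast."
print(split_into_tokens(prompt))
print(encode(prompt, vocab))  # [0, 1, 2, 3, 4, 5, 3, 4, 2, 1, 6]
```

Repeated words map to the same ID, which is what lets a model treat the prompt as a sequence over a shared vocabulary.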

Each token is put through a GPU-powered AI model, which performs compute-intensive reasoning to generate the best response. Each GPU performs parallel processing - enabled by high-speed networking and interconnects - to crunch data simultaneously.

An AI factory will run this process for different prompts from users across the globe. This is real-time inference, producing intelligence at industrial scale.

Because AI factories unify the full AI lifecycle, this system is continuously improving: inference is logged, edge cases are flagged for retraining and optimization loops tighten over time - all without manual intervention, an example of goodput in action.

Leading global security technology company Lockheed Martin has built its own AI factory to support diverse uses across its business. Through its
LINK: https://blogs.nvidia.com/blog/revenue-potential-ai-factories/...