
AI is creating value for everyone - from researchers in drug discovery to quantitative analysts navigating financial market changes.
The faster an AI system can produce tokens, a unit of data used to string together outputs, the greater its impact. That's why AI factories are key, providing the most efficient path from time to first token to time to first value.
AI factories are redefining the economics of modern infrastructure. They produce intelligence by transforming data into valuable outputs - whether tokens, predictions, images, proteins or other forms - at massive scale.
They help enhance three key aspects of the AI journey - data ingestion, model training and high-volume inference. AI factories are being built to generate tokens faster and more accurately, using three critical technology stacks: AI models, accelerated computing infrastructure and enterprise-grade software.
Read on to learn how AI factories are helping enterprises and organizations around the world convert the most valuable digital commodity - data - into revenue potential.
From Inference Economics to Value Creation Before building an AI factory, it's important to understand the economics of inference - how to balance costs, energy efficiency and an increasing demand for AI.
Throughput refers to the volume of tokens that a model can produce. Latency is the amount of tokens that the model can output in a specific amount of time, which is often measured in time to first token - how long it takes before the first output appears - and time per output token, or how fast each additional token comes out. Goodput is a newer metric, measuring how much useful output a system can deliver while hitting key latency targets.
User experience is key for any software application, and the same goes for AI factories. High throughput means smarter AI, and lower latency ensures timely responses. When both of these measures are balanced properly, AI factories can provide engaging user experiences by quickly delivering helpful outputs.
For example, an AI-powered customer service agent that responds in half a second is far more engaging and valuable than one that responds in five seconds, even if both ultimately generate the same number of tokens in the answer.
Companies can take the opportunity to place competitive prices on their inference output, resulting in more revenue potential per token.
Measuring and visualizing this balance can be difficult - which is where the concept of a Pareto frontier comes in.
AI Factory Output: The Value of Efficient Tokens The Pareto frontier, represented in the figure below, helps visualize the most optimal ways to balance trade-offs between competing goals - like faster responses vs. serving more users simultaneously - when deploying AI at scale.
The vertical axis represents throughput efficiency, measured in tokens per second (TPS), for a given amount of energy used. The higher this number, the more requests an AI factory can handle concurrently.
The horizontal axis represents the TPS for a single user, representing how long it takes for a model to give a user the first answer to a prompt. The higher the value, the better the expected user experience. Lower latency and faster response times are generally desirable for interactive applications like chatbots and real-time analysis tools.
The Pareto frontier's maximum value - shown as the top value of the curve - represents the best output for given sets of operating configurations. The goal is to find the optimal balance between throughput and user experience for different AI workloads and applications.
The best AI factories use accelerated computing to increase tokens per watt - optimizing AI performance while dramatically increasing energy efficiency across AI factories and applications.
The animation above compares user experience when running on NVIDIA H100 GPUs configured to run at 32 tokens per second per user, versus NVIDIA B300 GPUs running at 344 tokens per second per user. At the configured user experience, Blackwell Ultra delivers over a 10x better experience and almost 5x higher throughput, enabling up to 50x higher revenue potential.
How an AI Factory Works in Practice An AI factory is a system of components that come together to turn data into intelligence. It doesn't necessarily take the form of a high-end, on-premises data center, but could be an AI-dedicated cloud or hybrid model running on accelerated compute infrastructure. Or it could be a telecom infrastructure that can both optimize the network and perform inference at the edge.
Any dedicated accelerated computing infrastructure paired with software turning data into intelligence through AI is, in practice, an AI factory.
The components include accelerated computing, networking, software, storage, systems, and tools and services.
When a person prompts an AI system, the full stack of the AI factory goes to work. The factory tokenizes the prompt, turning data into small units of meaning - like fragments of images, sounds and words.
Each token is put through a GPU-powered AI model, which performs compute-intensive reasoning on the AI model to generate the best response. Each GPU performs parallel processing - enabled by high-speed networking and interconnects - to crunch data simultaneously.
An AI factory will run this process for different prompts from users across the globe. This is real-time inference, producing intelligence at industrial scale.
Because AI factories unify the full AI lifecycle, this system is continuously improving: inference is logged, edge cases are flagged for retraining and optimization loops tighten over time - all without manual intervention, an example of goodput in action.
Leading global security technology company Lockheed Martin has built its own AI factory to support diverse uses across its business. Through its
Most recent headlines
04/09/2025
Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...
15/05/2025
Airbus U.S. Space & Defense and L3Harris Technologies announced a teaming agreem...
15/05/2025
Mexico City - May 15, 2025 - Nielsen, the global leader in audience measurement, data, and analytics, announced the expansion of its streaming measurement panel...
15/05/2025
NEW YORK Warner Bros. Discovery U.S. Ad Sales today unveiled NEO and DemoDirect, two new solutions that the company said will provide advertisers with more effe...
15/05/2025
NEP Europe is utilizing technology and teams from across Europe for this year's Eurovision Song Contest....
15/05/2025
TAMPA BAY, Fla. & CINCINNATI, Ohio The Tampa Bay Lightning have inked a multi-year media rights agreement with Scripps Sports that gives it rights to produce an...
15/05/2025
NEW YORK Warner Bros. Discovery announced during its upfront presentation that Max, the company's streaming platform, will be rebranded as HBO Max this summ...
15/05/2025
LONDON 7fivefive, has announced that it is expanding its work for BBC Studios' Global Media & Streaming team and that it is helping scale its virtual infras...
15/05/2025
STAMFORD, Conn. Charter Communications, Inc. has named Jake Perlman executive vice president, chief technology and information officer (CTIO)....
15/05/2025
ACT Entertainment is proud to announce that Vice President of Talent Mike Schmid has been honored with a Gold Stevie Award as Human Resources Executive of the ...
15/05/2025
Arabsat, a leading global satellite operator and the primary provider of satellite services across the Arab world, has partnered with Grass Valley, the technolo...
15/05/2025
Leading media technology and services provider Tyrell, who has supported the broadcast, post-production, production, VFX, and corporate markets in the UK and Ir...
15/05/2025
At Broadcast Asia 2025, LiveU will demonstrate its latest mission-critical IP-video solutions within its expanded EcoSystem, designed to add efficiency and shor...
15/05/2025
Delivering High-Performance Network Switches Designed for Lightware AV Workflows
Lightware, a leading manufacturer of connectivity solutions for the profession...
15/05/2025
Cerberus Tech, a leader in cloud-native IP video contribution and distribution, today announced that the company has been named a finalist in the 2025 StreamTV ...
15/05/2025
BroadcastAsia 2025 Exhibitor Preview
May 27-29
Singapore Expo
Stand 5F3-5
For today's broadcasters, telcos, content owners, and streaming platforms, eff...
15/05/2025
Marshall Electronics will spotlight its Elite Series of PTZ Cameras at InfoComm 2025 (Booth 3843). Marshall's Elite Series of PTZ cameras includes the CV630...
15/05/2025
ACT Entertainment, the industry-leading manufacturer and distributor of live performance equipment, announces that it has been selected as a U.S. distributor fo...
15/05/2025
TAG Video Systems will be presenting its latest advancements in real time video monitoring, probing, and visualization at Broadcast Asia 2025 in Booth 5D1-1 (Ma...
15/05/2025
Clear-Com is set to showcase its latest products at InfoComm 2025, taking place from June 7-13 in Orlando Florida. Clear-Com will present attendees with a hand...
15/05/2025
Amagi, a cloud-based SaaS technology solutions provider for broadcast and streaming TV, has appointed Emma Whitmore as its new Senior Vice President of Sales, E...
15/05/2025
Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, gaming and more, is further...
15/05/2025
NAKIVO Inc., a leading vendor of data protection solutions for physical, virtual, cloud, and SaaS environments, announced strong operational results for Q1 2025...
15/05/2025
Boston Conservatory at Berklee Honors Tania Le n and Kelli O'Hara at Commenc...
15/05/2025
John Yao Awarded Guggenheim Fellowship for Interactive Jazz Project The Berklee professor and trombonist will debut Let's Make Some Noise, an immersive bi...
15/05/2025
Back to All News
Trailer for Lost in Starlight' Offers a Vibrant and Heart...
15/05/2025
Back to All News
Reflecting on a Year of Progress in Accessibility and Whats Ahead
Heather Dowdy
Director of Product Accessibility
Product
15 May 2025
Glo...
15/05/2025
A Reflection on 20 years of NAB booth builds and the Evolution of Grass Valley AMPPAt NAB, I had a moment to reflect on just how dramatically our approach to pr...
15/05/2025
The Football Association of Ireland and RT have today announced a new two-year ...
15/05/2025
Editor's note: This post is part of Into the Omniverse, a series focused on ...
15/05/2025
AI is creating value for everyone - from researchers in drug discovery to quantitative analysts navigating financial market changes.
The faster an AI system ca...
15/05/2025
Facebook
Twitter
LinkedIn
Thales fully supports the success of Vipps, Norw...
15/05/2025
Facebook
Twitter
LinkedIn
Serbia and Montenegro Air Traffic Services SMATS...
15/05/2025
Facebook
Twitter
LinkedIn
Thales, a global leader in high-power lasers, will inaugurate GenF on Thursday 15 May 2025 in Le Barp (Bordeaux). GenF aims to t...
15/05/2025
Steel clashes and war drums thunder as a new age of battle dawns - one that will test even the mightiest Slayer.
This GFN Thursday, DOOM: The Dark Ages - the b...
14/05/2025
Spotify and FC Barcelona's partnership saw its sixth jersey takeover this we...
14/05/2025
At Spotify, we believe that our incredible bandmates (that's what we call our team) are the driving force behind our success. Longtime Spotifier Anna Lundst...
14/05/2025
The strategic partnership between L3Harris and Palantir is already providing significant progress in developing real-world Radio-as-a-Sensor concepts....
14/05/2025
Dubai/CABSAT/ Bilbao, Spain, May 14th 2025 - Qvest, a global leader in media-focused practices and services, and AgileTV, a provider of comprehensive television...
14/05/2025
NEW YORK In another example of how pay TV operators are looking to strengthen their bundled offerings by adding streaming services, Altice USA's Optimum has...
14/05/2025
DALLAS Ad-supported subscription-based streaming services will increase in popularity over the next four years, reaching 278 million viewers by 2029 according t...
14/05/2025
Indie Crusade Epic Explores the Apocalypse with URSA Mini Pro 12K
Brie Clayton May 14, 2025
0 Comments
Blackmagic RAW, DaVinci Resolve Studio and Fusi...
14/05/2025
Q&A: Inside The Philadelphia Orchestra's Video Workflow With EVO
Melanie Ciotti May 14, 2025
0 Comments
Above image: The Philadelphia Orchestra ce...
14/05/2025
This article originally appeared on TV Tech sister brand Radio World....
14/05/2025
TACOMA, Wash. Seattle public broadcaster KBTC Public Television has launched KBTC-VC, a new virtual ATSC 3.0 channel, broadcasting on virtual channel 28-11. Th...
14/05/2025
WASHINGTON The Federal Communications Commission is moving to investigate certain 5G and satellite spectrum held by EchoStar and its satellite-TV subsidiary, Di...
14/05/2025
In an announcement that is likely to widely reverberate through the pay TV industry through the rest of 2025 and beyond, The Walt Disney Co. has unveiled import...
14/05/2025
WASHINGTON The National Association of Broadcasters (NAB) has announced the results of the 2025 NAB Radio and Television Board of Directors elections. The two-y...
14/05/2025
RIYADH, Saudi Arabia & MONTREAL Arabsat, a global satellite operator and the primary provider of satellite services across the Arab world, has announced that it...
14/05/2025
TYSONS, Va., and INDIANAPOLIS Tegna and the Indiana Fever said broadcast stations in 11 additional Midwest markets will join WTHR Indianapolis in airing 18 of t...