NVIDIA Blackwell Raises Bar in New InferenceMAX Benchmarks, Delivering Unmatched Performance and Efficiency

09/10/2025

NVIDIA Blackwell swept the new SemiAnalysis InferenceMAX v1 benchmarks, delivering the highest performance and best overall efficiency.

InferenceMAX v1 is the first independent benchmark to measure total cost of compute across diverse models and real-world scenarios.

Best return on investment: NVIDIA GB200 NVL72 delivers unmatched AI factory economics - a $5 million investment generates $75 million in DSR1 token revenue, a 15x return on investment.

Lowest total cost of ownership: NVIDIA B200 software optimizations achieve two cents per million tokens on gpt-oss, delivering 5x lower cost per token in just 2 months.

Best throughput and interactivity: NVIDIA B200 sets the pace with 60,000 tokens per second per GPU and 1,000 tokens per second per user on gpt-oss with the latest NVIDIA TensorRT-LLM stack.

As AI shifts from one-shot answers to complex reasoning, the demand for inference - and the economics behind it - is exploding.

The new independent InferenceMAX v1 benchmarks are the first to measure total cost of compute across real-world scenarios. The results? The NVIDIA Blackwell platform swept the field - delivering unmatched performance and best overall efficiency for AI factories.

A $5 million investment in an NVIDIA GB200 NVL72 system can generate $75 million in token revenue. That's a 15x return on investment (ROI) - the new economics of inference.
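The 15x figure above is the ratio of token revenue to the up-front investment. A minimal sketch of that arithmetic, using only the two dollar amounts quoted in the article (the function name `roi_multiple` is illustrative, not part of the benchmark):

```python
def roi_multiple(investment_usd: float, revenue_usd: float) -> float:
    """Return revenue expressed as a multiple of the up-front investment."""
    if investment_usd <= 0:
        raise ValueError("investment must be positive")
    return revenue_usd / investment_usd


# Figures quoted in the article for an NVIDIA GB200 NVL72 system.
multiple = roi_multiple(investment_usd=5_000_000, revenue_usd=75_000_000)
print(f"{multiple:.0f}x")  # prints "15x"
```

Note that this is a revenue-to-investment multiple, which is how the article uses the term; it does not subtract the investment or account for operating costs.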

"Inference is where AI delivers value every day," said Ian Buck, vice president of hyperscale and high-performance computing at NVIDIA. "These results show that NVIDIA's full-stack approach gives customers the performance and efficiency they need to deploy AI at scale."

Enter InferenceMAX v1

InferenceMAX v1, a new benchmark from SemiAnalysis released Monday, is the latest to highlight Blackwell's inference leadership. It runs popular models across leading platforms, measures performance for a wide range of use cases and publishes results anyone can verify.

Why do benchmarks like this matter?

Because modern AI isn't just about raw speed - it's about efficiency and economics at scale. As models shift from one-shot replies to multistep reasoning and tool use, they generate far more tokens per query, dramatically increasing compute demands.

NVIDIA's open-source collaborations with OpenAI (gpt-oss 120B), Meta (Llama 3 70B), and DeepSeek AI (DeepSeek R1) highlight how community-driven models are advancing state-of-the-art reasoning and efficiency.

Partnering with these leading model builders and the open-source community, NVIDIA ensures the latest models are optimized for the world's largest AI inference infrastructure. These efforts reflect a broader commitment to open ecosystems - where shared innovation accelerates progress for everyone.

Deep collaborations with the FlashInfer, SGLang and vLLM communities enable codeveloped kernel and runtime enhancements that power these models at scale.

Software Optimizations Deliver Continued Performance Gains

NVIDIA continuously improves performance through hardware and software codesign optimizations. Initial gpt-oss-120b performance on an NVIDIA DGX Blackwell B200 system with the NVIDIA TensorRT LLM library was market-leading, but NVIDIA's teams and the community have significantly optimized TensorRT LLM for open-source large language models.

The TensorRT LLM v1.0 release is a major breakthrough in making large AI models faster and more responsive for everyone.

Through advanced parallelization techniques, it uses the B200 system and NVIDIA NVLink Switch's 1,800 GB/s bidirectional bandwidth to dramatically improve the performance of the gpt-oss-120b model.

The innovation doesn't stop there. The newly released gpt-oss-120b-Eagle3-v2 model introduces speculative decoding, a clever method that predicts multiple tokens at a time.

This reduces lag and delivers even quicker results, tripling throughput at 100 tokens per second per user (TPS/user) - boosting per-GPU speeds from 6,000 to 30,000 tokens.
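To illustrate the general idea of speculative decoding, here is a toy sketch of one greedy draft-and-verify step. It is not the EAGLE-3 method or the TensorRT LLM implementation: the two "models" are hypothetical stand-in functions over integer tokens, and the verification loop stands in for what a real system would do in a single batched forward pass of the large model.

```python
from typing import Callable, List

Token = int
NextToken = Callable[[List[Token]], Token]


def speculative_step(prefix: List[Token], draft_next: NextToken,
                     target_next: NextToken, k: int) -> List[Token]:
    """Run one draft-and-verify step and return the newly accepted tokens."""
    # 1. The cheap draft model proposes k tokens autoregressively.
    proposed: List[Token] = []
    ctx = list(prefix)
    for _ in range(k):
        token = draft_next(ctx)
        proposed.append(token)
        ctx.append(token)

    # 2. The target model checks each proposed position in order.
    accepted: List[Token] = []
    ctx = list(prefix)
    for token in proposed:
        expected = target_next(ctx)
        if token != expected:
            # First mismatch: keep the target's token and stop.
            accepted.append(expected)
            return accepted
        accepted.append(token)
        ctx.append(token)

    # 3. Every draft token matched, so the target adds one extra token.
    accepted.append(target_next(ctx))
    return accepted


# Hypothetical toy models: the target counts upward modulo 10;
# the draft agrees except right after token 4.
def target_next(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10


def draft_next(ctx: List[Token]) -> Token:
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


print(speculative_step([1], draft_next, target_next, k=4))  # prints [2, 3, 4, 5]
```

In this toy run a single verification step yields four tokens instead of one, and the output is identical to what the target model alone would have produced. That is the property that lets speculative decoding raise throughput without changing results under greedy decoding.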

For dense AI models like Llama 3.3 70B, which demand significant computational resources due to their large parameter count and the fact that all parameters are utilized simultaneously during inference, NVIDIA Blackwell B200 sets a new performance standard in InferenceMAX v1 benchmarks.

Blackwell delivers over 10,000 TPS per GPU at 50 TPS per user interactivity - 4x higher per-GPU throughput compared with the NVIDIA H200 GPU.

Performance Efficiency Drives Value

Metrics like tokens per watt, cost per million tokens and TPS/user matter as much as throughput. In fact, for power-limited AI factories, Blackwell delivers 10x throughput per megawatt compared with the previous generation, which translates into higher token revenue.

The cost per token is crucial for evaluating AI model efficiency, directly impacting operational expenses. The NVIDIA Blackwell architecture lowered cost per million tokens by 15x versus the previous generation, leading to substantial savings and fostering wider AI deployment and innovation.
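As a rough illustration of how cost per million tokens relates to throughput, the sketch below divides an hourly GPU cost by the tokens produced in that hour. The hourly price and the throughput values are hypothetical placeholders, not InferenceMAX results, and the function name is illustrative.

```python
def cost_per_million_tokens(gpu_cost_per_hour: float,
                            tokens_per_second: float) -> float:
    """Cost in dollars to generate one million tokens on one GPU."""
    if tokens_per_second <= 0:
        raise ValueError("throughput must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return gpu_cost_per_hour / tokens_per_hour * 1_000_000


# Hypothetical inputs: a GPU costing $3.60 per hour, before and after
# a software update that raises throughput 5x at the same hourly cost.
baseline = cost_per_million_tokens(3.60, 10_000)
optimized = cost_per_million_tokens(3.60, 50_000)
print(f"${baseline:.2f} -> ${optimized:.2f} per million tokens")
# prints "$0.10 -> $0.02 per million tokens"
```

With the hourly cost held fixed, cost per token falls in direct proportion to the throughput gain, which is why software optimizations alone can lower it.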

Multidimensional Performance

InferenceMAX uses the Pareto frontier - a curve that shows the best trade-offs between different factors, such as data center throughput and responsiveness - to map performance.
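For readers unfamiliar with the term, a Pareto frontier can be computed from a set of measured operating points by discarding every point that another point beats on both axes. The sketch below assumes each point is a (TPS per user, TPS per GPU) pair where higher is better on both; the sample data is hypothetical and not taken from the benchmark.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (tps_per_user, tps_per_gpu), higher is better


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the points not dominated on both interactivity and throughput."""
    # Visit points from most to least interactive; ties put higher throughput first.
    ordered = sorted(points, key=lambda p: (-p[0], -p[1]))
    frontier: List[Point] = []
    best_throughput = float("-inf")
    for point in ordered:
        # Keep a point only if it beats the throughput of every point
        # that is at least as interactive.
        if point[1] > best_throughput:
            frontier.append(point)
            best_throughput = point[1]
    return frontier


# Hypothetical operating points for one GPU configuration sweep.
samples = [(20, 9000), (50, 6000), (50, 4000),
           (100, 3000), (80, 2500), (200, 800)]
print(pareto_frontier(samples))
# prints [(200, 800), (100, 3000), (50, 6000), (20, 9000)]
```

The points (50, 4000) and (80, 2500) are dropped because another configuration offers at least the same interactivity with more throughput. The remaining points trace the trade-off curve.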

But it's more than a chart. It reflects how NVIDIA Blackwell balances the full spectrum of production priorities: cost, energy efficiency, throughput and responsiveness. That balance enables the highest ROI across real-world workloads.

Systems that optimize for just one mode or scenario may show peak performance in isolation, but the economics of that approach don't scale. Blackwell's full-stack design delivers efficiency and value where it matters most: in production.

For a deeper look at how these curves are built - and why they matter for total cost of ownership and service-level agreement planning - check out this technical deep dive.
LINK: https://blogs.nvidia.com/blog/blackwell-inferencemax-benchmark-results...