NVIDIA Blackwell Raises Bar in New InferenceMAX Benchmarks, Delivering Unmatched Performance and Efficiency

09/10/2025

NVIDIA Blackwell swept the new SemiAnalysis InferenceMAX v1 benchmarks, delivering the highest performance and best overall efficiency.

InferenceMAX v1 is the first independent benchmark to measure total cost of compute across diverse models and real-world scenarios.

Best return on investment: NVIDIA GB200 NVL72 delivers unmatched AI factory economics - a $5 million investment generates $75 million in DSR1 token revenue, a 15x return on investment.

Lowest total cost of ownership: NVIDIA B200 software optimizations achieve two cents per million tokens on gpt-oss, delivering 5x lower cost per token in just 2 months.

Best throughput and interactivity: NVIDIA B200 sets the pace with 60,000 tokens per second per GPU and 1,000 tokens per second per user on gpt-oss with the latest NVIDIA TensorRT-LLM stack.

As AI shifts from one-shot answers to complex reasoning, the demand for inference - and the economics behind it - is exploding.

The new independent InferenceMAX v1 benchmarks are the first to measure total cost of compute across real-world scenarios. The results? The NVIDIA Blackwell platform swept the field - delivering unmatched performance and best overall efficiency for AI factories.

A $5 million investment in an NVIDIA GB200 NVL72 system can generate $75 million in token revenue. That's a 15x return on investment (ROI) - the new economics of inference.
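The arithmetic behind that headline figure can be checked in a few lines. This is a minimal sketch using only the two dollar amounts quoted above; the function name `return_multiple` is illustrative, and the calculation treats the "15x" as revenue divided by investment, which is how the quoted numbers work out.

```python
def return_multiple(revenue_usd: float, investment_usd: float) -> float:
    """Revenue generated per dollar invested (a simple multiple, not net profit)."""
    if investment_usd <= 0:
        raise ValueError("investment must be positive")
    return revenue_usd / investment_usd


# Figures quoted in the article for a GB200 NVL72 system.
investment = 5_000_000
token_revenue = 75_000_000

multiple = return_multiple(token_revenue, investment)
print(f"Return multiple: {multiple:.0f}x")  # prints "Return multiple: 15x"
```

Under the stricter convention where ROI is net gain over cost, the same inputs give (75 - 5) / 5 = 14x, so it helps to know which definition a given figure uses.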

"Inference is where AI delivers value every day," said Ian Buck, vice president of hyperscale and high-performance computing at NVIDIA. "These results show that NVIDIA's full-stack approach gives customers the performance and efficiency they need to deploy AI at scale."

Enter InferenceMAX v1

InferenceMAX v1, a new benchmark from SemiAnalysis released Monday, is the latest to highlight Blackwell's inference leadership. It runs popular models across leading platforms, measures performance for a wide range of use cases and publishes results anyone can verify.

Why do benchmarks like this matter?

Because modern AI isn't just about raw speed - it's about efficiency and economics at scale. As models shift from one-shot replies to multistep reasoning and tool use, they generate far more tokens per query, dramatically increasing compute demands.

NVIDIA's open-source collaborations with OpenAI (gpt-oss 120B), Meta (Llama 3 70B), and DeepSeek AI (DeepSeek R1) highlight how community-driven models are advancing state-of-the-art reasoning and efficiency.

Partnering with these leading model builders and the open-source community, NVIDIA ensures the latest models are optimized for the world's largest AI inference infrastructure. These efforts reflect a broader commitment to open ecosystems - where shared innovation accelerates progress for everyone.

Deep collaborations with the FlashInfer, SGLang and vLLM communities enable codeveloped kernel and runtime enhancements that power these models at scale.

Software Optimizations Deliver Continued Performance Gains

NVIDIA continuously improves performance through hardware and software codesign optimizations. Initial gpt-oss-120b performance on an NVIDIA DGX Blackwell B200 system with the NVIDIA TensorRT LLM library was already market-leading, and NVIDIA's teams and the community have since significantly optimized TensorRT LLM for open-source large language models.

The TensorRT LLM v1.0 release is a major breakthrough in making large AI models faster and more responsive for everyone.

Through advanced parallelization techniques, it uses the B200 system and NVIDIA NVLink Switch's 1,800 GB/s bidirectional bandwidth to dramatically improve the performance of the gpt-oss-120b model.

The innovation doesn't stop there. The newly released gpt-oss-120b-Eagle3-v2 model introduces speculative decoding, a clever method that predicts multiple tokens at a time.

This reduces lag and delivers even quicker results, tripling throughput at 100 tokens per second per user (TPS/user) - boosting per-GPU speeds from 6,000 to 30,000 tokens per second.
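To make the idea of speculative decoding concrete, here is a toy sketch of the generic greedy draft-and-verify loop. It is not the Eagle3 implementation and does not use any TensorRT LLM API: the two "models" are hypothetical stand-in functions over integer tokens, and a real system would verify all drafted positions in one batched forward pass rather than in a Python loop.

```python
from typing import Callable, List, Sequence, Tuple

NextToken = Callable[[Sequence[int]], int]


def speculative_step(context: List[int], draft: NextToken,
                     target: NextToken, k: int) -> List[int]:
    """Run one draft-then-verify round and return the accepted tokens."""
    # 1. The cheap draft model proposes k tokens autoregressively.
    proposed: List[int] = []
    for _ in range(k):
        proposed.append(draft(context + proposed))

    # 2. The target model checks each proposal in order.
    accepted: List[int] = []
    for tok in proposed:
        expected = target(context + accepted)
        if tok != expected:
            # First mismatch: keep the target's token and stop this round.
            accepted.append(expected)
            return accepted
        accepted.append(tok)

    # 3. Every proposal matched, so the target adds one bonus token.
    accepted.append(target(context + accepted))
    return accepted


def generate(prompt: List[int], draft: NextToken, target: NextToken,
             k: int, max_new: int) -> Tuple[List[int], int]:
    """Generate max_new tokens; also report how many verify rounds were needed."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < max_new:
        out.extend(speculative_step(out, draft, target, k))
        rounds += 1
    return out[: len(prompt) + max_new], rounds


# Hypothetical toy models: the target counts upward modulo 10,
# and the draft agrees except right after the token 4.
def toy_target(ctx: Sequence[int]) -> int:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: Sequence[int]) -> int:
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


tokens, rounds = generate([0], toy_draft, toy_target, k=3, max_new=8)
print(tokens, rounds)
```

In this toy run the output matches what the target alone would produce, but it takes 3 verify rounds instead of 8 single-token steps. The gain in practice depends on how often the draft agrees with the target.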

For dense AI models like Llama 3.3 70B, which demand significant computational resources due to their large parameter count and the fact that all parameters are utilized simultaneously during inference, NVIDIA Blackwell B200 sets a new performance standard in InferenceMAX v1 benchmarks.

Blackwell delivers over 10,000 TPS per GPU at 50 TPS per user interactivity - 4x higher per-GPU throughput compared with the NVIDIA H200 GPU.

Performance Efficiency Drives Value

Metrics like tokens per watt, cost per million tokens and TPS/user matter as much as throughput. In fact, for power-limited AI factories, Blackwell delivers 10x throughput per megawatt compared with the previous generation, which translates into higher token revenue.

The cost per token is crucial for evaluating AI model efficiency, directly impacting operational expenses. The NVIDIA Blackwell architecture lowered cost per million tokens by 15x versus the previous generation, leading to substantial savings and fostering wider AI deployment and innovation.
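As a rough illustration of how cost per million tokens relates to throughput, the sketch below divides an hourly GPU cost by the tokens produced in that hour. The hourly price and throughput values are made-up placeholders, not benchmark results or NVIDIA pricing, and the function name is illustrative.

```python
def cost_per_million_tokens(gpu_hour_cost_usd: float,
                            tokens_per_second_per_gpu: float) -> float:
    """Dollars spent per one million generated tokens on a single GPU."""
    if tokens_per_second_per_gpu <= 0:
        raise ValueError("throughput must be positive")
    tokens_per_hour = tokens_per_second_per_gpu * 3600
    return gpu_hour_cost_usd / tokens_per_hour * 1_000_000


# Hypothetical inputs chosen only to show the relationship.
hourly_cost = 3.60          # assumed all-in cost of one GPU-hour, in USD
baseline_tps = 1_000        # assumed tokens per second per GPU
improved_tps = 15_000       # same GPU-hour cost, 15x the throughput

baseline = cost_per_million_tokens(hourly_cost, baseline_tps)
improved = cost_per_million_tokens(hourly_cost, improved_tps)
ratio = baseline / improved

print(f"baseline: ${baseline:.4f} per million tokens")
print(f"improved: ${improved:.4f} per million tokens")
print(f"reduction: {ratio:.1f}x")
```

At a fixed hourly cost, cost per token falls in direct proportion to throughput, which is why software-only speedups on the same hardware show up as lower cost per million tokens.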

Multidimensional Performance

InferenceMAX uses the Pareto frontier - a curve that shows the best trade-offs between different factors, such as data center throughput and responsiveness - to map performance.

But it's more than a chart. It reflects how NVIDIA Blackwell balances the full spectrum of production priorities: cost, energy efficiency, throughput and responsiveness. That balance enables the highest ROI across real-world workloads.

Systems that optimize for just one mode or scenario may show peak performance in isolation, but those economics don't scale. Blackwell's full-stack design delivers efficiency and value where it matters most: in production.
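A Pareto frontier of the kind described above can be computed from a set of measured configurations by discarding every point that another point beats on both axes. The following is a minimal sketch, assuming each configuration is summarized as an (interactivity, throughput) pair where higher is better on both; the sample values are invented for illustration and are not InferenceMAX data.

```python
from typing import List, Tuple

# (TPS per user, TPS per GPU): both axes are "higher is better".
Point = Tuple[float, float]


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the non-dominated points, sorted by increasing interactivity."""
    # Visit points from most to least interactive; ties put higher throughput first.
    ordered = sorted(points, key=lambda p: (-p[0], -p[1]))
    frontier: List[Point] = []
    best_throughput = float("-inf")
    for interactivity, throughput in ordered:
        # Keep a point only if it beats every more-interactive point on throughput.
        if throughput > best_throughput:
            frontier.append((interactivity, throughput))
            best_throughput = throughput
    return sorted(frontier)


# Hypothetical configurations (for example, different batch sizes or parallelism settings).
configs = [(10, 900), (20, 700), (20, 500), (40, 300), (30, 250), (80, 100)]

for interactivity, throughput in pareto_frontier(configs):
    print(f"{interactivity:>4} TPS/user -> {throughput:>4} TPS/GPU")
```

Here (20, 500) and (30, 250) drop out because other configurations match or exceed them on both axes. The remaining four points trace the trade-off curve between responsiveness and per-GPU throughput.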

For a deeper look at how these curves are built - and why they matter for total cost of ownership and service-level agreement planning - check out this technical deep dive.
LINK: https://blogs.nvidia.com/blog/blackwell-inferencemax-benchmark-results...