Sony Pixel Power calrec Sony

New Data Shows NVIDIA Blackwell Ultra Delivers up to 50x Better Performance and 35x Lower Costs for Agentic AI

16/02/2026

The NVIDIA Blackwell platform has been widely adopted by leading inference providers such as Baseten, DeepInfra, Fireworks AI and Together AI to reduce cost per token by up to 10x. Now, the NVIDIA Blackwell Ultra platform is taking this momentum further for agentic AI.

AI agents and coding assistants are driving explosive growth in software-programming-related AI queries: from 11% to about 50% last year, according to OpenRouter's State of Inference report. These applications require low latency to maintain real-time responsiveness across multistep workflows and long context when reasoning across entire codebases.

New performance data shows that the combination of NVIDIA's software optimizations and the next-generation NVIDIA Blackwell Ultra platform has delivered breakthrough advances on both fronts. NVIDIA GB300 NVL72 systems now deliver up to 50x higher throughput per megawatt, resulting in 35x lower cost per token compared with the NVIDIA Hopper platform.

By innovating across chips, system architecture and software, NVIDIA's extreme codesign accelerates performance across AI workloads - from agentic coding to interactive coding assistants - while driving down costs at scale.

GB300 NVL72 Delivers up to 50x Better Performance for Low-Latency Workloads Recent analysis from Signal65 shows that NVIDIA GB200 NVL72 with extreme hardware and software codesign delivers more than 10x more tokens per watt, resulting in one-tenth the cost per token compared with the NVIDIA Hopper platform. These massive performance gains continue to expand as the underlying stack improves.

Continuous optimizations from the NVIDIA TensorRT-LLM, NVIDIA Dynamo, Mooncake and SGLang teams continue to significantly boost Blackwell NVL72 throughput for mixture-of-experts (MoE) inference across all latency targets. For instance, NVIDIA TensorRT-LLM library improvements have delivered up to 5x better performance on GB200 for low-latency workloads compared with just four months ago.

Higher-performance GPU kernels optimized for efficiency and low latency help make the most of Blackwell's immense compute capabilities and boost throughput.

NVIDIA NVLink Symmetric Memory enables direct GPU-to-GPU memory access for more efficient communication.

Programmatic dependent launch minimizes idle time by launching the next kernel's setup phase before the previous one completes.

Building on these software advances, GB300 NVL72 - which features the Blackwell Ultra GPU - pushes the throughput-per-megawatt frontier to 50x compared with the Hopper platform.

NVIDIA GB300 NVL72 and the codesigned software stack with NVIDIA Dynamo and TensorRT-LLM deliver over 50x performance per watt compared with the NVIDIA Hopper platform. This performance gain translates into superior economics, with NVIDIA GB300 lowering costs compared with the Hopper platform across the entire latency spectrum. The most dramatic reduction occurs at low latency, where agentic applications operate: up to 35x lower cost per million tokens compared with the Hopper platform.

NVIDIA GB300 NVL72 and the codesigned software stack including NVIDIA Dynamo and TensorRT-LLM deliver 35x lower cost per token compared with NVIDIA Hopper platform. For agentic coding and interactive assistants workloads where every millisecond compounds across multistep workflows, this combination of relentless software optimization and next-generation hardware enables AI platforms to scale real-time interactive experiences to significantly more users.

GB300 NVL72 Delivers Superior Economics for Long-Context Workloads While both GB200 NVL72 and GB300 NVL72 efficiently deliver ultralow latency, the distinct advantages of GB300 NVL72 become most apparent in long-context scenarios. For workloads with 128,000-token inputs and 8,000-token outputs - such as AI coding assistants reasoning across codebases - GB300 NVL72 delivers up to 1.5x lower cost per token compared with GB200 NVL72.

NVIDIA GB300 NVL72 is ideal for low-latency, long-context workloads. Context grows as the agent reads in more of the code. This allows it to better understand the code base but also requires much more compete. Blackwell Ultra has 1.5x higher NVFP4 compute performance and 2x faster attention processing, enabling the agent to efficiently understand entire code bases.

Infrastructure for Agentic AI Leading cloud providers and AI innovators have already deployed NVIDIA GB200 NVL72 at scale, and are also deploying GB300 NVL72 in production. Microsoft, CoreWeave and OCI are deploying GB300 NVL72 for low-latency and long-context use cases such as agentic coding and coding assistants. By reducing token costs, GB300 NVL72 enables a new class of applications that can reason across massive codebases in real time.

As inference moves to the center of AI production, long-context performance and token efficiency become critical, said Chen Goldberg, senior vice president of engineering at CoreWeave. Grace Blackwell NVL72 addresses that challenge directly, and CoreWeave's AI cloud, including CKS and SUNK, is designed to translate GB300 systems' gains, building on the success of GB200, into predictable performance and cost efficiency. The result is better token economics and more usable inference for customers running workloads at scale.

NVIDIA Vera Rubin NVL72 to Bring Next-Generation Performance With NVIDIA Blackwell systems deployed at scale, continuous software optimizations will keep unlocking additional performance and cost improvements across the installed base.

Looking ahead, the NVIDIA Rubin platform - which combines six new chips to create one AI supercomputer - is set to deliver another round of massive performance leaps. For MoE inference, it delivers up to 10x higher throughput per megawatt compared with Blackwell, translating into one-tenth the cost per million tokens. And for the next wa
LINK: https://blogs.nvidia.com/blog/data-blackwell-ultra-performance-lower-c...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

16/02/2026

Content Vault Expands Device Specific File Encryption to...

Content Vault, the patent-pending content security platform originally developed for the film, television and entertainment industries, has today announced a ma...

16/02/2026

RT announces the Appointments of new Clarity Correspondent and Policy & Analysis Correspondent

RT News & Current Affairs has today announced the new appointments of journalis...

16/02/2026

New Data Shows NVIDIA Blackwell Ultra Delivers up to 50x Better Performance and 35x Lower Costs for Agentic AI

The NVIDIA Blackwell platform has been widely adopted by leading inference provi...

15/02/2026

Live From NBA All-Star 2026: Entertainment Takes the Court in a Big Way

With new partnership between the league and NBC, workflows distinguish more between live, broadcast sound There'll be a lot new for the 75th NBA All-Star W...

15/02/2026

Live From NBA All-Star 2026: NBC Sports Director Pierre Moossa Previews NBC's Return to the Event

After 24-year absence, NBC Sports returns to NBA All-Star Weekend with unique ca...

15/02/2026

Live From NBA All-Star 2026: Peacock, NBC Sports Offer Viewers a Front-Row Seat With Courtside Live'

New to NBA coverage, the viewer experience offers several angles in addition to ...

15/02/2026

Live From NBA All-Star 2026: NBC Sports Returns With Plenty of Tech Toys in Tow

Coverage features 4X-slo-mo Supracam and Steadicam, Nucleus 4K cameras, closer play-by-play angle, 10 player mics NBC Sports is in the midst of its first NBA A...

14/02/2026

Cineverse Acquires TV Monetization Platform IndiCue

Share Copy link Facebook X Linkedin Bluesky Email...

14/02/2026

ESPN's Audiences for College Basketball On Track for Major Growth

Share Copy link Facebook X Linkedin Bluesky Email...

14/02/2026

TCL Display Technologies Deployed at Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

14/02/2026

Boston Conservatory Orchestra Helps Peter and Leonardo Dugan Complete Their Dream Piece

Boston Conservatory Orchestra Helps Peter and Leonardo Dugan Complete Their Dre...

13/02/2026

OBS Accelerates Shift to Cloud

Olympic Broadcasting Services (OBS) has provided an update on its adoption of the cloud as it continues on its journey to fully migrate to IT-based systems by 2...

13/02/2026

France Tlvisions Launches France 2 UHD with Dolby Vision and Dolby Atmos to Max out AC-4 Experiences for Winter Olympics Fans

France T l visions has successfully launched France 2 UHD featuring Dolby Vision...

13/02/2026

OBS Expands Athlete Moment,' Family Reunions to Capture Human Side of Winter Games

Partnering with Worldwide Olympic Partner TCL, OBS deploys connected Athlete Mom...

13/02/2026

Men's Figure Skating Photo Gallery

The men's figure skating long-form program is tonight, and it promises to be an exciting night for fans in the stands, fans at home, and even the production...

13/02/2026

Entertainment Takes the NBA Court in a Big Way

With new partnership between the league and NBC, workflows distinguish more between live, broadcast sound There'll be a lot new for the 75th NBA All-Star W...

13/02/2026

SVG GameDay, Episode 3: Sean Tabler - Producing Hockey in the City of Angels

In-venue and creative video staffers at the professional and collegiate level have one major thing in common: the intensity and attention to detail ramps up dur...

13/02/2026

Teradek Introduces RF-X, Revolutionizing Mission-Critical Signal Redundancy

Teradek announces the launch of RF-X Auto Switcher, a revolutionary appliance designed to deliver flawless, uncompromised signal integrity for the world's m...

13/02/2026

Synamedia & Globecast Selected for FA Cup Cloud Distribution

Globecast and Synamedia announces that Pitch International (Pitch), the leading London-based sports marketing agency, has gone live with cloud-based distributi...

13/02/2026

Ratings Roundup: NBC Sports' Legendary February Hits Record Viewership Levels

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

13/02/2026

NBC Olympics' Amy Rosenfeld on the Drone Craze, Friends & Family Moments, Stamford's Role for Milano Cortina

Far from the action in the snow and on the ice, the team controls the production...

13/02/2026

2026 Daytona 500: FOX Sports' Mike Davies, George Grill on Working Within an IP-Based Compound, Solving the Ops Puzzle of the Super Bowl of Racing

The Daytona 500 is called The Super Bowl of Racing for a reason. Whether it's the culmination to five days of action on the track, the sheer size and scop...

13/02/2026

OBS Expands AI-Powered Content Workflows

For the Milano Cortina Games, Olympic Broadcasting Services (OBS) is delivering more than 6,500 hours of content, with more than 900 hours of live action, sprea...

13/02/2026

NBC Sports Director Pierre Moossa Previews NBC's First NBA All-Star Production in 24 Years

After 24-year absence, NBC Sports returns to NBA All-Star Weekend with unique ca...

13/02/2026

Film Festival Watch: 18 Sundance Institute-Supported Projects To Watch at the 2026 Berlin International Film Festival

By Jessica Herndon We may have just wrapped an unforgettable 2026 Sundance Film...

13/02/2026

Give Me the Backstory: Get to Know Amanda Kramer, the Writer-Director Behind By Design

By Jessica Herndon One of the most exciting things about the Sundance Film Fest...

13/02/2026

Women in Podcasting Craft New Connections at Spotify's Galentine's Day Celebration in LA

This Wednesday in Los Angeles, Spotify brought together a group of podcast creat...

13/02/2026

Spotify and LoveShackFancy Bring Galentine's Glam to NYC, Featuring Special Performance by Joshua Basset

Yesterday, Spotify and LoveShackFancy hosted a Galentine's and Gents Lunch a...

13/02/2026

L3Harris Successfully Completes First Phase of P25 Transition for Florida SLERS

The upgrade to a Project 25 network provides state agencies communicating on the Statewide Law Enforcement Radio System flexibility to tailor the network to the...

13/02/2026

Riedel Opens Kuala Lumpur Office to Strengthen Global 24...

Riedel Communications has officially opened a new office in Kuala Lumpur, Malaysia, marking a strategic expansion of its global Customer Success and IT software...

13/02/2026

ES Broadcast Hire duo celebrate 10-year anniversary with...

Two of ES Broadcast Hire's longest-serving employees recently celebrated a decade working for the company. Annie Breislin, Operations Manager, and Charles ...

13/02/2026

Disguise Opens Experience Center and Office in Atlanta

Disguise, the award-winning technology company powering global experiences, today unveils a new 8,000-square-foot office and Experience Center in Atlanta, creat...

13/02/2026

Mavis Expands External Camera Support with Accsoon SeeMo...

At BSC Expo 2026, Mavis announced full support for the Accsoon SeeMo series of iOS camera adapters across Mavis Camera and Mavis Monitor apps. This new integrat...

13/02/2026

Butcher Bird Studios Keeps Signals Flowing Seamlessly Acr...

Executing technically ambitious live streams, virtual productions, and immersive media today requires talent, creativity, and the right supporting technology. L...

13/02/2026

LTN makes key appointments and introduces new Technology...

Michal Miskin-Amir, Jonathan Stanton and Bobby Bond to lead technical advances amid surge in demand for LTN's IP video transport services as satellite capac...

13/02/2026

NATO Upgrades Broadcast Studio with Grass Valley Cameras

Grass Valley, the pioneering media and entertainment technology innovator, has won a competitive NATO-wide tender to provide the new camera system for NATO'...

13/02/2026

Digital Azul strengthens remote production strategy with...

Wireless IP intercom underpins agile, multi-location live production workflows Digital Azul, the independent production powerhouse specialising in complex liv...

13/02/2026

Actus Digital Sets a New Standard for QA Monitoring and C...

Actus Digital, a LiveU company, will unveil major new enhancements to its Actus X Intelligent Monitoring Platform at NAB Show (LiveU booth N1740), reinforcing i...

13/02/2026

FA Cup goes IP with Pitch International plus Synamedia an...

Globecast, a worldwide leader in broadcast services, and leading video software provider, Synamedia, today announced that Pitch International (Pitch), the leadi...

13/02/2026

Rai Selects Imagine Selenio Network Processor for IP Migration

Share Copy link Facebook X Linkedin Bluesky Email...

13/02/2026

CIMM Details Research Plans for 2026 and New Board Appointments

Share Copy link Facebook X Linkedin Bluesky Email...

13/02/2026

Teradek Unveils RF-A Auto Switcher

Share Copy link Facebook X Linkedin Bluesky Email...

13/02/2026

Spectrum Launches 'Invincible Wifi'

Share Copy link Facebook X Linkedin Bluesky Email...

13/02/2026

Actus Digital to Introduce Actus X Platform Enhancements At NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

13/02/2026

Sennheiser Wireless Spectera Solution Tackles Super Bowl LX With Ease

Share Copy link Facebook X Linkedin Bluesky Email...