
Dramatic gains in hardware performance have spawned generative AI, and a rich pipeline of ideas for future speedups will drive machine learning to new heights, Bill Dally, NVIDIA's chief scientist and senior vice president of research, said today in a keynote.
Dally described a basket of techniques in the works - some already showing impressive results - in a talk at Hot Chips, an annual event for processor and systems architects.
"The progress in AI has been enormous, it's been enabled by hardware and it's still gated by deep learning hardware," said Dally, one of the world's foremost computer scientists and former chair of Stanford University's computer science department.
He showed, for example, how ChatGPT, the large language model (LLM) used by millions, could suggest an outline for his talk. Such capabilities owe their prescience in large part to gains from GPUs in AI inference performance over the last decade, he said.
Gains in single-GPU performance are just part of a larger story that includes million-x advances in scaling to data-center-sized supercomputers.
Research Delivers 100 TOPS/Watt
Researchers are readying the next wave of advances. Dally described a test chip that demonstrated nearly 100 tera operations per watt on an LLM.
The experiment showed an energy-efficient way to further accelerate the transformer models used in generative AI. It applied four-bit arithmetic, one of several simplified numeric approaches that promise future gains.
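To make the idea of four-bit arithmetic concrete, here is a minimal sketch in Python. It assumes simple symmetric per-tensor quantization to signed integers in the range -8 to 7; the article does not say which scheme the test chip uses, so the function names and the scaling rule are illustrative assumptions rather than NVIDIA's method.

```python
def quantize_int4(values):
    """Map floats to signed 4-bit integers in [-8, 7] with one shared scale.

    Assumption: symmetric per-tensor scaling, chosen only for illustration.
    """
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    quantized = [max(-8, min(7, round(v / scale))) for v in values]
    return quantized, scale


def int4_dot(qa, scale_a, qb, scale_b):
    """Dot product done with integer multiply-accumulate, rescaled once at the end."""
    acc = sum(x * y for x, y in zip(qa, qb))
    return acc * scale_a * scale_b


weights = [0.7, -0.32, 0.1, 0.0]
activations = [1.4, 0.6, -0.8, 0.2]

qw, sw = quantize_int4(weights)
qa, sa = quantize_int4(activations)

approx = int4_dot(qw, sw, qa, sa)
exact = sum(w * a for w, a in zip(weights, activations))
print(f"exact={exact:.4f} int4={approx:.4f}")
```

The inner loop touches only small integers, which is where the energy saving comes from; the cost is a small rounding error relative to the floating-point result.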
Looking further out, Dally discussed ways to speed calculations and save energy using logarithmic math, an approach NVIDIA detailed in a 2021 patent.
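The appeal of logarithmic math is that multiplication becomes addition once values are stored as logarithms. The sketch below illustrates that general idea with a textbook-style log number system in Python; it is not the design from the patent, and `FRAC_BITS` and the function names are hypothetical choices made for the example.

```python
import math

FRAC_BITS = 3  # hypothetical: log2 magnitude stored with 3 fractional bits


def to_log(x):
    """Encode a nonzero real as (sign, quantized log2 of its magnitude)."""
    if x == 0:
        raise ValueError("zero needs a special encoding in a log number system")
    sign = -1 if x < 0 else 1
    exponent = round(math.log2(abs(x)) * (1 << FRAC_BITS))
    return sign, exponent


def log_mul(a, b):
    """Multiply two encoded values: multiply the signs, add the integer exponents."""
    return a[0] * b[0], a[1] + b[1]


def from_log(v):
    """Decode (sign, exponent) back to a float."""
    sign, exponent = v
    return sign * 2.0 ** (exponent / (1 << FRAC_BITS))


product = from_log(log_mul(to_log(3.0), to_log(-5.0)))
print(product)  # close to -15, with error from quantizing the logarithms
```

The multiply reduces to one integer add, which is far cheaper in hardware than a multiplier. The trade-off is that addition of two encoded values is no longer trivial, so a practical design has to handle accumulation separately.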
Tailoring Hardware for AI
He explored a half dozen other techniques for tailoring hardware to specific AI tasks, often by defining new data types or operations.
Dally described ways to simplify neural networks, pruning synapses and neurons in an approach called structural sparsity, first adopted in NVIDIA A100 Tensor Core GPUs.
"We're not done with sparsity," he said. "We need to do something with activations and can have greater sparsity in weights as well."
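As a rough illustration of weight pruning under a structured pattern, the Python sketch below keeps the two largest-magnitude weights in every group of four and zeroes the rest. The 2-of-4 pattern is an assumption based on how NVIDIA publicly describes sparsity support in A100; the article itself names only "structural sparsity", and `prune_2_of_4` is a hypothetical helper, not a library function.

```python
def prune_2_of_4(weights):
    """Zero the two smallest-magnitude weights in each consecutive group of four.

    Assumption: a 2-of-4 pattern, so every group keeps exactly two nonzero slots.
    """
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    pruned = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two entries with the largest absolute value.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


row = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.1]
sparse_row = prune_2_of_4(row)
print(sparse_row)
```

Because the zeros fall in a fixed, predictable pattern, hardware can skip them without the bookkeeping that unstructured sparsity requires. In practice a pruned network is usually fine-tuned afterward to recover accuracy.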
Researchers need to design hardware and software in tandem, making careful decisions on where to spend precious energy, he said. Memory and communications circuits, for instance, need to minimize data movements.
"It's a fun time to be a computer engineer because we're enabling this huge revolution in AI, and we haven't even fully realized yet how big a revolution it will be," Dally said.
More Flexible Networks
In a separate talk, Kevin Deierling, NVIDIA's vice president of networking, described the unique flexibility of NVIDIA BlueField DPUs and NVIDIA Spectrum networking switches for allocating resources based on changing network traffic or user rules.
The chips' ability to dynamically shift hardware acceleration pipelines in seconds enables load balancing with maximum throughput and gives core networks a new level of adaptability. That's especially useful for defending against cybersecurity threats.
"Today with generative AI workloads and cybersecurity, everything is dynamic, things are changing constantly," Deierling said. "So we're moving to runtime programmability and resources we can change on the fly."
In addition, NVIDIA and Rice University researchers are developing ways users can take advantage of the runtime flexibility using the popular P4 programming language.
Grace Leads Server CPUs
A talk by Arm on its Neoverse V2 cores included an update on the performance of the NVIDIA Grace CPU Superchip, the first processor implementing them.
Tests show that, at the same power, Grace systems deliver up to 2x more throughput than current x86 servers across a variety of CPU workloads. In addition, Arm's SystemReady Program certifies that Grace systems will run existing Arm operating systems, containers and applications with no modification.
Grace gives data center operators a choice to deliver more performance or use less power. Grace uses an ultra-fast fabric to connect 72 Arm Neoverse V2 cores in a single die, then a version of NVLink connects two of those dies in a package, delivering 900 GB/s of bandwidth. It's the first data center CPU to use server-class LPDDR5X memory, delivering 50% more memory bandwidth at similar cost but one-eighth the power of typical server memory.
Hot Chips kicked off Aug. 27 with a full day of tutorials, including talks from NVIDIA experts on AI inference and protocols for chip-to-chip interconnects, and runs through today.