
It can take a puppy weeks to learn that certain kinds of behaviors will result in a yummy treat, extra cuddles or a belly rub - and that other behaviors won't. With a system of positive reinforcement, a pet pooch will in time anticipate that chasing squirrels is less likely to be rewarded than staying by their human's side.
Deep reinforcement learning, a technique used to train AI models for robotics and complex strategy problems, works off the same principle.
In reinforcement learning, a software agent interacts with a real or virtual environment, relying on feedback from rewards to learn the best way to achieve its goal. Like the brain of a puppy in training, a reinforcement learning model uses information it's observed about the environment and its rewards, and determines which action the agent should take next.
To date, most researchers have relied on a combination of CPUs and GPUs to run reinforcement learning models. This means different parts of the computer tackle different steps of the process - including simulating the environment, calculating rewards, choosing what action to take next, actually taking action, and then learning from the experience.
But switching back and forth between CPU cores and powerful GPUs is by nature inefficient, requiring data to be transferred from one part of the system's memory to another at multiple points during the reinforcement learning training process. It's like a student who has to carry a tall stack of books and notes from classroom to classroom, plus the library, before grasping a new concept.
With Isaac Gym, NVIDIA developers have made it possible to instead run the entire reinforcement learning pipeline on GPUs - enabling significant speedups and reducing the hardware resources needed to develop these models.
Here's what this breakthrough means for the deep reinforcement learning process, and how much acceleration it can bring developers.
Reinforcement Learning on GPUs: Simulation to Action When training a reinforcement learning model for a robotics task - like a humanoid robot that walks up and down stairs - it's much faster, safer and easier to use a simulated environment than the physical world. In a simulation, developers can create a sea of virtual robots that can quickly rack up thousands of hours of experience at a task.
If tested solely in the real world, a robot in training could fall down, bump into or mishandle objects - causing potential damage to its own machinery, the object it's interacting with or its surroundings. Testing in simulation provides the reinforcement learning model a space to practice and work out the kinks, giving it a head start when shifting to the real world.
In a typical system today, the NVIDIA PhysX simulation engine runs this experience-gathering phase of the reinforcement learning process on NVIDIA GPUs. But for other steps of the training application, developers have traditionally still used CPUs.
Traditional deep reinforcement learning uses a combination of CPU and GPU computing resources, requiring significant data transfers back and forth. A key part of reinforcement learning training is conducting what's known as the forward pass: First, the system simulates the environment, records a set of observations about the state of the world and calculates a reward for how well the agent did.
The recorded observations become the input to a deep learning policy network, which chooses an action for the agent to take. Both the observations and the rewards are stored for use later in the training cycle.
Finally, the action is sent back to the simulator so that the rest of the environment can be updated in response.
After several rounds of these forward passes, the reinforcement learning model takes a look back, evaluating whether the actions it chose were effective or not. This information is used to update the policy network, and the cycle begins again with the improved model.
GPU Acceleration with Isaac Gym To eliminate the overhead of transferring data back and forth from CPU to GPU during this reinforcement learning training cycle, NVIDIA researchers have developed an approach to run every step of the process on GPUs. This is Isaac Gym, an end-to-end training environment, which includes the PhysX simulation engine and a PyTorch tensor-based API.
Isaac Gym makes it possible for a developer to run tens of thousands of environments simultaneously on a single GPU. That means experiments that previously required a data center with thousands of CPU cores can in some cases be trained on a single workstation.
NVIDIA Isaac Gym runs entire reinforcement learning pipelines on GPUs, enabling significant speedups. Decreasing the amount of hardware required makes reinforcement learning more accessible to individual researchers who don't have access to large data center resources. It can also make the process a lot faster.
A simple reinforcement learning model tasked with getting a humanoid robot to walk can be trained in just a few minutes with Isaac Gym. But the impact of end-to-end GPU acceleration is most useful for more challenging tasks, like teaching a complex robot hand to manipulate a cube into a specific position.
This problem requires significant dexterity by the robot, and a simulation environment that involves domain randomization, a mechanism that allows the learned policy to more easily transfer to a real-world robot.
Research by OpenAI tackled this task with a cluster of more than 6,000 CPU cores plus multiple NVIDIA Tensor Core GPUs - and required about 30 hours of training for the reinforcement learning model to succeed at the task 20 times in a row using a feed-forward network model.
Using just one NVIDIA A100 GPU with Isaac Gym, NVIDIA developers were able to achieve the same level of success in around 10 hours - a single GPU outperforming
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
10/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/03/2026
NVIDIA and Thinking Machines Lab announced today a multiyear strategic partnersh...
09/03/2026
Foos Gone Wild and Combate Global have teamed up to create a twist on combat sports competition, announcing the launch of a special amateur Mixed Martial Arts (...
09/03/2026
At the 2026 NAB Show, Harmonic will introduce significant enhancements to its video appliances and SaaS solutions, highlighted by a next-generation media server...
09/03/2026
ESPN's March 3 spring training matchup between Team USA and the San Francisc...
09/03/2026
Most Valuable Promotions (MVP) announces the launch of MVPW, a new global platfo...
09/03/2026
Behind The Mic provides a roundup of recent news regarding on-air talent, includ...
09/03/2026
From Super Bowl compounds to Final Four setups, the Hofstra graduate helps coord...
09/03/2026
Stamford plays a key role, but a small team in Cortina and Milan powers local presence and mixed-zone coverage...
09/03/2026
The event brings together SVG's previous Cloud Production and Content Management Forums into a single, comprehensive day of programming...
09/03/2026
Updated Mar 9, 2026
Live surround sound has been a part of the plan for Roman a...
09/03/2026
Contains all six dual-ensemble libraries
VSL's Duality Strings series offers an intriguing alternative to your average string library, capturing two str...
09/03/2026
Outstanding Contribution To UK Music
Photo: Samuel Bradley
Ahead of their upcoming MPG Awards, the Music Producers Guild (MPG) have revealed the latest win...
09/03/2026
Two new high-quality DI boxes announced
Boasting some impressive technical specifications and versatile routing options, Strymon's latest active DI boxe...
09/03/2026
Latest MPE-capable Soundbox library released
The follow-up release for Sonora Cinematic's Pure Nylon has arrived, and becomes the latest addition to the...
09/03/2026
Popular wireless mic head design revived
Sennheiser have revealed that the MD 9235, a cardioid mic head designed to pair up with their handheld wireless sys...
09/03/2026
Captures two sought-after Dumble combo amps
The latest TONEX release captures a pair of sought-after Dumble amplifiers from IK Multimedia's private amp ...
09/03/2026
Flexible all-analogue insert matrix joins line-up
HUM Audio Devices don't tend to do things by halves - even the quickest of glances at the likes of the...
09/03/2026
Captures three sought-after pianos
Rhodes latest software release brings together a collection of three virtual pianos: a concert grand, an acoustic upright...
09/03/2026
Flagship compressor gets an upgrade
Techivation's flagship compressor plug-in has just been treated to a ground-up rebuild that kits it out with some po...
09/03/2026
Profiler OS 14.0 enters open beta
Kemper's amp-modelling systems already have a great reputation, but the latest update to their systems' underlying...
09/03/2026
Procedural stems smasher & recomposition engine
Blinksonic have recently launched a new Reaktor-based tool which they say takes a radical departure from yo...
09/03/2026
13 - 15 March 2026 at University of Warwick Conference Centre
The Institute of Professional Sound (IPS) have announced that The IPS Training Weekend 2026 wi...
09/03/2026
Rohde & Schwarz and NETGEAR collaborate for next generation Wi-Fi 8 access point...
09/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/03/2026
Broadpeak, a leader in streaming and monetization at scale, will showcase its latest innovations for broadcasters and streaming platforms at NAB Show 2026 (boot...
09/03/2026
UKTV has agreed a new partnership deal with Samsung that makes UKTV's free linear channels available to internet-only Samsung TV viewers in the UK for the f...
09/03/2026
Monday 9 March 2026
Sky reveals first look trailer and sets premiere date for S...
09/03/2026
Monday 9 March 2026
Sky Appoints Damian Saunders as Managing Director of Sky Business
Sky has today announced the appointment of Damian Saunders as Managing D...
09/03/2026
Back to All News
The Predator of Seville premieres on Netflix on 27 March
Entertainment
09 March 2026
GlobalSpain
Link copied to clipboard
Download the im...
09/03/2026
Back to All News
Netflix Debuts the Trailer for Love is Blind: Sweden Season 3
Entertainment
09 March 2026
GlobalSweden
Link copied to clipboard
That wait...
09/03/2026
Bill O'Reilly Announces New Weekly Podcast We'll Do It LIVE! We'll Do It LIVE!' A Bold, Fresh Presentation from Bill O'Reilly
New York...
09/03/2026
MOSOLF SE & Co. KG Relies on green.screen from Arvato Systems for Strategic Ener...
09/03/2026
A powerful new national initiative supporting people living with dementia launch...
09/03/2026
AI is everywhere and accelerating everything - becoming essential infrastructure...
09/03/2026
ABB Robotics and NVIDIA today announced a breakthrough partnership that brings i...
07/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/03/2026
TNT Sports and the International Basketball Federation (FIBA) have reached a mul...
06/03/2026
OffBall and TOGETHXR, two influential young media companies in sports, announce a strategic and operational partnership in a shared push to scale and create inn...
06/03/2026
InfoComm 2026, a destination for AV, IT, broadcast, and AI-driven systems, annou...