
It can take a puppy weeks to learn that certain kinds of behaviors will result in a yummy treat, extra cuddles or a belly rub - and that other behaviors won't. With a system of positive reinforcement, a pet pooch will in time anticipate that chasing squirrels is less likely to be rewarded than staying by their human's side.
Deep reinforcement learning, a technique used to train AI models for robotics and complex strategy problems, works off the same principle.
In reinforcement learning, a software agent interacts with a real or virtual environment, relying on feedback from rewards to learn the best way to achieve its goal. Like the brain of a puppy in training, a reinforcement learning model uses information it's observed about the environment and its rewards, and determines which action the agent should take next.
To date, most researchers have relied on a combination of CPUs and GPUs to run reinforcement learning models. This means different parts of the computer tackle different steps of the process - including simulating the environment, calculating rewards, choosing what action to take next, actually taking action, and then learning from the experience.
But switching back and forth between CPU cores and powerful GPUs is by nature inefficient, requiring data to be transferred from one part of the system's memory to another at multiple points during the reinforcement learning training process. It's like a student who has to carry a tall stack of books and notes from classroom to classroom, plus the library, before grasping a new concept.
With Isaac Gym, NVIDIA developers have made it possible to instead run the entire reinforcement learning pipeline on GPUs - enabling significant speedups and reducing the hardware resources needed to develop these models.
Here's what this breakthrough means for the deep reinforcement learning process, and how much acceleration it can bring developers.
Reinforcement Learning on GPUs: Simulation to Action When training a reinforcement learning model for a robotics task - like a humanoid robot that walks up and down stairs - it's much faster, safer and easier to use a simulated environment than the physical world. In a simulation, developers can create a sea of virtual robots that can quickly rack up thousands of hours of experience at a task.
If tested solely in the real world, a robot in training could fall down, bump into or mishandle objects - causing potential damage to its own machinery, the object it's interacting with or its surroundings. Testing in simulation provides the reinforcement learning model a space to practice and work out the kinks, giving it a head start when shifting to the real world.
In a typical system today, the NVIDIA PhysX simulation engine runs this experience-gathering phase of the reinforcement learning process on NVIDIA GPUs. But for other steps of the training application, developers have traditionally still used CPUs.
Traditional deep reinforcement learning uses a combination of CPU and GPU computing resources, requiring significant data transfers back and forth. A key part of reinforcement learning training is conducting what's known as the forward pass: First, the system simulates the environment, records a set of observations about the state of the world and calculates a reward for how well the agent did.
The recorded observations become the input to a deep learning policy network, which chooses an action for the agent to take. Both the observations and the rewards are stored for use later in the training cycle.
Finally, the action is sent back to the simulator so that the rest of the environment can be updated in response.
After several rounds of these forward passes, the reinforcement learning model takes a look back, evaluating whether the actions it chose were effective or not. This information is used to update the policy network, and the cycle begins again with the improved model.
GPU Acceleration with Isaac Gym To eliminate the overhead of transferring data back and forth from CPU to GPU during this reinforcement learning training cycle, NVIDIA researchers have developed an approach to run every step of the process on GPUs. This is Isaac Gym, an end-to-end training environment, which includes the PhysX simulation engine and a PyTorch tensor-based API.
Isaac Gym makes it possible for a developer to run tens of thousands of environments simultaneously on a single GPU. That means experiments that previously required a data center with thousands of CPU cores can in some cases be trained on a single workstation.
NVIDIA Isaac Gym runs entire reinforcement learning pipelines on GPUs, enabling significant speedups. Decreasing the amount of hardware required makes reinforcement learning more accessible to individual researchers who don't have access to large data center resources. It can also make the process a lot faster.
A simple reinforcement learning model tasked with getting a humanoid robot to walk can be trained in just a few minutes with Isaac Gym. But the impact of end-to-end GPU acceleration is most useful for more challenging tasks, like teaching a complex robot hand to manipulate a cube into a specific position.
This problem requires significant dexterity by the robot, and a simulation environment that involves domain randomization, a mechanism that allows the learned policy to more easily transfer to a real-world robot.
Research by OpenAI tackled this task with a cluster of more than 6,000 CPU cores plus multiple NVIDIA Tensor Core GPUs - and required about 30 hours of training for the reinforcement learning model to succeed at the task 20 times in a row using a feed-forward network model.
Using just one NVIDIA A100 GPU with Isaac Gym, NVIDIA developers were able to achieve the same level of success in around 10 hours - a single GPU outperforming
Most recent headlines
06/10/2025
France T l visions, France's leading broadcaster, has received the 2025 EBU ...
04/09/2025
Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...
01/07/2025
WASHINGTON The Federal Communications Commission's Enforcement and Media Bureaus have entered into a consent decree with Sinclair to resolve a variety of in...
01/07/2025
DENVER Low-power television (LPTV) station owners looking to navigate the complexities of selling their assets in todays dynamic media environment are invited t...
01/07/2025
NASA announced today that live programming from its NASA+ channel will be available on Netflix starting sometime this summer....
01/07/2025
WASHINGTON Federal Communications Commission Chair Brendan Carr has appointed Katie McAuliffe to serve as policy advisor in his office....
01/07/2025
MOUNTAIN VIEW, Calif. Alphabet's GFiber pay TV and broadband provider has announced that it recently worked with Nokia to demonstrate network slicing....
01/07/2025
NEW YORK, N.Y. DoubleVerify (DV) has announced the launch of DV Authentic Attention for Social. The product will first launch with Snap, the owner of Snapchat....
01/07/2025
WASHINGTON The Federal Communications Commission has rejected license challenges to three full-power Baltimore TV stations and agreed to renew the license for C...
01/07/2025
Compact new converter lets users capture live NDI and streaming sources into software over a USB interface
Video interface and IP workflow innovator Magewell ...
01/07/2025
Disguise, the award-winning tech company driving visuals for Broadway and West End hits including Redwood, Stranger Things: The First Shadow and Disney's Fr...
01/07/2025
Vocal-processing plug-in joins NOIZ Hub series
Launched in 2024, KIT Plugins' NOIZ Hub series was created with the aim of providing a range of professio...
01/07/2025
New self-paced learning programme announced
Mastering.com have announced the availability of a new online course designed to cover the fundamentals of maste...
01/07/2025
Historic appointment ushers in unified leadership for WRAL-TV, New Media, and Digital Solutions
RALEIGH, N.C. - 6-27-25 - Capitol Broadcasting Company is prou...
30/06/2025
There's nothing quite like the magic of finding music that feels made just f...
30/06/2025
When it comes to new music, Spotify's team of editors across North America is always on the hunt for songs that make them feel, think, and move. They're...
30/06/2025
SBS On Demand boosts global news offering with launch of France 24 FAST Channel
30 June, 2025
Media releases
SBS is expanding its international news offeri...
30/06/2025
Star Studded Ensemble Cast Are Joined by Richard Rankin as Filming Begins on the Second Season
[June 12, 2025 - Boston, MA]: The Forsytes, Debbie Horsfield...
30/06/2025
The Artemis II Space Launch System core stage is integrated with the solid rocket boosters inside High Bay 3 of the Vehicle Assembly Building at NASAs Kennedy S...
30/06/2025
RALEIGH, N.C. Capitol Broadcasting Co. has named Heather Gray vice president and general manager of WRAL-TV and WRAZ-TV here....
30/06/2025
The Virginia Association of Broadcasters has recognized Bill Sewell, Director of Engineering at WTKR & WGNT in Norfolk, Va. as the recipient of the 2025 J.J. Fr...
30/06/2025
The Society of Broadcast Engineers said its annual member drive resulted in the recruitment of 49 individual members....
30/06/2025
BURLINGTON, Mass. Avid today released its fully integrated news platform, uniting MediaCentral and Wolftech News in a single newsroom solution, and will demonst...
30/06/2025
WASHINGTON The Federal Communication's Enforcement and Media Bureaus have entered into a Consent Decree with Sinclair Broadcast Group to resolve a variety o...
30/06/2025
Eurorack sequencer module reimagined
California-based modular synth innovators Qu-Bit have announced the launch of a new module that offers a fresh new take...
30/06/2025
Berklee at Umbria Jazz Clinics to Host 40th Anniversary Concert The celebration will be held on July 10 in Perugia, Italy.
By
Colette Greenstein
June 30, 202...
30/06/2025
PremiumBeat Tips and Tricks
Brie Clayton June 30, 2025
0 Comments
When editing to impress, you'll need quality music, and if your studio happens t...
30/06/2025
Improved dynamic behaviour, improved audio quality & more
Techivation have announced the release of an upgraded edition of their very first premium plug-in,...
30/06/2025
German premiere with live flight demonstration: German industry team showcases e...
30/06/2025
Back to All News
Bel n Cuesta and Karra Elejalde Star in El ni o, the New Film ...
30/06/2025
Back to All News
A New Dangerous Troll Awakens: Netflix Unleashes Teaser for Troll 2Play Video
Play Video
Entertainment
30 June 2025
GlobalNorwayDenmarkSwe...
30/06/2025
The Focusrite Summer Sale is now on Don't miss unbeatable deals on Scarlett, Vocaster, and more.
Whether you're an artist, a producer, or a podcaste...
30/06/2025
All 8 episodes of Season 1 of 1923 will be available on RT Player from Tuesday ...
30/06/2025
Facebook
Twitter
LinkedIn
52% report AI security spending is displacing tr...
30/06/2025
Facebook
Twitter
LinkedIn
Cannes, June 30th, 2025 - Thales Alenia Space, t...
29/06/2025
Handpan-inspired instrument announced
Roland have announced the launch of the Mood Pan, a unique electronic hand percussion instrument that has been designe...
29/06/2025
Back to All News
A Secret Society, Ritualistic Killings, and a Century-Old Curs...
28/06/2025
Johannesburg, 27 June 2025 - As the nation commemorates Youth Month
2025, the N...
28/06/2025
WASHINGTON In a press conference following the Federal Communications Commission's May Open Meeting, Chair Brendan Carr promised the agency would move rapid...
28/06/2025
STAMFORD, Conn. Charter Communications has awarded $1.1 million in Spectrum Digital Education grants to 55 nonprofit organizations that work to expand access to...
28/06/2025
LAKE FOREST, Calif. June 19, 2025
What's New:
Sonnet Technologies today announced the certification of its Echo 20 Thunderbolt 4 SuperDock as an Engin...
28/06/2025
MASV (massive.io), the fastest and most reliable large file transfer platform for media professionals, has been named an IDC Innovator in the IDC Innovators: Me...
28/06/2025
Grass Valley today announced that TV SKYLINE GmbH, one of Europe's top mobile production providers, has expanded its camera inventory with 30 LDX 135 UHD/HD...
28/06/2025
AgileTV, a European leader in TV and video technology solutions, signed an agreement with Austrian telco LIWEST to develop and implement its TV service in Austr...
28/06/2025
Music theory plug-in updated
Three months on from the release of the latest version of their renowned music theory plug in, Scaler Music have launched an up...
28/06/2025
The 48th Annual Indian National Finals Rodeo Shot with Blackmagic PYXIS 6K
Brie Clayton June 27, 2025
0 Comments
Filmmaker Cameron Mackey relied on Bl...
28/06/2025
Social, Streaming Don't Compete, They Compliment
Andy Marken June 27, 2025
0 Comments
I think we've all arrived at a very special place. Spir...
28/06/2025
Blackmagic Design Captures Filipino Rock Band Drama Singtala
Brie Clayton June 27, 2025
0 Comments
Blackmagic URSA Mini Pro 12K and DaVinci Resolve St...
28/06/2025
Enhance Videos Faster with Aiarty Video Enhancer - Offline, Sharp, and Natural
Brie Clayton June 27, 2025
0 Comments
If you've used AI video tools...
27/06/2025
By Jessica Herndon
One of the most exciting things about the Sundance Film Fest...