How GPUs Can Democratize Deep Reinforcement Learning for Robotics Development

11/12/2020

It can take a puppy weeks to learn that certain kinds of behaviors will result in a yummy treat, extra cuddles or a belly rub - and that other behaviors won't. With a system of positive reinforcement, a pet pooch will in time anticipate that chasing squirrels is less likely to be rewarded than staying by their human's side.

Deep reinforcement learning, a technique used to train AI models for robotics and complex strategy problems, works off the same principle.

In reinforcement learning, a software agent interacts with a real or virtual environment, relying on feedback from rewards to learn the best way to achieve its goal. Like the brain of a puppy in training, a reinforcement learning model uses what it has observed about the environment and its rewards to determine which action the agent should take next.

To date, most researchers have relied on a combination of CPUs and GPUs to run reinforcement learning models. This means different parts of the computer tackle different steps of the process - including simulating the environment, calculating rewards, choosing what action to take next, actually taking action, and then learning from the experience.

But switching back and forth between CPU cores and powerful GPUs is by nature inefficient, requiring data to be transferred from one part of the system's memory to another at multiple points during the reinforcement learning training process. It's like a student who has to carry a tall stack of books and notes from classroom to classroom, plus the library, before grasping a new concept.

With Isaac Gym, NVIDIA developers have made it possible to instead run the entire reinforcement learning pipeline on GPUs - enabling significant speedups and reducing the hardware resources needed to develop these models.

Here's what this breakthrough means for the deep reinforcement learning process, and how much acceleration it can bring developers.

Reinforcement Learning on GPUs: Simulation to Action

When training a reinforcement learning model for a robotics task - like a humanoid robot that walks up and down stairs - it's much faster, safer and easier to use a simulated environment than the physical world. In a simulation, developers can create a sea of virtual robots that can quickly rack up thousands of hours of experience at a task.

If tested solely in the real world, a robot in training could fall down, bump into or mishandle objects - causing potential damage to its own machinery, the object it's interacting with or its surroundings. Testing in simulation provides the reinforcement learning model a space to practice and work out the kinks, giving it a head start when shifting to the real world.

In a typical system today, the NVIDIA PhysX simulation engine runs this experience-gathering phase of the reinforcement learning process on NVIDIA GPUs. But for other steps of the training application, developers have traditionally still used CPUs.

Traditional deep reinforcement learning uses a combination of CPU and GPU computing resources, requiring significant data transfers back and forth.

A key part of reinforcement learning training is conducting what's known as the forward pass: First, the system simulates the environment, records a set of observations about the state of the world and calculates a reward for how well the agent did.

The recorded observations become the input to a deep learning policy network, which chooses an action for the agent to take. Both the observations and the rewards are stored for use later in the training cycle.

Finally, the action is sent back to the simulator so that the rest of the environment can be updated in response.

After several rounds of these forward passes, the reinforcement learning model takes a look back, evaluating whether the actions it chose were effective or not. This information is used to update the policy network, and the cycle begins again with the improved model.
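The cycle described above can be sketched in a few lines of plain Python. This is only an illustration of the loop structure, not Isaac Gym code: the `LineWorld` simulator, the one-weight policy and the update rule (a simple score-function update on the immediate reward, with the batch mean as a crude baseline) are all assumptions made for the example.

```python
import math
import random


class LineWorld:
    """Hypothetical toy simulator: an agent on a number line tries to stay near 0."""

    def __init__(self, rng):
        self.rng = rng
        self.x = 0.0

    def reset(self):
        self.x = self.rng.uniform(-5.0, 5.0)
        return self.x  # the observation is simply the position

    def step(self, action):
        # Action 1 moves right by one unit, action 0 moves left by one unit.
        self.x += 1.0 if action == 1 else -1.0
        reward = -abs(self.x)  # closer to 0 earns a higher (less negative) reward
        return self.x, reward


def prob_right(w, obs):
    """Stand-in for the policy network: P(move right | obs) with one weight."""
    z = max(-30.0, min(30.0, w * obs))  # clamp to keep exp() well behaved
    return 1.0 / (1.0 + math.exp(-z))


def collect_rollout(env, w, horizon, rng):
    """Forward passes: observe, choose an action, step the simulator, store."""
    buffer = []
    obs = env.reset()
    for _ in range(horizon):
        action = 1 if rng.random() < prob_right(w, obs) else 0
        next_obs, reward = env.step(action)
        buffer.append((obs, action, reward))  # kept for the update phase
        obs = next_obs
    return buffer


def update_policy(w, buffer, lr=0.01):
    """Look back over the stored experience and adjust the policy weight."""
    baseline = sum(r for _, _, r in buffer) / len(buffer)
    grad = 0.0
    for obs, action, reward in buffer:
        # Derivative of log pi(action | obs) with respect to w for a logistic policy.
        grad_log_pi = (action - prob_right(w, obs)) * obs
        grad += grad_log_pi * (reward - baseline)
    return w + lr * grad / len(buffer)


def train(cycles=200, horizon=32, seed=0):
    """Alternate between gathering experience and updating the policy."""
    rng = random.Random(seed)
    env = LineWorld(rng)
    w = 0.0
    for _ in range(cycles):
        buffer = collect_rollout(env, w, horizon, rng)
        w = update_policy(w, buffer)
    return w
```

Calling `train()` returns the final policy weight. In this toy setup a negative weight corresponds to a policy that tends to move left when the observation is positive and right when it is negative, that is, back toward 0.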

GPU Acceleration with Isaac Gym

To eliminate the overhead of transferring data back and forth from CPU to GPU during this reinforcement learning training cycle, NVIDIA researchers have developed an approach to run every step of the process on GPUs. This is Isaac Gym, an end-to-end training environment, which includes the PhysX simulation engine and a PyTorch tensor-based API.

Isaac Gym makes it possible for a developer to run tens of thousands of environments simultaneously on a single GPU. That means experiments that previously required a data center with thousands of CPU cores can in some cases be trained on a single workstation.
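To give a feel for what running many environments at once means, here is a minimal sketch of a batched simulator in plain Python. It is not the Isaac Gym API: the `BatchedPointMass` class, its unit-mass physics and its reward are hypothetical. The point is only that the state of every environment lives in flat arrays and one `step` call advances all of them; on a GPU those arrays would be tensors updated in parallel.

```python
import random


class BatchedPointMass:
    """Hypothetical batch of 1-D point masses, one per environment."""

    def __init__(self, num_envs, dt=0.01, seed=0):
        rng = random.Random(seed)
        self.dt = dt
        self.pos = [rng.uniform(-1.0, 1.0) for _ in range(num_envs)]
        self.vel = [0.0] * num_envs

    def step(self, forces):
        """Advance every environment by one time step with a single call."""
        if len(forces) != len(self.pos):
            raise ValueError("need exactly one force per environment")
        # Unit mass assumed: velocity changes by force * dt, then position
        # is integrated with the updated velocity.
        self.vel = [v + f * self.dt for v, f in zip(self.vel, forces)]
        self.pos = [p + v * self.dt for p, v in zip(self.pos, self.vel)]
        rewards = [-abs(p) for p in self.pos]  # reward staying near the origin
        observations = list(zip(self.pos, self.vel))
        return observations, rewards


def push_toward_origin(observations):
    """Toy policy applied to the whole batch: push each mass toward 0."""
    return [-1.0 if p > 0 else 1.0 for p, _ in observations]
```

A batch is driven the same way as a single environment: `envs = BatchedPointMass(num_envs=10000)`, then repeatedly `obs, rewards = envs.step(push_toward_origin(obs))`, starting from `obs = list(zip(envs.pos, envs.vel))`.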

NVIDIA Isaac Gym runs entire reinforcement learning pipelines on GPUs, enabling significant speedups.

Decreasing the amount of hardware required makes reinforcement learning more accessible to individual researchers who don't have access to large data center resources. It can also make the process a lot faster.

A simple reinforcement learning model tasked with getting a humanoid robot to walk can be trained in just a few minutes with Isaac Gym. But the impact of end-to-end GPU acceleration is most useful for more challenging tasks, like teaching a complex robot hand to manipulate a cube into a specific position.

This problem requires significant dexterity by the robot, and a simulation environment that involves domain randomization, a mechanism that allows the learned policy to more easily transfer to a real-world robot.
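Domain randomization can be pictured as giving every simulated environment slightly different physics, so the policy cannot overfit to one exact set of values. The sketch below shows the idea only; the parameter names and ranges are invented for illustration and are not taken from the OpenAI or NVIDIA experiments.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class PhysicsParams:
    """Hypothetical physical properties that vary between environments."""
    cube_mass_kg: float
    friction: float
    motor_gain: float


# Illustrative ranges only; real experiments choose these per task.
RANGES = {
    "cube_mass_kg": (0.03, 0.10),
    "friction": (0.5, 1.5),
    "motor_gain": (0.8, 1.2),
}


def sample_params(rng):
    """Draw one set of physics parameters uniformly from the ranges."""
    values = {name: rng.uniform(lo, hi) for name, (lo, hi) in RANGES.items()}
    return PhysicsParams(**values)


def randomize_environments(num_envs, seed=0):
    """Give every simulated environment its own randomized physics."""
    rng = random.Random(seed)
    return [sample_params(rng) for _ in range(num_envs)]
```

A policy trained across all of these variations has to cope with a spread of masses, friction values and motor responses, which is the property that helps it carry over to a physical robot whose exact parameters are not known in advance.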

Research by OpenAI tackled this task with a cluster of more than 6,000 CPU cores plus multiple NVIDIA Tensor Core GPUs - and required about 30 hours of training for the reinforcement learning model to succeed at the task 20 times in a row using a feed-forward network model.

Using just one NVIDIA A100 GPU with Isaac Gym, NVIDIA developers were able to achieve the same level of success in around 10 hours - a single GPU outperforming
LINK: https://blogs.nvidia.com/blog/2020/12/10/deep-reinforcement-learning-g...