Sony Pixel Power calrec Sony

Reinforcement Learning Really Works' for AI Against Pro Gamers, OpenAI Trailblazer Says

18/09/2018

Fast, creative, smart - great gamers are all these things. Somebody has to teach machines how to keep up. That somebody is Ilya Sutskever and his team at OpenAI

Sutskever, co-founder and research director of OpenAI, and his team at Open AI are developing AI bots smart enough to battle some of the world's best human gamers.

In August, OpenAI Five, a team of five neural networks, were defeated by some of the world's top professional players of Dota 2, the wildly popular multiplayer online battle arena game.

It was a leap for OpenAI Five to even be playing a nearly unrestricted version of Dota 2 at a professional level, which took place at Valve's International competition in Vancouver - a world series of esports played for tens of millions of dollars.

That's because Dota 2 is an extremely complex game. Players can unleash an enormous number of tactics, strategies and interactions in the quest to win. The game layout - only partially observable - requires both short-term tactics and long-term strategy, as each match can last 45 minutes. Professional players dedicate their lives to this game, said Sutskever. It's not an easy game to play.

Sutskever spoke Thursday at NTECH, an annual engineering conference at NVIDIA's Silicon Valley campus. The internal event drew an enthusiastic crowd of several hundred engineers - many also huge gaming fans - and hundreds more online.

Dota 2 Raises AI-Gaming Bar OpenAI Five's Dota 2 work marks an entirely new level for human-versus-AI challenges. For comparison, in chess and Go - also popular AI challenges - the average number of actions is 35 and 250, respectively. In Dota 2, which has really complex rules, there are about 170,000 actions per move and there are 20,000 moves per game.

With all of Dota 2's complexity, it's closer to the real world than any other previous game tackled by an AI, he said. So, how did we do it? We used large scale RL (reinforcement learning), Sutskever told the audience.

Reinforcement learning matters for humans and machines alike. When we earn a bonus point in a game with a move or get blown to bits with another, each of these moments provide reinforcement learning - burned in memory - for the next go-round.

Reinforcement learning matters to AI because it is a very natural way of training neural networks to act in order to achieve goals, which is essential for building an intelligent system.

OpenAI Five has seen spectacular results because it used a reliable reinforcement learning algorithm (Proximal Policy Optimization) at massive scale, running on more than 1,000 NVIDIA Tesla P100 GPUs in Google Cloud Platform.

NVIDIA has been there as an early supporter, with CEO Jensen Huang personally delivering the first DGX-1 AI supercomputer in a box for the folks at OpenAI.

History of GPU Challenges Sutskever is no stranger at unleashing GPUs on AI's biggest challenges. He was among the trio of University of Toronto researchers - including Alex Krizhevsky and advisor Geoffrey Hinton - who pioneered a GPU-based convolutional neural network to take the prestigious ImageNet competition by storm.

The results - nearly slashing in half the error rate - go down in history as the moment that spawned the modern AI boom.

The resulting model - dubbed AlexNet - is the basis of countless deep learning models. At GTC 2018, Huang spoke of AlexNet's influence on thousands of AI strains, stating: Neural networks are growing and evolving at an extraordinary rate.

Sutskever says leaps in AI track closely to processing gains. It's pretty remarkable that the amount of compute from the original AlexNet to AlphaGo Zero is 300,000x. You're talking about a five-year gap. Those are big increases.

OpenAI's Moonshot' Ambitions OpenAI is a nonprofit that was formed in 2015 to develop and release artificial general intelligence aimed at benefiting humanity. Its founding members include Tesla CEO Elon Musk, Y Combinator President Sam Altman and other tech luminaries who have collectively committed $1 billion to its mission.

Researchers at OpenAI are also making strides on a project called Dactyl, which aims to increase the dexterity of a robot hand. The team there has been working on domain randomization - an old concept - with remarkable results. They have been able to train the robot hand to manipulate objects in simulation, and then transfer that knowledge to real-world manipulation. This is important, because simulation is the only way to get enough training experience for these robots. The idea works really, really well, Sutskever said.

Sutskever is keen on pushing common AI concepts such as reinforcement learning and domain randomization to new heights. In the wide-ranging discussion at NTECH, he praised the conclusions of Arthur C. Clarke's book Profiles of the Future, which said historically, doubts were cast on great inventions such as the airplane and space travel.

Skepticism, he said, initially led the U.S. to pass on building and sending a 200-ton rocket to space - on the grounds that it's too large to be built. So the Russians went on and built a 200-ton rocket, he quipped, drawing audience laughter.
LINK: https://blogs.nvidia.com/blog/2018/09/14/reinforcement-learning-openai...
See more stories from nvidia

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

01/11/2025

ITN, Magnite Launch New Private Marketplace for Local Linear TV

NEW YORK ITN and the sell-side advertising company Magnite have announced the launch of what they are billing as the industrys first Local Linear TV Private Mar...

31/10/2025

FanDuel Sports Network To Deliver Selected Live NBA, NHL Games to Major Streaming Services for In-Market Viewers

FanDuel Sports Network To Deliver Selected Live NBA, NHL Games to Major Streamin...

31/10/2025

NBC Jumps Out of the Gate in Extended Breeder's Cup Deal With Dual Drones, Jockey Cams, RF Super-Mo

NBC Jumps Out of the Gate in Extended Breeder's Cup Deal With Dual Drones, J...

31/10/2025

Tribute: Remembering Segomotso Keorapetse (28 May 1968 22 October 2025)

FOR IMMEDIATE RELEASE 30 October 2025 It is with great sadness that we mourn the passing of Segomotso Keorapetse, an award- winning South African television d...

31/10/2025

Nexstar Extends Chairman and CEO Perry Sook Through 2029

IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, Nexstar Media Group said its board has extended chai...

31/10/2025

Late Night Thrives on Social Media With Billions of Views in 2025

While some analysts have questioned the ongoing economic viability of broacast-TV late night shows amid ongoing declines in linear viewing, new data from Tubula...

31/10/2025

Disney Programming Dropped From YouTube TV

The contentious contract negotiations between The Walt Disney Co. and YouTube TV have resulted in a blackout of Disney-owned programming on the pay TV operator....

31/10/2025

tvONE Integrates CALICO PRO Video Processing With Matrox ConvertIP Series

CINCINNATI Video conversion and AV signal distribution specialist tvONE and Matrox Video have struck a strategic partnership, combining CALICO PRO's video p...

31/10/2025

IAB Urges Standards for CTV Ad Measurement

NEW YORK The Interactive Advertising Bureau (IAB) today released a new industry guide that discusses the urgency of adopting new standards that will help advert...

31/10/2025

Late Night Shows Thrive on Social Media with Billions of Views in 2025

While some analysts have questioned the ongoing economic viability of late night shows on broadcast TV amid ongoing declines in linear viewing, new data from Tu...

31/10/2025

Berklee Celebrates the Inauguration of President Jim Lucchese

Berklee Celebrates the Inauguration of President Jim Lucchese In his inaugural address, Lucchese shared an optimistic vision for Berklee's future as a for...

31/10/2025

Family, Food, and Films: Netflix's 'Dining with the Kapoors' Arrives November 21

Back to All News Family, Food, and Films: Netflix's Dining with the Kapoors...

31/10/2025

DPA 4055 Featured in Technologies for Worship Magazine

The review highlights DPA 4055 Kick Drum Microphone for its compact design, ease of placement, and authentic tone that captures the true character of the drum p...

31/10/2025

RT Raidi na Gaeltachta Award 2025 to be presented to Piln N Chiarin

The RT Raidi na Gaeltachta Award 2025 will be presented to journalist P il n N Chiar in at the Oireachtas na Samhna in Belfast tomorrow, Saturday 1 November,...

31/10/2025

Share the magic: RT lyric fm Choirs for Christmas Competition 2025 open for submissions

RT lyric fm is calling for choirs across Ireland to share their festive music-m...

31/10/2025

Dnall Mac Ruair, Cuan Seireadin and Ts ite among the winners at the Oireachtas Communications Awards 2025

Three awards were presented to RT Raidi na Gaeltachta broadcasters at the Oire...

31/10/2025

RT is Supporting 29 Arts and Cultural Events across Ireland this November

RT continues its proud tradition of championing Ireland's vibrant arts and cultural landscape through its RT Supporting the Arts initiative. This November...

31/10/2025

RT selects Irish independent production company to produce Christian Worship on RT One and RT Player

RT selects Irish independent production company to produce Christian Worship on...

31/10/2025

Korea Joins AI Industrial Revolution: NVIDIA CEO Jensen Huang Unveils Historic Partnership at APEC Summit

Amidst Gyeongju, South Korea's ancient temples and modern skylines, Jensen H...

30/10/2025

Midwich Secures UK & Ireland Distribution Deal with X2O Media To Revolutionize Hybrid Learning

Midwich has signed a UK and Ireland distribution deal with X2O Media, a worldwid...

30/10/2025

SVG Students To Watch: Sam Newitt, Kansas State University

SVG Students To Watch: Sam Newitt, Kansas State UniversityThe South Dakota native thrives in many roles behind the scenes at K-StateHD.TVBy Brandon Costa, Direc...

30/10/2025

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel's Early Success

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel&...

30/10/2025

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas State University

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas S...

30/10/2025

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Board Chairman

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Bo...

30/10/2025

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distribution

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distributio...

30/10/2025

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming Services for In-Market Viewers

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming ...

30/10/2025

If I Had Legs, I'd Kick You, East of Wall, and More Sundance Institute-Supported Films Nominated for 35th Gotham Awards

As the year comes to a close, we can feel the invigorating wind sweeping in for ...

30/10/2025

Give Me the Backstory: Get to Know Max Walker-Silverman, the Writer-Director of Rebuilding

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

30/10/2025

Excellent training at SGL Carbon's Bonn site

The SGL Carbon site in Bonn has a long tradition of training. For many years, young talent has been successfully trained here, regularly achieving excellent exa...

30/10/2025

SBS, NITV and Screen Australia announce 2025 Digital Originals Shortlist

SBS, NITV and Screen Australia announce 2025 Digital Originals Shortlist 29 October, 2025 Media releases SBS, NITV and Screen Australia are excited to unve...

30/10/2025

Remarks for the 2025 APEC CEO Roundtable

Jon Rambeau, President of Integrated Mission Systems at L3Harris Technologies, speaks about industrial collaboration at the Asia-Pacific Economic Cooperation (A...

30/10/2025

L3Harris Technologies Reports Strong Third Quarter 2025 Results, Increases 2025 Guidance

MELBOURNE, Fla., October 30, 2025 - L3Harris Technologies (NYSE: LHX) reports th...

30/10/2025

FCC's Brendan Carr Issues Draft Proposal for More C-Band Spectrum Sales

WASHINGTON Federal Communications Commission Chair Brendan Carr said he has circulated a proposal for the agency to auction additional midband spectrum in the U...

30/10/2025

Diversified Names Paul Lidsky as CEO

PLANO, Texas Technology solutions provider Diversified has named Paul Lidsky as CEO, tasked with guiding the company's next stage of growth, driving market ...

30/10/2025

Interra Adds Stream Recording, BATON Integration to ORION

CUPERTINO, Calif. Interra Systems today unveiled ORION stream recording support and seamless integration with BATON Media Player, a combination that lets broadc...

30/10/2025

InterDigital Buys AI-Driven Video Codec Startup Deep Render

WILMINGTON, Del. InterDigital today announced the acquisition of Deep Render, an artificial intelligence startup with a team of AI experts focused on video code...

30/10/2025

TAG Video Systems Earns Two ESG Recognitions

NEW YORK TAG Video Systems has earned a higher-rated Digital Product Passport (DPP) Committed to Sustainability badge and the Aclymate Climate Wise Silver Tier ...

30/10/2025

Nexstar Extends Employment Agreement with Perry Sook Through 2029

IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, the Nexstar Media Group, Inc. has announced that its...

30/10/2025

Samba TV: 60% Of TV Time Spent Viewing Streaming Content

Television viewers are spending more time watching streaming content than linear TV, but sports continues to be a bright spot for broadcasters, according to the...

30/10/2025

Operative Media Names Mike Napadano as CEO

NEW YORK Advertising technology company Operative Media has named Mike Napadano as its new CEO....

30/10/2025

Walmart Selects Marshall Cameras to Power New Campus Broa...

Walmart Inc. has chosen Marshall Electronics cameras for use across its brand-new corporate campus studios and event center. The installation includes Marshall ...

30/10/2025

NETGEAR Academy Expands Into Industry-Wide IP Training Pl...

NETGEAR, Inc. (NASDAQ: NTGR), a global leader in intelligent networking solutions designed to power extraordinary experiences, today announced the launch of its...

30/10/2025

Clear-Com Gen-IC Virtual Intercom Connects Students World...

Clear-Com recently contributed its award-winning Gen-IC virtual intercom solution to power real-time communications for On-Air Student TV, a 24-hour global st...

30/10/2025

Maxon Strengthens Growth Strategy with Appointment of Kse...

Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, and more, today announced t...

30/10/2025

Studio Technologies Dante Enabled Model 394 GPI Interface...

Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, announces that its new Model 394 GPI Interface and Model 39...

30/10/2025

Astro selects Broadpeak for high performance streaming an...

Broadpeak , a leader in streaming and monetization at scale, has been selected by leading Malaysian content and entertainment company Astro to enable two major ...

30/10/2025

Riedel Communications Appoints Ulrich Voigt as Director L...

Riedel Communications is pleased to announce that Ulrich Voigt has joined the company as Director Live Production Solutions, taking over the SimplyLive business...

30/10/2025

LiveU and Kinetiq Launch Cloud Native Watermarking Integr...

LiveU, the global leader in live IP-video contribution, production, and distribution, today announced a new partnership with Kinetiq, the AI-powered platform un...