Reinforcement Learning 'Really Works' for AI Against Pro Gamers, OpenAI Trailblazer Says
18/09/2018
Ilya Sutskever, co-founder and research director of OpenAI, and his team are developing AI bots smart enough to battle some of the world's best human gamers.
In August, OpenAI Five, a team of five neural networks, was defeated by some of the world's top professional players of Dota 2, the wildly popular multiplayer online battle arena game.
Even so, it was a leap for OpenAI Five to play a nearly unrestricted version of Dota 2 at a professional level. The matches took place at Valve's International competition in Vancouver - a world series of esports played for tens of millions of dollars.
That's because Dota 2 is an extremely complex game. Players can unleash an enormous number of tactics, strategies and interactions in the quest to win. The game map is only partially observable, and play requires both short-term tactics and long-term strategy, as each match can last 45 minutes. "Professional players dedicate their lives to this game," said Sutskever. "It's not an easy game to play."
Sutskever spoke Thursday at NTECH, an annual engineering conference at NVIDIA's Silicon Valley campus. The internal event drew an enthusiastic crowd of several hundred engineers - many also huge gaming fans - and hundreds more online.
Dota 2 Raises AI-Gaming Bar
OpenAI Five's Dota 2 work marks an entirely new level for human-versus-AI challenges. For comparison, in chess and Go - also popular AI challenges - the average number of possible actions per move is about 35 and 250, respectively. In Dota 2, which has far more complex rules, there are about 170,000 possible actions per move and about 20,000 moves per game.
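To put those figures in perspective, the short Python sketch below compares the per-move action counts and estimates how many distinct action sequences a single Dota 2 game could contain. It is a rough illustration only: it assumes every one of the 20,000 moves offers the full 170,000 actions, which overstates real play, and the function and variable names are made up for this example.

```python
import math

# Figures quoted in the article.
CHESS_ACTIONS = 35
GO_ACTIONS = 250
DOTA_ACTIONS = 170_000
DOTA_MOVES_PER_GAME = 20_000


def log10_action_sequences(actions_per_move, moves_per_game):
    """Return log10 of actions_per_move ** moves_per_game.

    Working in log space avoids building an enormous integer.
    Assumes the same number of actions is available at every move.
    """
    return moves_per_game * math.log10(actions_per_move)


# How many times more choices a Dota 2 move offers than a chess or Go move.
dota_vs_chess = DOTA_ACTIONS / CHESS_ACTIONS
dota_vs_go = DOTA_ACTIONS / GO_ACTIONS

# Order of magnitude of possible action sequences in one Dota 2 game.
dota_log10 = log10_action_sequences(DOTA_ACTIONS, DOTA_MOVES_PER_GAME)

print(f"Dota 2 vs chess, actions per move: {dota_vs_chess:,.0f}x")
print(f"Dota 2 vs Go, actions per move: {dota_vs_go:,.0f}x")
print(f"Dota 2 sequences per game: about 10^{dota_log10:,.0f}")
```

Under these assumptions, a single Dota 2 move offers several thousand times more choices than a chess move, and the number of possible action sequences in one game is a figure with roughly 104,600 digits.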
With all of its complexity, Dota 2 is closer to the real world than any previous game tackled by an AI, he said. "So, how did we do it? We used large-scale RL (reinforcement learning)," Sutskever told the audience.
Reinforcement learning matters for humans and machines alike. When we earn a bonus point in a game with one move or get blown to bits with another, each of these moments provides reinforcement learning - burned in memory - for the next go-round.
Reinforcement learning matters to AI because it is a very natural way of training neural networks to act in order to achieve goals, which is essential for building an intelligent system.
OpenAI Five has seen spectacular results because it used a reliable reinforcement learning algorithm (Proximal Policy Optimization) at massive scale, running on more than 1,000 NVIDIA Tesla P100 GPUs in Google Cloud Platform.
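For readers curious about what Proximal Policy Optimization optimizes, the sketch below computes the clipped surrogate objective from the published PPO algorithm for a small batch. It is a minimal, standard-library illustration and not OpenAI Five's code: the function name is hypothetical, and the clipping value of 0.2 is a commonly used default assumed here, not a figure from the talk.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, eps=0.2):
    """Average PPO clipped surrogate objective over a batch.

    new_logp / old_logp: log-probabilities of the taken actions under the
    updated and the data-collecting policy. advantages: how much better
    each action turned out than expected. eps: clipping range (assumed).
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)  # probability ratio new / old
        clipped = min(max(ratio, 1.0 - eps), 1.0 + eps)
        # Take the more pessimistic of the two estimates, so the policy
        # gains nothing by moving far from the one that gathered the data.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Toy batch: three actions with made-up log-probabilities and advantages.
value = ppo_clip_objective(
    new_logp=[-0.9, -1.2, -0.3],
    old_logp=[-1.0, -1.0, -1.0],
    advantages=[1.0, -0.5, 2.0],
)
print(f"clipped objective: {value:.4f}")
```

In training, this quantity would be maximized by gradient ascent on the policy network's parameters; the clipping is what keeps each update small enough to be reliable.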
NVIDIA has been an early supporter, with CEO Jensen Huang personally delivering the first DGX-1 AI supercomputer in a box to the folks at OpenAI.
History of GPU Challenges
Sutskever is no stranger to unleashing GPUs on AI's biggest challenges. He was among the trio of University of Toronto researchers - including Alex Krizhevsky and advisor Geoffrey Hinton - who pioneered a GPU-based convolutional neural network to take the prestigious ImageNet competition by storm.
The results - nearly halving the error rate - go down in history as the moment that spawned the modern AI boom.
The resulting model - dubbed AlexNet - is the basis of countless deep learning models. At GTC 2018, Huang spoke of AlexNet's influence on thousands of AI strains, stating: "Neural networks are growing and evolving at an extraordinary rate."
Sutskever says leaps in AI track closely with gains in processing power. "It's pretty remarkable that the amount of compute from the original AlexNet to AlphaGo Zero is 300,000x. You're talking about a five-year gap. Those are big increases."
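The 300,000x figure can be turned into a doubling time with a few lines of Python. This is a back-of-the-envelope sketch that assumes compute grew at a steady exponential rate over the five years Sutskever mentions; the function name is made up for this example.

```python
import math


def doubling_time_months(growth_factor, years):
    """Months per doubling, assuming steady exponential growth."""
    doublings = math.log2(growth_factor)  # how many times compute doubled
    return years * 12 / doublings


months = doubling_time_months(growth_factor=300_000, years=5)
print(f"doublings: {math.log2(300_000):.1f}")
print(f"doubling time: {months:.1f} months")
```

Under that assumption, 300,000x works out to about 18 doublings in 60 months, or one doubling roughly every 3.3 months.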
OpenAI's 'Moonshot' Ambitions
OpenAI is a nonprofit that was formed in 2015 to develop and release artificial general intelligence aimed at benefiting humanity. Its founding members include Tesla CEO Elon Musk, Y Combinator President Sam Altman and other tech luminaries who have collectively committed $1 billion to its mission.
Researchers at OpenAI are also making strides on a project called Dactyl, which aims to increase the dexterity of a robot hand. The team has been working on domain randomization - an old concept - with remarkable results. They have been able to train the robot hand to manipulate objects in simulation, and then transfer that knowledge to real-world manipulation. This is important, because simulation is the only way to get enough training experience for these robots. "The idea works really, really well," Sutskever said.
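The core idea of domain randomization is simple enough to sketch: instead of training in one fixed simulation, each training episode draws its physical parameters at random, so the learned behavior cannot depend on any single setting. The Python below is a toy illustration only. The parameter names and ranges are invented for this example and are not taken from Dactyl.

```python
import random
from dataclasses import dataclass


@dataclass
class SimParams:
    """Physical settings for one simulated episode (all hypothetical)."""
    friction: float
    object_mass_kg: float
    actuator_delay_s: float


# Illustrative ranges only; a real project would tune these carefully.
RANGES = {
    "friction": (0.5, 1.5),
    "object_mass_kg": (0.03, 0.30),
    "actuator_delay_s": (0.00, 0.04),
}


def sample_sim_params(rng):
    """Draw one random set of simulation parameters."""
    return SimParams(
        friction=rng.uniform(*RANGES["friction"]),
        object_mass_kg=rng.uniform(*RANGES["object_mass_kg"]),
        actuator_delay_s=rng.uniform(*RANGES["actuator_delay_s"]),
    )


def randomized_episodes(n, seed=0):
    """Return parameters for n episodes, each with a different 'world'."""
    rng = random.Random(seed)
    return [sample_sim_params(rng) for _ in range(n)]


for params in randomized_episodes(3):
    # A training loop would configure the simulator with these values
    # and run one episode of practice here.
    print(params)
```

A policy that succeeds across many such randomized worlds has a better chance of treating the real world as just one more variation.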
Sutskever is keen on pushing common AI concepts such as reinforcement learning and domain randomization to new heights. In the wide-ranging discussion at NTECH, he praised the conclusions of Arthur C. Clarke's book "Profiles of the Future," which notes that great inventions such as the airplane and space travel were historically met with doubt.
Skepticism, he said, initially led the U.S. to pass on building and sending a 200-ton rocket to space - on the grounds that it was too large to be built. "So the Russians went on and built a 200-ton rocket," he quipped, drawing audience laughter.
LINK: | https://blogs.nvidia.com/blog/2018/09/14/reinforcement-learning-openai... |