, 3 tweets, 1 min read Read on Twitter
Within a day of @OpenAI 's release of the Dota 2 bots, *eight* different human non-professional teams were able to beat them. It took them a number of games to learn how the bots worked, and devise strategies to counter them. Humans never fail to surprise us with their ingenuity.
#OpenAIFive has been defeated 26 times now. One team defeated them 4 times, and two others twice. A professional team will probably be able to defeat them consistently after some practice.
To me, this confirms the hypothesis that model-free RL doesn't scale to complex problems. Even after restricting the game to 17 heroes, and providing 45k years worth of data, model-free RL learns fragile one-trick policies which beat humans only due to mechanical superiority.
Missing some Tweet in this thread?
You can try to force a refresh.

Like this thread? Get email updates or save it to PDF!

Subscribe to Sherjil Ozair
Profile picture

Get real-time email alerts when new unrolls are available from this author!

This content may be removed anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!

This site is made by just three indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!