Edit: Back for day 2. 17 wins after first day
Edit 2: OpenAI did an AMA on reddit here, will add some interesting info from there to this thread
Winners:
Result page
- DreamShitters win 4 in a row
- Winstrike win 3 in a row
- 🇷🇺 @Winstrike_Team
- 🇹🇭 Question Mark by Hashtag
- 🇲🇾 AOES.K2Surf
- 🇵🇭 Pogiz Poseidon Esports
- 🇹🇭 Alpha Red
- 🇷🇺 DreamShiters
- 🇪🇺 @WagaGaming & Friends
General info on the OpenAI Five:
Main Arena website
Blog post on OpenAI
After first 1000 games, after 10 hours of uptime, only 3 were won by humans
The bot used 45,000 years of game time for reference, proving that humans are better at learning from less information and experience
The bot is frozen in the arena, and it is not learning anymore
Replays will be available in the future
Live Results are found here https://arena.openai.com/#/results
You had to register previously if you wanted to play with it
Bot will be ran over the weekend (66hours from the upload of this post)
There is no 1v1 version of the bot to play (it hasn't been trained on 1v1 in newest patch)
The current bot version has 99.9% win rate against the TI version of the bot: "Winrate evaluated on the current game patch. This biases the winrate towards the Finals version as the TI version was trained on an older patch, but currently we don’t have another way to compare agents trained on different game versions."
OpenAI AMA summary:
The bot model "brain" has 167 million parameters and 667MB in size (Source)
The bot will be discontinued to the public, since they are not planning to keep training the bot after each patch hits (Source)
They won't train the bot with the full 100+ heroes, but might be a possibility in the future if new drastic improvement in training methods are found (Source)
Takes 32 skylane CPU cores to run a game with five after training (Source)
The likely reason the bots are dropping 4 wards in one place is to make inventory room, since items and backpack behavior is scripted and not learned (Source)
Rough estimate on how much the bot cost to train is $110,000 (Source)
OpenAI won't branch out to other games, but will keep Dota for testing ideas (Source)
A follow up blog post will be added after arena weekend
The OpenAI team tried to expand to 80 heroes, and training was transferred decently from the current model, to about 3-5k MMR of the bots (Source)
How OpenAI sees: It sorts all game units by closest to it (Source)
The reasons OpenAI chose Dota2: Popularity (and huge prize pools) - Reflex/Micro is a secondary skill - Depth (complexity) - Availability for linux - API (Source)
Things the AI is surprisingly bad at
The winning games will be analyzed for any unusual insights on the bots
Streams:
Vods
Twitter handles:
Lessons from pro players
1st win by Alpha red:
Waga:
lineup: sniper, axe, razor, cm, sven
bot lineup: dp, gyro, viper, riki, wd
"The bots are locked, they are not learning, but we humans are. We will win." ~ Waga
Win1: Won a 55 minute game with 0 deaths sniper, by abusing vision and playing the late game
They won in the 3rd try, proving what OG Notail said "Give us 5 games, and we will figure it out"
For reference, pro players have around 10,000-20,000 hours in the game
Bots are bad at splitpush
Bad with shrapnel splitpush
Bad warding and dewarding game
They rarely use dust, so shadow blade is good
bad usage of DP exorcism
CM is needed to win in tempo against bots, mana is useful
bots don't favor late game runes, especially DD
bots are never guessing when humans want to rosh
5th ever human win
couriers are easy to gank, they got 5+ couriers killed each game
any mistake is heavily punished by the bot, and you basically have to keep up with them in gold and tempo
bots seem to never try to deny towers
Dreamshitters / ainodehna stack:
Won 3 in a row https://i.imgur.com/A7hpBLw.png
Lineup: Riki, SF, Sven, CM, ES
Strategy: Split pushing and taking it late, rushing buildings with roshan and shadow blades
iLTW stack:
lineup: sven, razor, cm(lil, ex-VP player), Shadow Fiend (iLTW), sniper (unstable/nonghrata)
enemy lineup: wd, dp, gyro, riki, razor
won in 33 minute by playing pure superior dota (silent and iLTW are monster players)
4th ever human win
Jabs stack:
Others:
riki + radiance is not detected at all by bots, they don't know what the burn is
if bots jump you, you better run
if bots are running, better push them
some buybacks by the bot are insta, and good players can abuse it lategame
bots do cancel their ults (for example a witch doctor stopped a channel to cask a CM ult)
openai's midgame is poor if they are behind on gold
bots react badly to atos, will TP even before atos is used
seems like bots won't bkb tp out even in front of an atos
Fun things:
Will update for more, please refer to the discord
[–]CorruptDropbear 27 points28 points29 points (5 children)
[–]loopuleasa[S] 5 points6 points7 points (0 children)
[–]AGI_69 1 point2 points3 points (3 children)
[–]Sordahon 0 points1 point2 points (2 children)
[–]AGI_69 4 points5 points6 points (1 child)
[–]AGI_69 1 point2 points3 points (0 children)
[–]teerre 51 points52 points53 points (13 children)
[–]dpwiz 15 points16 points17 points (4 children)
[–]loopuleasa[S] 39 points40 points41 points (0 children)
[–]teerre 0 points1 point2 points (2 children)
[–]*sheever support* Dropped my pants off at the cleaners.RedGuyNoPants 3 points4 points5 points (1 child)
[–]teerre 0 points1 point2 points (0 children)
[–]loopuleasa[S] 2 points3 points4 points (0 children)
[–]scooerp 2 points3 points4 points (6 children)
[–]teerre 0 points1 point2 points (5 children)
[–]scooerp 2 points3 points4 points (4 children)
[–]teerre 0 points1 point2 points (3 children)
[–]scooerp 1 point2 points3 points (2 children)
[–]teerre 0 points1 point2 points (1 child)
[–]scooerp 1 point2 points3 points (0 children)
[–]silent godWalrusPorn 34 points35 points36 points (3 children)
[–]kanak42 12 points13 points14 points (2 children)
[–]lolfail9001 16 points17 points18 points (0 children)
[–]silent godWalrusPorn 29 points30 points31 points (0 children)
[–]AI enthusiastmuskar2 5 points6 points7 points (1 child)
[–]你气不气?Imbluedabodee 1 point2 points3 points (0 children)
[–]decibelsBouncing 6 points7 points8 points (10 children)
[–]W10104 23 points24 points25 points (3 children)
[–]loopuleasa[S] 3 points4 points5 points (0 children)
[–]justatimebomb 5 points6 points7 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]loopuleasa[S] 5 points6 points7 points (1 child)
[–]TweetsInCommentsBot 5 points6 points7 points (0 children)
[+]loopuleasa[S] comment score below threshold-13 points-12 points-11 points (2 children)
[–]scooerp 2 points3 points4 points (0 children)
[–]Sunrise1912 4 points5 points6 points (2 children)
[–]loopuleasa[S] 0 points1 point2 points (1 child)
[–]AI enthusiastmuskar2 4 points5 points6 points (0 children)
[–]ZCC_TTC_IAUS 1 point2 points3 points (0 children)
[–][deleted] 1 point2 points3 points (1 child)
[–]Pjosk 3 points4 points5 points (0 children)
[–]ebhBali 1 point2 points3 points (0 children)
[–]I'd still have my TL flair but i have to move onbrianbezn 3 points4 points5 points (2 children)
[–]loopuleasa[S] 0 points1 point2 points (1 child)
[–]I'd still have my TL flair but i have to move onbrianbezn 1 point2 points3 points (0 children)
[–]MouZeWarrioR 0 points1 point2 points (23 children)
[–]lolfail9001 19 points20 points21 points (4 children)
[–]MouZeWarrioR -1 points0 points1 point (3 children)
[–]lolfail9001 -2 points-1 points0 points (2 children)
[–]MouZeWarrioR 1 point2 points3 points (1 child)
[–]lolfail9001 -1 points0 points1 point (0 children)
[–][deleted] 6 points7 points8 points (0 children)
[–]ElTigreChang1 7 points8 points9 points (3 children)
[–]MouZeWarrioR 1 point2 points3 points (2 children)
[–]IgnoobV 0 points1 point2 points (1 child)
[–]MouZeWarrioR 1 point2 points3 points (0 children)
[–]BuggyVirus 4 points5 points6 points (1 child)
[–]MouZeWarrioR -3 points-2 points-1 points (0 children)
[–]loopuleasa[S] 1 point2 points3 points (6 children)
[–]MouZeWarrioR 2 points3 points4 points (0 children)
[–]lolfail9001 1 point2 points3 points (0 children)
[+][deleted] (3 children)
[–]Aldehyde1 -1 points0 points1 point (1 child)
[–]MouZeWarrioR 1 point2 points3 points (0 children)
[–]ebhBali -1 points0 points1 point (1 child)
[–]MouZeWarrioR 1 point2 points3 points (0 children)
[–]exensual 0 points1 point2 points (0 children)
[–]sheeverdjoler 0 points1 point2 points (0 children)
[–]ENVY'S #1 FANSolarClipz 0 points1 point2 points (2 children)
[–]AI enthusiastmuskar2 0 points1 point2 points (0 children)
[–]loopuleasa[S] 0 points1 point2 points (0 children)
[–]NimblePunch 0 points1 point2 points (2 children)
[–]loopuleasa[S] 0 points1 point2 points (0 children)
[–]AI enthusiastmuskar2 0 points1 point2 points (0 children)
[–]m3ltd0wn02 0 points1 point2 points (0 children)
[–]Luxon31 -3 points-2 points-1 points (13 children)
[–]loopuleasa[S] 16 points17 points18 points (4 children)
[+]IgnoobV comment score below threshold-15 points-14 points-13 points (3 children)
[–]Chikerenaham 6 points7 points8 points (5 children)
[–]DezZzO 1 point2 points3 points (0 children)
[+]IgnoobV comment score below threshold-7 points-6 points-5 points (3 children)
[–]Vitosi4ek 4 points5 points6 points (0 children)
[–]WeskerHawke 0 points1 point2 points (0 children)
[–]skykoz -1 points0 points1 point (2 children)
[–]loopuleasa[S] 3 points4 points5 points (1 child)
[–]AI enthusiastmuskar2 0 points1 point2 points (0 children)
[–]AMERggwpm8f8 -2 points-1 points0 points (0 children)
[–]scooerp -2 points-1 points0 points (1 child)
[–]loopuleasa[S] 0 points1 point2 points (0 children)