
[–]nadipity[S] 226 points227 points  (5 children)

It appears that some of our team members don't use Reddit and their freshly made accounts are getting rate limited. Will be translating some of their answers using our Redditor team member accounts =P

[–]1-800-REDDITARDS 11 points12 points  (0 children)

Tell them to go to ask reddit and farm some karma lol

Make a sympathetic comment to a rising thread.

It's like playing 100 turbo games to unlock ranked lol

[–]Sheever made me go pinkprohjort 81 points82 points  (23 children)

What is the bots' logic when placing 4 wards on the same spot, or leaving a creep behind in their own creep camp?

[–]suchenzang 125 points126 points  (9 children)

We have a theory that Five drops wards to keep item slots available for when they receive more valuable items. All of these are "learned" behaviors, so we can only theorize as to why they decide dropping multiple wards is the most likely / optimal action to take at a given time.

[–]100kV 34 points35 points  (7 children)

Are they aware that putting items in their backpacks is an option?

[–]suchenzang 69 points70 points  (6 children)

They are, but item swapping (from backpack to inventory) is also scripted.

[–][deleted] 17 points18 points  (5 children)

So, hardcoded? Were they unable to figure out its usage, or are you aware of any issues that would prevent them from using it?

[–]suchenzang 42 points43 points  (4 children)

We ran an experiment to let them learn this behavior, and it seemed like they were capable of learning it to a reasonable level. Unfortunately it didn't learn to use it any better than its scripted behavior, so we decided to take it out before our OG match.

[–][deleted] 18 points19 points  (3 children)

Out of curiosity: why not leave it in the self-learned mode? If the performance is on par with the scripted mode, what would be the motivation to revert?

[–]suchenzang 53 points54 points  (2 children)

We had a lot of model instability issues over the last few weeks leading up to the OG match. One of the suspicions was that newly introduced actions / parameters were breaking the model somehow (training runs were diverging at a really slow pace). We had to revert a lot of changes last minute and restart the training from a previous checkpoint, which unfortunately also removed the model-based item swap logic.

We also had a theory about how our introduction / implementation of item swap had broken gradients. These will all be topics we investigate over the next few months.

[–]nadipity[S] 74 points75 points  (7 children)

Currently our consumable logic is scripted, so the AI isn't really choosing when they're buying wards or regen. When the courier drops off something that the hero doesn't want, they'll often just use it right away - especially if their slots are full and they want whatever got shoved into their backpack.

As for creep camps, it's unclear if they understand the rules behind blocking a camp / finishing a camp - and even less clear if they understand the timers on those camps. The simple answer would just be that they haven't figured those concepts out yet.

[–]trebuch3t 10 points11 points  (1 child)

Additionally does this mean the salve over tango choice was yours or theirs?

[–]nadipity[S] 28 points29 points  (0 children)

Eliminating tangoes was originally our choice (particularly because we started out not telling them about all the trees in the game). We did train it over the last month or so but eventually we had to roll back due to some issues about a week before the OG match.

In terms of choice, it's a bit of a combination - while we tell them what to buy, we start out by seeing how they perform under different scripted circumstances (aka, figure out what they like or what they're good at) and then compare win rates to see which option is better for them.

[–]trebuch3t 3 points4 points  (0 children)

Can you share the scripted logic used for consumables? Some combination of health percent and available gold?

[–]FakePsyho 26 points27 points  (4 children)

Warding is one of those weird mysteries. I'm pretty sure that warding during the benchmark was much better than it is now. ¯\_(ツ)_/¯

[–]HoNTrashColonelWilly 218 points219 points  (103 children)

When the bot is training, is there an advantage between Dire and Radiant?

For human players, Radiant has a huge advantage: https://www.dotabuff.com/heroes/meta?view=played&metric=faction

[–]suchenzang 366 points367 points  (90 children)

We see a roughly +5% winrate when Five plays Radiant instead of Dire.

[–]TravisGurley 162 points163 points  (83 children)

Doesn't this mean the advantage Radiant has over Dire doesn't have to do with the camera?

[–]Zett, the Arc WardenTheZett 153 points154 points  (68 children)

"Camera advantage" depends on subjective factors anyway.

Some people prefer playing on Dire and even play better on Dire than on Radiant.

Since the bots aren’t subjective, and they still have a 5% advantage, it can be concluded that the camera factor is indeed a non-factor after all.

[–]HowIsBuffakeeTaken 10 points11 points  (57 children)

Can you give an example of a player that has a higher dire winrate?

[–]NoveltyCritique 15 points16 points  (0 children)

Being a team game, if the average is skewed this far in favor of Radiant then it's unlikely that even a player who performs better on Dire will win more Dire matches than he loses; his win rate on Dire will simply be closer to 50% than the average player's.

[–]Weshtonio 13 points14 points  (0 children)

It's time for Valve to write you a check so that you put some agents fighting each other until the game is AI-certified balanced.

[–]nadipity[S] 75 points76 points  (10 children)

Our test teams have noticed that the behaviors on Radiant and Dire are also vaguely different - either in terms of objective prioritization (ex: over-prioritizing Radiant's outer safe lane tower when playing on Dire) or lane matchups, which then impact performance and thus winrate. Overall, the bias is likely different than humans' (ex: they don't have the camera angle issue), but there may be some overlap as well.

[–]Ragoz 13 points14 points  (8 children)

If Open AI doesn't have the camera angle issue are you saying they are receiving more information than is provided from the field of view of a player?

The big issue for players is the angle of the field of view shows more information at the top of the screen than at the bottom as demonstrated in this image: https://imgur.com/IhVsx23

[–]FatChocobo 10 points11 points  (7 children)

Yes, the agents receive all visible information (i.e. not obscured by fog of war) via an API. They can see everything that's going on at all times.

[–]FakePsyho 47 points48 points  (0 children)

55-56% winrate for random mirror matchups in our 17-hero pool

[–]reapr56 73 points74 points  (17 children)

Would you guys consider adding a gimped version as replacement for the dota2 bots?

[–]suchenzang 87 points88 points  (16 children)

Would need Valve to ask us about it :)

[–]hinterlufer 9 points10 points  (13 children)

Wouldn't it be way too resource-intensive compared to scripted bots?

[–]Plebinator6000 129 points130 points  (19 children)

Hey! Is there a possibility of OpenAI Five being accessible to the public again in the future? I'm away for the weekend and I'm gutted I can't play against them, and I'm sure the community would love having an extra bot mode in the game to practise with (and be demolished by)

Thanks a lot for all the work you guys have done, it's been really interesting

[–]suchenzang 119 points120 points  (18 children)

At this time, we don't have plans to keep access to OpenAI Five public, unfortunately.

[–]dfarhi 132 points133 points  (16 children)

The main difficulty here is that every time Valve releases a game patch, Five's understanding would fall a little further behind.

[–][deleted] 9 points10 points  (7 children)

Is it not possible to keep such an AI continuously "in the loop" by keeping them busy playing throughout the new changes? If it is possible, what would be the main issue to prevent it from being realized? Is energy supply in any way a concern when running a model training perpetually?

[–]d2wraithking 41 points42 points  (3 children)

The amount of compute necessary to keep training a new model is enormous (and thus pretty expensive).

[–]SheepSlapper 21 points22 points  (2 children)

I thought GPUs grew on trees??

[–]pretty blyatkarabuka 4 points5 points  (1 child)

It's far more efficient to just download them...

[–]Be water my friendColopty 14 points15 points  (2 children)

[–]Honest_Banker 6 points7 points  (1 child)

Sell hats then! This community is willing to pay good money for an upgrade of Valve's shitty bots.

[–]meatgrind89 8 points9 points  (0 children)

GabeN has entered the chat

[–]Curiosity is what you lackhearthebell 4 points5 points  (0 children)

Aww we’ll miss them ;_;

[–]jstq 59 points60 points  (21 children)

So after this weekend, the dota part of OpenAI is done?

[–]nadipity[S] 144 points145 points  (20 children)

from dfarhi:

After this weekend we will close out the competitive portion of our project - after beating OG in the 17 hero pool, there's not as much to be gained by pushing further in the competitive direction. Instead, we're going to focus on research and using the Dota 2 environment to test tricky ideas and learn what we can about reinforcement learning and artificial intelligence. Now that we have one of the most complex and deep AI environments out there, it will hopefully unlock the ability to study really important questions about algorithms, exploration, environment structure, and more.

[–][deleted] 12 points13 points  (0 children)

Are there any insights specifically from constructing an AI for Dota 2? Is there something you've learned that pertains to training an AI on this particular game?

[–]Decency 33 points34 points  (9 children)

After this weekend we will close out the competitive portion of our project - after beating OG in the 17 hero pool, there's not as much to be gained by pushing further in the competitive direction.

I don't follow. A tremendous amount of the depth of competitive Dota 2 comes from the interplay between the massive number of entirely distinct heroes available to a team during each draft. Taking a tiny subset of that while ignoring the other 100 heroes, and saying there's not as much to be gained, feels like the equivalent of Deep Blue mastering one line of the Sicilian and declaring victory - it's a very artificial threshold.

[–]Korvacs 39 points40 points  (6 children)

I think the point is more that the model they've built can clearly learn a hero and play it better in almost all cases than a human, at any level of play. Spending more time and money expanding the hero pool doesn't actually achieve anything from a research point of view.

[–]Decency 18 points19 points  (2 children)

I think the point is more that the model they've built can clearly learn a hero and play it better in almost all cases than a human

I don't agree, at least not based on what's been shown publicly. OpenAI can play a hero excellently in all cases where the only heroes in the game are the 17 that have been chosen and trained against. For another way to phrase the argument: with 17 heroes, there are 6188 possible lineups. Just over the course of this weekend, they'll have played about that many games. But when adding the 18th hero, that number doesn't go up linearly- it goes up by about 40%. What happens when you double it, say to 34 heroes? Suddenly there are 278,256 possible lineups: 45 times more than what the AI has trained with for this event.

With the full hero roster, there are 167,549,733 possible 5-hero lineups. So for this weekend, OpenAI is showcasing its mastery of 0.0037% of all possible Dota 2 lineups. It's absolutely an accomplishment - but it's not Dota 2, not by a long shot. Each of these lineups has nuances, similarities, and differences to others that human players have to determine and evaluate on the fly (often having never played a given 5-hero combination together). The AI doesn't - it's played plenty of games with each of these lineups and against each lineup it faces.
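The lineup counts quoted above are binomial coefficients and can be checked directly, e.g. with Python's `math.comb`:

```python
from math import comb

# 5-hero lineups from a pool of n heroes: C(n, 5)
def lineups(n):
    return comb(n, 5)

print(lineups(17))   # 6188
print(lineups(18))   # 8568 (~38% more than with 17 heroes)
print(lineups(34))   # 278256
print(lineups(117))  # 167549733
```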

Another problem is that some of the ways we've seen the AI gain an advantage is through things that aren't at all related to intelligence, tactics, or strategy. Calculating the maximum damage of three spells and an autoattack to an exact value against a given magic resistance and armor isn't "outplaying" a human, it's just out-mathing it. Reaction times were another issue- I know they've tweaked it multiple times to be more accurate, but human reaction times have a variance that players need to account for. You can't just automatically rely on hitting a perfect BKB against a Lion blink->stun because Lion's cast point is 0.3 and your peak reaction time is less than that... that's not realistic at all.

If they've simply chosen to adjust their priorities based on what they've accomplished in Dota2 already, that's understandable. But phrasing that as if Dota2 has somehow been conquered when literally 100 heroes have been ignored (including all of the most complex ones) just seems ridiculous to me- certainly more marketing than science. I'd love to see an article on why they feel that the gap between 17 heroes and 117 heroes is so easily bridged just by throwing hardware at the problem, and what kind of specific training they have to do for each new hero that's introduced.

[–]Korvacs 4 points5 points  (1 child)

This is a good post, but the crux of the issue is simply time: the model can incorporate and master every hero given enough time. That's the only thing OpenAI needs - the model itself is clearly capable of learning and delivering on this scale with enough time. As it stands, I believe the learning process after a new patch for 18 heroes takes two weeks; increasing the pool size dramatically increases the learning time, to the point where it's simply impractical to learn that many heroes from the point of view of a research project. Plus there simply isn't any benefit.

And as I said in another post, the point of this isn't to build the best bot for Dota 2; it's a research project to build a model which can be used in real-world applications. Dota 2 just offers the kind of complex environment that really tests its ability to learn and master tasks, and also gives it a lot of publicity.

The fact that the reaction times aren't exactly fair compared to humans, or that it can do maths more precisely, is irrelevant to the goals of the project - the fact that it can do these things quickly and precisely is actually to its benefit.

[–]jQiNoBi 55 points56 points  (14 children)

How can we be sure that you guys are not an AI as well?

[–]FakePsyho 172 points173 points  (13 children)

Can't be sure. I frequently fail captcha tests.

[–]suchenzang 78 points79 points  (11 children)

+1

[–]FakePsyho 77 points78 points  (10 children)

you know, you can upvote on reddit ;)

[–]suchenzang 70 points71 points  (9 children)

I like typing +1

[–]FakePsyho 50 points51 points  (5 children)

I like typing +1

FTFY

[–]suchenzang 50 points51 points  (4 children)

:(

[–]FakePsyho 40 points41 points  (2 children)

Hi!

[–]carrymugabe 16 points17 points  (1 child)

These AI conversation systems here seem to be pretty close to passing Turing Test.

[–]unluckycowboy 12 points13 points  (0 children)

+1

[–]satosoujirou 23 points24 points  (2 children)

I'm pretty sure you guys are bots.

Please don't destroy humans.

[–]FakePsyho 35 points36 points  (1 child)

We love humans!

[–]pw0300 1 point2 points  (0 children)

That is what a bot would say. A real human hates other humans, you got a lot to learn.

[–]mechkg 51 points52 points  (8 children)

Hi guys. I was wondering how much does it cost to train the bots to the current level of play purely in terms of computational resources if you used AWS or the Google equivalent?

How much would it cost to train the bots to play the full hero roster at the same level?

[–]overminder 89 points90 points  (7 children)

Not from OpenAI, but their website says the latest version takes 800 PFLOPS-days to train. One preemptible TPU v3 unit provides 420 TFLOPS and costs US$2.4/h. So in total that's roughly US$110k. Note that this is a very rough calculation...
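As a quick sanity check on that back-of-the-envelope figure, using only the numbers quoted above:

```python
# All figures from the comment above; the result is a rough estimate.
pflops_days = 800        # reported training compute (PFLOPS-days)
tflops_per_tpu = 420     # one preemptible TPU v3 unit
usd_per_tpu_hour = 2.4

tpu_days = pflops_days * 1000 / tflops_per_tpu  # PFLOPS -> TFLOPS
cost_usd = tpu_days * 24 * usd_per_tpu_hour
print(round(cost_usd))  # 109714, i.e. ~US$110k
```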

[–]crashlnds_player 15 points16 points  (0 children)

It would likely cost more than that, though, since they also need to run smaller experiments and tweaks along the way. That can easily eat a lot of their budget, especially if they train from scratch - though I think they always initialize the network from the previous version's weights.

[–]FakePsyho 170 points171 points  (11 children)

Btw, there's a small easter egg that we hid in the drafting phase. As far as we know, no one has found it yet!

Funnily enough, it's been there since the benchmark match. But since we streamed matches with a custom UI for drafting, no one could see it before.

[–]kmsUFO 322 points323 points  (8 children)

FOUND https://i.imgur.com/F9d78M8.png

Omni

Phoenix

Ember

Naga

Ancient Apparition

Invoker

[–]FakePsyho 129 points130 points  (0 children)

YES!!!

[–]Wivyx 43 points44 points  (3 children)

And the next ban phase includes Faceless and Visage so I bet it spells out FIVE :)

[–]theclarice 19 points20 points  (0 children)

FAVI = NaVi?

[–]MidSolo 8 points9 points  (0 children)

Five = 5 = V

[–]ginnaz 8 points9 points  (1 child)

Are you a genius or something

[–]j2i2t2u2 38 points39 points  (1 child)

Huge congrats to the team. Couple of questions, thanks for answering.
1) Now that you have achieved superhuman performance on this complex game, what is the 6-month roadmap for RL for Dota 2?
2) What is your day-to-day like as an engineer of RL for a MOBA game?
3) What is your (OpenAI's) cooperation with Valve like? To what degree did Valve support you in achieving superhuman AI for Dota 2?

[–]suchenzang 53 points54 points  (0 children)

From @christyopenai:

1) There's still a lot left to understand! The main goal of this project is to research RL, and we've mainly been focused on getting Five to be the best it can. We can now take a step back and figure out why Five works the way it does, and hopefully help to make RL more efficient and train better.

2) Being an engineer means you have to understand Tensorflow, RL, the game engine, basically the entire stack. On a typical day, we might watch replays and see issues with training. Does Five need a new observation? Could the observations be processed in a way that is more optimal? We look at performance reports and try to find ways to crunch down the time. What is the win rate if a hero starts with an extra salve? Our team is made of engineers and researchers, but everyone knows engineering and everyone works together, so engineers frequently do research too. It's a lot of fun to be on this team :)

3) Valve helped us get frozen builds. Since we need to retrain every time there is a new patch, and that upgrading process can be time-consuming, it was important to get a version that wouldn't change.

[–]Yamakasinge 77 points78 points  (6 children)

How much computing power does it cost to run one bot after training is done?

[–]suchenzang 101 points102 points  (4 children)

32 CPU cores is enough to run a game with Five.

[–]mpetrov 86 points87 points  (0 children)

to clarify, this is 32 Intel Skylake cores which are really hyper-threads - so the real number is closer to 16 physical cores to run both the game and the bot.

[–]Petrroll 22 points23 points  (2 children)

So the inference is able to run on CPU in realtime? Any reason for not using GPU?

[–]mpetrov 73 points74 points  (0 children)

It's simpler not to use a GPU for a real time game like Dota because the gains in efficiency from using a GPU are due to being able to batch multiple passes in parallel. However, batching introduces latency / queueing problems which is not ideal for a real time game.

Also, today it would be slightly faster if you do use one GPU per game but that would be insanely expensive compared to a CPU.
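A toy calculation illustrates the queueing problem; the decision interval and batch size here are made-up numbers, not OpenAI's:

```python
# Made-up numbers for illustration only (not OpenAI's measurements).
decision_interval_ms = 133  # assumed gap between a game's action requests
gpu_batch_size = 8          # hypothetical batch needed to use a GPU well

# A single live game emits one request per interval, so filling the
# batch before running it would stall the first request by:
worst_case_wait_ms = (gpu_batch_size - 1) * decision_interval_ms
print(worst_case_wait_ms)  # 931 -- far too slow for a real-time game
```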

[–]jonathanraiman 21 points22 points  (0 children)

a recent laptop :)

[–]TentacularMaelrawn 75 points76 points  (34 children)

What's the decision process for choosing which heroes for the OpenAI Five to train on?

[–]nadipity[S] 109 points110 points  (33 children)

When we first started out, we picked heroes that we thought were easiest for the AI to learn (ranged, straightforward abilities, etc). After we started seeing some progress, we attempted to balance out the pool a bit by adding melee heroes and pos 4 heroes. Next on our list were more fun / interesting heroes, but they unfortunately didn't get to the level where they were as competitive as the original set.

[–]47-11 27 points28 points  (32 children)

Can you tell how many heroes that extended pool includes?

[–]nadipity[S] 65 points66 points  (30 children)

The first 2 we added were Drow and Huskar, and after they were nearly on par with the original set we added Pugna, Pudge, Venomancer, Mirana, and Windranger to see if we could learn new mechanics that didn't exist in the original pool. We also trained a pool of ~80 heroes (excluding summon/illusion heroes) at very low scale to see the impact.

[–]Mr_Enzyme 26 points27 points  (1 child)

The pool of 80 sounds really cool - was there a much bigger drop off in the learning rate than with the pool of 25?

[–]jonathanraiman 2 points3 points  (0 children)

Skill measurements with larger hero pools become a bit tricky. Particularly when you lack good reference opponents that you can regularly measure against to detect learning slowdown. We were able to detect high growth on totally unseen heroes, but it’s anecdotal at this point.

[–]Castature 34 points35 points  (13 children)

Are you guys planning on branching out into other games? Whether they be mobas, rts games, fps etc.

[–]suchenzang 88 points89 points  (1 child)

At this time, we're not planning on branching out to other games. There's still open questions within Dota that we can explore and utilize as an RL environment for research.

[–]LivingOnCentauri 9 points10 points  (0 children)

What are those going to be? There are still a lot of open topics in AI research - are you open to showing those results to the public at some point, if you're satisfied with them?

[–]NitroBubblegum 7 points8 points  (10 children)

There is also DeepMind, for Starcraft 2 that is also smashing the pros

[–]y2kkmac 14 points15 points  (9 children)

That bot's micro was impossible.

[–]Wivyx 28 points29 points  (7 children)

Watching games where the humans win, it feels like OpenAI is quite bad at / not capable of anticipating moves or planning for the long term. They react to what they see, and don't seem to think "we can't see the enemy, they are probably planning a gank/smoked" or "this hero has a tendency to splitpush top, let's set a trap to catch him" like humans would. Do you think these are strict limitations of the AI, or do you think the AI could learn such human-like behaviour if it trained with (high-skilled) humans? Why?

[–]suchenzang 30 points31 points  (2 children)

It's a bit hard to map how Five works to how humans reason about the state of the game. While we may not be able to see it reason explicitly, Five has learned to play in such a way as to counter strategies it developed throughout the course of its training.

If you were to rewatch our first OG match, there was a moment where Five predicted a 95% chance of winning, despite the game appearing even to most of us. Shortly after this prediction, Five wins a team fight and pushes to the high ground, at which point its 95% win prediction finally seemed accurate. Five simply has a different way of approaching how it would achieve its goal of winning, which may or may not map to how humans think about "strategy".

[–]Yamakasinge 79 points80 points  (14 children)

Will we ever see bot play full hero pool dota ?

[–]suchenzang 114 points115 points  (13 children)

We currently don't have plans to expand to the full hero pool, though we may explore this in the future if we were to discover drastic improvements to training efficiency.

[–]I miss the Old Alliance. sheeverThatForearmIsMineNow 61 points62 points  (9 children)

/u/ArgetDota was downvoted for our sins

[–]JackeyWhip 36 points37 points  (2 children)

Wtf are all these downvotes and the "for now" comments, OpenAI already said 5 days ago they'd stopped the learning process.

[–]NPSimco_ 22 points23 points  (1 child)

I didn't vote on that post but tons of people auto downvote anyone who edits just to address downvoting or who calls people retards.

[–]100kV 6 points7 points  (1 child)

Do you foresee any drastic improvements to training efficiency? Or is it just not technologically possible right now?

[–]atlatic 5 points6 points  (0 children)

Innovations in reinforcement learning algorithms could lead to it. OpenAI uses a model-free algorithm. A lot of RL researchers are working on model-based algorithms, which are more data-efficient, but these algorithms still need to be proven on smaller problems before a game as complex as Dota 2 can be attempted.

(Not from OpenAI)

[–]rawriclark 56 points57 points  (10 children)

can you please not close this? i wanna play this forever

[–]nadipity[S] 87 points88 points  (9 children)

We would love to keep it open for people to play but unfortunately, each patch for Dota 2 currently requires additional training to bring the AI up to speed.

[–]JackeyWhip 41 points42 points  (6 children)

So it is not possible to run it on a custom game that would be a copy of the current patch?

[–]nadipity[S] 65 points66 points  (4 children)

That is possible - still takes some maintenance but doable - though more difficult for the wider public to do since it takes upgrading/downgrading the client. Right now we're crossing our fingers and hoping that Valve doesn't have a patch for Dota 2 planned before Arena closes!

[–]PuppeyFacerastla 47 points48 points  (3 children)

I think what he meant was something like this: https://steamcommunity.com/sharedfiles/filedetails/?id=818848098

Then it'd be a custom game (like e.g. Dota Auto Chess)

[–]FakePsyho 18 points19 points  (0 children)

Oh lol, my tired brain filtered out "custom".

Yeah, not possible. At least not without non-trivial modifications. Those are essentially different games.

[–]atlatic 4 points5 points  (0 children)

/u/FakePsyho might not know whether this is possible.

[–]rawriclark 4 points5 points  (0 children)

could you open source the code so others can maintain it and host servers for you guys?

[–]buck614 25 points26 points  (17 children)

How does the AI get vision on itself, friendly units, and friendly structures? Can it 'see' all those at once in real time wherein a normal player only see the native field of view? I hope that makes sense.

[–]jonathanraiman 61 points62 points  (10 children)

OpenAI Five uses the bot api to observe the state of the game. We cannot break the fog of war, however we can see all visible units at once and remember where we saw them last. This means that events far off from the controlled hero are available to us.

We do however cap the number of units we can see during a game and sort by distance to our heroes. This means that when the map is crowded, we only see the closest units.
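A minimal sketch of that kind of capping (hypothetical code, not OpenAI's; positions are plain (x, y) tuples):

```python
import math

# Keep only the `cap` visible units closest to any of our heroes,
# mirroring the distance-sorted observation cap described above.
def nearest_units(units, heroes, cap):
    def dist_to_heroes(u):
        return min(math.dist(u, h) for h in heroes)
    return sorted(units, key=dist_to_heroes)[:cap]

# Example: on a crowded map, distant units drop out of the observation.
visible = [(0, 0), (5, 5), (100, 100)]
ours = [(1, 1)]
print(nearest_units(visible, ours, 2))  # [(0, 0), (5, 5)]
```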

[–]Mr_Enzyme 6 points7 points  (3 children)

So it only looks at the nearest N units, probably prioritizing heroes above creeps? Were there any other areas where you capped the length of a potentially long vector like that (maybe trees or projectiles)?

[–]LvS 12 points13 points  (0 children)

Nature's Prophet with Aghanim's basically makes your hero invisible!

[–]suchenzang 32 points33 points  (5 children)

We have access to an API from which Five is able to access the state of the game. It effectively then sees all these data points in real time - unlike the vision limitation that normal players would have.

[–]RogueCarpet 60 points61 points  (4 children)

I think it's important for people to realize that not all information that human players see is available in these APIs. For example, the bots don't handle Shrapnel very well because the bots can't see where the spell is positioned. You'll notice the bots walk into the Shrapnel briefly, and then when they take damage they realize there's an AoE there and walk away. Similarly, they only can tell where Fissure is by trying to move somewhere and having their pathing be unexpected.

[–]suchenzang 29 points30 points  (0 children)

+1 As with any engineering effort, there will be code paths that we miss and observations that we forget to integrate. The amount of observations that are added to the model during training is definitely a subset of all that is available for a game state at a given moment in time.

[–]nadipity[S] 20 points21 points  (0 children)

Additionally, we're pretty far from fully utilizing everything coming through from API because of the amount of info there and the engineering we'd have to do to support it. Sometimes it took us a significant amount of time before realizing we were the blocker for the AI doing things (such as allowing it to see and attack Gyro's missile).

[–]tutori 9 points10 points  (1 child)

But at the same time, it also allows them to do things that humans cannot, effectively seeing the whole map at once, where we are limited to a screen plus minimap.

[–]Deamon- 23 points24 points  (3 children)

will you ever show us what those bots can do with heroes like ember meepo invoker etc?

[–]nadipity[S] 126 points127 points  (2 children)

We have a few clips at various ability levels for other heroes that we'd love to share once things calm down a bit - some pretty cool (as well as hilariously bad..) game videos =D

[–][deleted] 22 points23 points  (1 child)

Do you have any data on Average MMR of team vs Win Rate against OpenAI?

[–]FakePsyho 42 points43 points  (0 children)

We don't have access to any data that is not publicly available. Which essentially means that we know as much as you do.

[–]dinosaur_noises 38 points39 points  (3 children)

One of the biggest surprises for me was that the relatively simple Proximal Policy Optimization method seems to be successful with the long-term thinking required for success in DotA 2, as you mentioned in your blog post about it. I think it aligns nicely with the recent short essay from Rich Sutton called The Bitter Lesson. I've noticed, though, that both OpenAI Five and the DeepMind SC2 AI seem to do best against humans in short-term tactics and are perhaps just competitive in long-term strategy. It is amazing that a general learning method can be successful in such a complex, cooperative, partial-information setting, but is it really measuring long-term strategic thinking? I know your team thinks carefully about this in limiting response times and ensuring their performance is similar to a human's to avoid beating them only in micro. Do you believe the AI is succeeding in this long-term planning, or is this a weakness? Thanks!

[–]jonathanraiman 37 points38 points  (2 children)

Detecting and measuring long-term planning in strategy games is definitely confounded with other aspects of gameplay. From some preliminary assessments based on extra predictions we make within Five, we find that 60-90s ahead of time we commit to specific towers and objectives in the map.
You can see these predictions as lines going from heroes to towers and lanes in this video: https://s3-us-west-2.amazonaws.com/openai-assets/how-to-train-your-openai-five/game1_og_minimap.mov (more linked here https://openai.com/blog/how-to-train-your-openai-five/#replays)

[–]RogueCarpet 16 points17 points  (2 children)

How are item builds and skill builds handled? I believe an early version of OpenAI had a few pre-selected builds for each hero and the bots would pick between these. Any changes here?

[–]FakePsyho 29 points30 points  (0 children)

We're still using fixed (scripted) item & skill builds. During training they are randomized, so the model is able to learn how to play vs different builds.

We experimented with RL-based item builds and we had promising results. Unfortunately, we ran out of time to make use of them for our Finals & Arena events.

[–]mpetrov 18 points19 points  (0 children)

The builds are mostly preselected but the bots do affect which ones are selected for different games. This is an area where we would love to give more control of it to the bots!

[–]I got jizz on me chinFortheseoccasions 11 points12 points  (6 children)

What are your MMRs?

[–]FakePsyho 39 points40 points  (4 children)

I stopped playing around half a year ago; at that point I was slightly below 3k.

[–]jonathanraiman 9 points10 points  (0 children)

If we train OpenAI Five from scratch, it takes about 24h before I can no longer beat it.

[–]HoNTrashColonelWilly 11 points12 points  (2 children)

I know the team has worked to compensate for the fact that the bots do not have the same physical barriers that humans do by limiting actions per minute or reaction time, but have they considered solutions for the loss of efficiency from how humans are forced to physically interact with the game (moving the mouse, only having so many fingers to press keys, eyes having a cone of focus, etc)?

I ask because, as I'm sure you've considered, the bot can "out-play" a human opponent not through strategy but because we do not have direct I/O to the game.

[–]nadipity[S] 22 points23 points  (1 child)

It's a bit difficult to translate the number of fingers a human has into an equivalent number of milliseconds of delay =D. Overall we're not necessarily going for an exactly even playing field, since the two sides are so inherently different - humans have advantages (e.g., being able to learn from game to game, knowing they're playing an AI) and bots have advantages (they're not human). We're more interested in how the two different paths each side took landed them in a somewhat similar place in terms of approaching Dota.

[–]xpkoala 20 points21 points  (2 children)

Had a blast watching the show with OG and the OpenAI crew. Are any technical papers about the current capabilities available to read? Will raw stats on the matches taking place over the weekend be made public (game win/loss, hero selection, apm, gpm, etc)? You all seem to be having a blast working on the project, wish you all the best as it continues to grow.

[–]nadipity[S] 27 points28 points  (1 child)

From jonathanraiman:

We are planning on posting a follow-up blog post when Arena is over analyzing the results of the games (win/loss, heroes, coop, etc..) and post replay files.

We're also planning a technical paper detailing the work in greater detail. Our blog post contains architecture details and other info in the meantime: https://openai.com/blog/openai-five/ .

[–]sheever FIGHTING !! gogo !!Bokoloony 20 points21 points  (3 children)

So a lot of people argue that since your AI "figured out" DotA, there's no incentive for you to train it against more heroes. 17 (is it?) or 117, it's only a matter of computation power and training. Do you think that's correct?

I wouldn't be surprised if the computation power required to train for 117 heroes is orders of magnitude above what you needed for 17, making it an actual challenge, because the time required is not linear at all but rather quadratic (or exponential, or even factorial, I don't know). How wrong am I?

Another argument is that the other heroes add a lot more diversity, making it a heck of a lot easier to exploit OpenAI's weaknesses (such as split-pushing, or AoE denial spells like Shrapnel, which it's apparently bad against). I guess you could tweak the set of rewards you laid out for it to learn, but would that be enough? Does OpenAI Five adapt its rewards according to the enemy team composition and its own?

[–]suchenzang 54 points55 points  (1 child)

We do agree that there is not much incentive for us to train against more heroes at this time, due to the degree of engineering difficulty in integrating more heroes into our training pipeline and battling issues with our integration with Dota. We've run experiments where our hero pool expanded to 25 and above (up to 80 at one point), and saw that most heroes were able to play at a roughly ~3-5k MMR level within a very short amount of time. This led us to believe that our model was able to transfer these learned behaviors from a small subset of heroes to the rest, without incurring the orders-of-magnitude computation cost that comes with the combinatorial explosion of hero line-ups. We haven't fully validated this theory yet, and we may reconsider exploring this in the future.

[–]HPA97 10 points11 points  (1 child)

Could putting the AI through custom scenarios to teach stuff like smoking/warding/invis be a way to fix the current problems they have with those things? Instead of having them only play the regular Dota map, have a map where they need to get from A to B without being detected (smoke or deward type scenario).

[–]suchenzang 31 points32 points  (0 children)

Yes, we've tried multiple ways of randomizing the environment so that we can place Five into situations where it's easier to learn some of these behaviors. For example, we randomized Roshan's health so that it was easier for Five to discover the value in taking Rosh.
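That kind of environment randomization can be sketched very simply; the numbers and function name below are hypothetical, not taken from Five's actual training setup:

```python
import random

def randomized_rosh_health(base_health=5500, min_frac=0.1):
    """Sample a randomized Roshan max-health for one training game.

    Dropping Roshan's health in a fraction of games makes it far more
    likely that a weak early policy stumbles into killing him and sees
    the associated reward; the behavior can then be refined in games
    where his health is closer to the normal value.
    """
    frac = random.uniform(min_frac, 1.0)
    return int(base_health * frac)

health = randomized_rosh_health()  # different each training game
```

The same trick applies to other hard-to-discover behaviors: randomize the environment so the "aha" state is occasionally cheap to reach, then anneal back toward the real game.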

[–]⬆️heypaps 10 points11 points  (1 child)

Are there any professional fields that have expressed interest in the learning system of OpenAI for practical application?

[–]suchenzang 17 points18 points  (0 children)

We've already utilized the same training pipeline for Five within our robotics team (https://openai.com/blog/learning-dexterity/).

[–]LooseGoose0 8 points9 points  (2 children)

I think the ability of OpenAI Five to be able to cooperate with humans is really interesting, especially as it was not trained to be able to do this. For AI to be able to cooperate and work with humans, rather than just replace them, is really bloody cool. Are there plans for your team to work on this problem moving forwards? Either within Dota or not.

[–]FakePsyho 10 points11 points  (0 children)

Are there plans for your team to work on this problem moving forwards? Either within Dota or not.

Personally, I'd love to further explore this area as it's fascinating both from AI / game design perspective and eventual practical applications.

[–]suchenzang 7 points8 points  (0 children)

As part of our mission (https://openai.com/charter/) this is definitely an interesting area to explore, but we don't have immediate plans on the Dota team to work on this problem going forward.

[–]BubbsTheCuber 18 points19 points  (3 children)

Hey! I wrote a paper about deep learning and the like. Artificial intelligence is really interesting to me. Do you think an artificial general intelligence will be created in the near future? Thanks for the AMA guys!

[–]hponde 32 points33 points  (1 child)

We are working towards that goal. It's part of our charter: https://openai.com/charter/

[–]TheGraysmith 5 points6 points  (0 children)

https://www.youtube.com/watch?v=bIrEM2FbOLU

This podcast will interest you!

[–]Xexos1 17 points18 points  (2 children)

What's the main reason you chose Dota 2?

[–]FakePsyho 52 points53 points  (1 child)

There were a few reasons:

  • Popularity (and huge prize pools)
  • Reflex/micro is a secondary skill
  • Depth (complexity)
  • Availability for Linux
  • API

All of them are equally important.

Complexity gives us a very interesting problem to tackle. Not relying on reflexes makes the game a fairer human-vs-AI testbed. Popularity/prize pools ensure that people have invested countless hours into the game, so we get a proper benchmark for our model. And lastly, Linux support & the API make everything more cost-effective.

[–]fdasilva59 15 points16 points  (0 children)

Any possibility to have a collaboration with Deepmind in order to have AlphaStar and OpenAiFive to compete against each other and have a technical debrief on the approaches, what is working and what is not working ?

I mean both agents competing against each other at both Dota 2 and StarCraft 2. That would give a nice insight into how the two approaches generalize to another competitive environment.

[–]burnmelt 7 points8 points  (8 children)

Any plans to lift all restrictions (heroes, summons, items, etc)?

What is the most interesting thing y’all learned?

Are there any other experiences or information you want to share, but haven’t been asked about yet?

[–]FakePsyho 19 points20 points  (3 children)

What is the most interesting thing y’all learned?

The thing that surprised me the most was that a lot of problems we believed would be extremely hard for AI to learn turned out to be not-that-hard in the end. The best example of this is map rotation during the early game, since it does require a bit of exploration with an immediate loss in reward.

Generally speaking, it seems that as humans we tend to believe that a lot of the things we do are very complex and require a lot of expertise. In the end, it turns out that is not exactly the case.

[–]Phnrcm 6 points7 points  (2 children)

The thing that surprised me the most was that a lot of problems that we believed will be extremely hard for AI to learn turned out to be not-that-hard in the end.

Was there anything that turned out to be unexpectedly hard for AI to learn?

[–]FakePsyho 15 points16 points  (1 child)

Yeah

  • Warding is way worse than expected
  • Item swapping through RL (we had to revert back to scripted)
  • Power threads switching
  • Figuring out to get melee rax instead of ranged rax (although there is a small chance we're all wrong here)

Some of those are probably due to bugs/mistakes on our side. With such a complex project, it's honestly very hard to tell if something went wrong because "it's hard for AI" or because "humans did something wrong". There are so many places where things can go wrong (engineering bugs, bad design of the training/network, unexpected Dota behavior, lack of understanding of the environment, random bugs in the network architecture, gradients going crazy for some reason) that sometimes we just had to scrap an idea and start from scratch.

[–]suchenzang 22 points23 points  (3 children)

Right now, we don't have plans to continue lifting restrictions and building a better agent to play as Five. We are definitely surprised how far we were able to push the limits of existing algorithms by scaling up to the scale that we have for training Five. We were also surprised about our ability to transfer the model across different patches of Dota and continue training, while growing the model at the same time.

[–]SFKillkenny 7 points8 points  (0 children)

When the two teams of AI vs each other do they both predict the same win probabilities or do they predict separate ones because of the lack of information. Also if they do predict separate how big is the discrepancy usually and have you ever had both teams thinking they are ahead before?

[–]turingalan_ 14 points15 points  (7 children)

Kudos to OpenAI team for AMA!
First and foremost, congrats on the win against OG; it's big for both the AI and DotA communities, and it shifts the perspective on how well simple algorithms can actually scale, to the point of beating the best human players.

I have a couple of questions for anyone who could address them:

  • Have you observed any hierarchical behavior in how the agent controls its hero when it plays with other AIs on the team vs. in collab mode? E.g., would the frequency of the actions the agent takes be much higher because of the uncertainty the human teammate introduces?
  • On Twitter, Ilya Sutskever mentioned that the agent was trained continuously for 10 months. Any insights on how different that is from the regular lifecycle of other ML/RL projects, where training is almost always started from scratch? What were the challenges, and what worked best?
  • And lastly, one of the goals of the project was to demonstrate the capability of scaling the algorithms to an absurd level (in today's computational-resource terms). What other things have you learned, and what do you expect to learn by continuing to work on this project?

Thank you!

[–]suchenzang 18 points19 points  (5 children)

  • We haven't fully researched our coop mode and how Five behaves differently. The Arena will provide some interesting data this weekend for us to dive into this.
  • As far as we know, training Five continuously over 10 months is rather unusual for RL projects. There were definitely challenges in building out the tooling to perform "surgery" on parameters, carrying them from one version of the model to the next as we grew the model over time. Beyond the dimension/shape errors that come up, there were many instances where surgery failed silently, and we ended up with Five suddenly behaving very strangely after an experiment restart.
  • We will definitely be diving deeper into our learnings over the next few months. In our push to develop Five, there were a lot of decisions that were made where it wasn't 100% clear whether or not they benefitted Five's learning curve. We hope to examine each of these in detail and release as much of our findings as we can.
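One family of techniques for this kind of parameter "surgery" is Net2Net-style function-preserving growth. The toy sketch below widens a single linear layer by duplicating existing output units; it is an illustration of the general idea, not the tooling described above (and a real pipeline would also have to split the consumer layer's incoming weights so the network computes the same function before training resumes):

```python
import numpy as np

def widen_linear_layer(w, b, new_out):
    """Grow a linear layer from `w.shape[1]` to `new_out` output units.

    New units are copies of randomly chosen existing units, so the layer's
    output space carries the same information; downstream layers must be
    adjusted accordingly for the whole network to stay function-preserving.
    """
    old_out = w.shape[1]
    assert new_out >= old_out
    # indices of existing units to duplicate for the extra outputs
    extra = np.random.randint(0, old_out, size=new_out - old_out)
    w_new = np.concatenate([w, w[:, extra]], axis=1)
    b_new = np.concatenate([b, b[extra]])
    return w_new, b_new

w = np.arange(12.0).reshape(3, 4)   # 3 inputs -> 4 outputs
b = np.zeros(4)
w2, b2 = widen_linear_layer(w, b, new_out=6)
```

The silent-failure mode mentioned above is easy to imagine here: forget to fix up one consumer layer and everything still runs, with valid shapes, but the grown model's behavior quietly diverges.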

[–]TweetsInCommentsBot 5 points6 points  (0 children)

@ilyasut

2019-04-15 17:45

OpenAI Five was trained continuously for 10 months. Typical ML models are trained in under 2 weeks. The most capable ML systems of the future will be trained for an even longer time. https://twitter.com/gdb/status/1117845462608826368



[–]Ziggy_st 13 points14 points  (4 children)

Do you think it would be better if bots could train with only +1/-1 rewards for winning/losing, instead of RL with rewards for 'small' things like CS, wards, towers, etc.?

[–]nadipity[S] 16 points17 points  (2 children)

It'd definitely be interesting, and it would open up opportunities for the AI to learn ways to win that potentially don't follow the typical path of a Dota game. We did try this with 1v1 and saw some success, but haven't attempted it with 5v5.

[–]savvy_eh 6 points7 points  (0 children)

The smaller rewards seem to be a 'shortcut' to encourage the desired behavior to occur more quickly than it would organically, so it can be 'learned'.

The OAI5 team spent ten months training the current iteration. Imagine how long it would've taken if the AI had to first learn that hitting creeps might give gold, and that having gold might increase the chance of winning - or that taking damage might mean dying, and dying might mean losing.
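The shaped-vs-sparse trade-off above can be sketched in a few lines. The event names and weights here are made-up illustrations, not OpenAI Five's actual reward table:

```python
def shaped_reward(events, shaping_weight=1.0):
    """Combine a sparse win/loss signal with dense shaping terms.

    `events` is a dict of per-game quantities. Annealing `shaping_weight`
    toward 0 over training recovers the pure +1/-1 objective once the
    policy is competent enough to reach wins on its own.
    """
    dense = (
        0.16 * events.get("last_hits", 0)
        + 1.0 * events.get("towers_taken", 0)
        - 1.0 * events.get("deaths", 0)
    )
    sparse = events.get("win", 0) - events.get("loss", 0)  # +1 / -1
    return sparse + shaping_weight * dense

r = shaped_reward({"last_hits": 10, "win": 1})
```

The dense terms give the early, random policy a gradient toward behaviors (farming, taking towers, not dying) that correlate with the sparse signal it would otherwise almost never see.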

[–]FeIiix 5 points6 points  (1 child)

What hardware setup are the agents currently playing in the arena running on?

Have you done tests/benchmarks on how much different hardware affects agent performance?

Are there plans to release the trained model to the general public?

[–]hanmas_aaa 4 points5 points  (8 children)

Any plan to tone down the AI's reaction time so they can't instantly eul/hex blink initiators? Actually, are those plays really 200ms?

[–]nadipity[S] 47 points48 points  (7 children)

It's actually a bit less about pure reaction time and more about the AI never being surprised by the play. The real solution to making it more human-like would be to dynamically nerf the response depending on whether the play is coming from out of vision or whether it would otherwise be unexpected. When a human and the AI are racing to accomplish the same expected thing (such as grabbing the bounty rune), the human almost always wins.

[–]Kitchen_Owl 4 points5 points  (2 children)

Is there a way that the bots could learn other methods to win apart from the 5-man deathball observed in the games? Not that I'm saying it isn't effective, just curious whether they are capable of playing from behind (let's say), where one major win condition is ratting (destroying buildings) while the other team members engage in a fight. In short, can various strats be considered as early as the drafting phase against specific teams with specific playstyles?

[–]suchenzang 21 points22 points  (0 children)

There definitely is a way for Five to learn other methods, but we haven't explicitly encoded any of the strategies that Five ended up discovering (in this case, the 5-man deathball strat).

The goal of this project was to let Five discover these strategies through the process of training and selfplay, as opposed to explicitly enforcing a playstyle that mimics those from humans.

[–]FakePsyho 15 points16 points  (0 children)

The strats are way more varied than just a 5-man death push.

Due to its incredible teamfight coordination, a 5-man push is just way scarier in Five's hands than in humans' hands. Since Five only plays against itself, it greatly undervalues the expected power of a 5-man push vs. human players.

[–]kamelasa11 4 points5 points  (1 child)

Firstly, I love what you guys have done! Amazing work :-) Will there be more heroes in the mix any time soon? And please make it available to the public some time in the future as well!

I was looking at the architecture of your neural network and was confused about one thing. For each of the five heroes on one team, you take N units into account at any point (such as creeps, heroes, etc., which makes sense). But you need a fixed-size vector to feed your network. Is the procedure here to just take the max value for each element (some form of max-pooling)? E.g., if you have two units represented by the vectors [10, 7, 8] and [1, 2, 15], then the resulting vector is [10, 7, 15]. But say you are looking at a thousand units, and the max over those also results in [10, 7, 15]; those two states are not equal, even though the resulting vectors are. I guess max pooling also has this issue in 2D, but not to the same extent as here..
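The question's example can be made concrete. Below is a minimal sketch of elementwise max-pooling over per-unit vectors; it is a simplified stand-in for whatever set-processing layers Five actually uses, not their implementation. The collision the commenter describes is real, but in practice the per-unit vectors are learned embeddings rather than raw features, so the network can arrange for the information it needs to survive the max:

```python
import numpy as np

def pool_units(unit_embeddings):
    """Collapse a variable number of per-unit vectors into one
    fixed-size vector via elementwise max-pooling."""
    return np.max(np.stack(unit_embeddings), axis=0)

# Exactly the commenter's example: two different unit sets can pool
# to the same vector.
pooled = pool_units([np.array([10, 7, 8]), np.array([1, 2, 15])])
# pooled == [10, 7, 15]
```

Because `pool_units` accepts any number of inputs, the same network can observe 2 units or 1000 without changing shape; that permutation-invariance and size-invariance is the whole point of pooling over sets.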

[–]JustAprofile 4 points5 points  (2 children)

It seems that while the bots reached a far more optimal learned methodology for playing Dota, they still lagged behind in active reasoning in the middle of the game: operating only from a constrained set of parameters without employing strategy or creativity, specializing in a narrow discipline and excelling along those lines, possibly above any competitive team. Does there exist a way to instill creativity, or even narrow forms of higher-order reasoning, using either software or hardware solutions, to emulate some smaller parts of cognition?

[–]suchenzang 5 points6 points  (1 child)

OpenAI is currently forming a reasoning team to explore these topics (https://twitter.com/gdb/status/1116381180079656960)

[–]Lagmawnster 4 points5 points  (0 children)

As a (finishing) PhD student in computer science, currently working on my third publication involving deep learning and transfer learning: do you have any recommendations as to what could make my profile particularly interesting to companies like OpenAI? I know the general profile you're looking for from your recruiting pages, but would, for example, a deep-learning side project applying state-of-the-art methods to Dota 2 be worth noting?

[–]buck614 3 points4 points  (10 children)

How often will the AI update this weekend? After every game, day, or after the weekend is over? Also ... any additional info on how the AI updates after it finishes matches would be great!

[–]nadipity[S] 25 points26 points  (5 children)

from dfarhi:

The AI is not updating at all from the Arena games; we exported a frozen model from the training pipeline a few days ago. It has kept training against itself in the past few days, but we probably won't pull a new model, because the difference would be too minor to be worth the technical risk that comes with any change.

It might be an interesting research avenue to pursue incorporating human games into training, but with our current process those games would just get drowned out when averaged together with the millions of bot-vs-bot games. Fun fact: since opening, the Arena still has not produced as much total gameplay data as a single iteration (~1 min) of training.

[–]buck614 5 points6 points  (3 children)

I assume the 0.07% (currently) of games won by non-killing-machines will be looked at in some way. How do you analyze those games? Just curious.

[–]suchenzang 13 points14 points  (2 children)

The team will watch them and see if we find anything unusual. :)

[–]suchenzang 26 points27 points  (3 children)

Five will not learn from the games that are played during this weekend - it currently only trains via selfplay (games against itself). It currently does not train with any data taken from games between human players.

[–]buck614 7 points8 points  (2 children)

So this is purely a widely distributed public test of the AI ... not really incorporating its experiences over the weekend to learn upon?

[–]i will reach 1.83 , believe meNortrom_ 3 points4 points  (1 child)

Sorry to ask, but how can I play against the OpenAI bots?

[–]nadipity[S] 17 points18 points  (0 children)

If you go here -> https://arena.openai.com/#/, you can create a login linked with your Steam account and then request a server to play!

[–]TheSausageKing 2 points3 points  (0 children)

How do you feel about OpenAI changing from an open, non-profit, to a for-profit entity that keeps some research proprietary? Has it affected the work you do or your view of the organization?

[–]surrealmemoir 6 points7 points  (2 children)

Have you run into difficulties letting bots perform "big jumps" in their strategies? My understanding of deep learning is that with gradient descent, you usually make only small changes to the strategy each time.

For example, "macro" strategic decisions like 5-man vs. split push may deviate from each other significantly. If the bot is being improved mostly by self-play, how would you adapt if it turns out the split-push strategy is effective?

[–]suchenzang 16 points17 points  (0 children)

It's a bit unintuitive how strategy space would map onto some metric space on which we can perform gradient descent. The fact that we see Five learn these 5-man strategies doesn't necessarily imply that it's a "leap" to go to split push, given that we can't really quantify how far apart these "strategies" are in how we have parameterized our model.

[–]realjebby 7 points8 points  (2 children)

With AlphaStar there was the issue of it being too good at micro aspects ("mechanical skill") compared to a human, and such an advantage feels like a kind of cheating, like an aimbot in a shooter. I think OpenAI Five has a similar issue. It's just too good at mechanical-skill things like right-clicking (with Sniper) and casting spells (all 5 bots perfectly focusing someone) in a teamfight, but shows no signs of understanding the big picture (the macro aspect).

So which of two options would you prefer: developing a strong brute-force bot that can defeat any human team using that artificial mechanical-skill advantage, or a mechanically weak bot (below average skill) that can sometimes win by using different strategies, showing some kind of adaptation to what the opponent is doing ("understanding" the big picture)?