all 19 comments

[–]iamquah 4 points5 points  (1 child)

What is a "symbolic method" exactly? I Googled "symbolic method nle" and didn't really find anything pertinent. I can't watch the vid so I thought I'd just ask away.

TIA!

[–]timthebaker 2 points3 points  (0 children)

For the NetHack competition, "symbolic" was defined as any non-neural-network approach.

Agent not using a neural network or significantly similar modeling technique.

In AI, there are two major schools of thought. One is "connectionist", which you can think of as basically neural-network-like approaches. These are systems usually consisting of nodes, and what you learn is the connection strengths between the nodes. They are meant to be very flexible and are inspired by the brain, which is understood to be a network of neurons. Sometimes connectionist architectures are called "sub-symbolic" because they exist below the level of symbols (i.e., symbols are more abstract than a network with learnable weights, and symbolic reasoning can supposedly emerge from such a network).
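A toy sketch of what "learning connection strengths between nodes" looks like in practice (a completely made-up two-layer network; the weight matrices are the only thing that would be learned):

```python
import numpy as np

# Toy "connectionist" model: two layers of nodes whose connection
# strengths (the weight matrices W1, W2) are the only things learned.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(5, 8))   # input nodes -> hidden nodes
W2 = rng.normal(size=(8, 1))   # hidden nodes -> output node

def forward(x):
    hidden = np.tanh(x @ W1)   # activations flow along weighted connections
    return hidden @ W2         # output is just another weighted sum

x = rng.normal(size=(1, 5))
print(forward(x))              # the "knowledge" lives in W1/W2, not in explicit symbols
```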

The other school is sometimes called "symbolic AI", which refers to methods that manipulate symbols. I'm much less familiar with these besides knowing that they used to be popular. A basic symbolic AI approach for, say, chess, would be to give the agent a set of rules such as "a queen is more valuable than a rook, a bishop is more valuable than a pawn, controlling the center squares is valuable" and then let the agent learn how to weight the significance of those rules. The term "expert system" also comes to mind.
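A rough sketch of that kind of weighted-rule evaluator (the rules and weights here are made up purely for illustration):

```python
# Hypothetical rule-based chess evaluator: the rules are written by a human,
# and only the weights on those rules would be tuned.
PIECE_VALUES = {"pawn": 1, "bishop": 3, "knight": 3, "rook": 5, "queen": 9}
CENTER_SQUARES = {"d4", "d5", "e4", "e5"}

def evaluate(position, weights):
    material = sum(PIECE_VALUES[p] for p in position["my_pieces"].values()) \
             - sum(PIECE_VALUES[p] for p in position["their_pieces"].values())
    center_control = sum(1 for sq in position["my_pieces"] if sq in CENTER_SQUARES)
    return weights["material"] * material + weights["center"] * center_control

position = {
    "my_pieces": {"e4": "pawn", "d1": "queen"},
    "their_pieces": {"e5": "pawn", "a8": "rook"},
}
print(evaluate(position, {"material": 1.0, "center": 0.1}))
```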

In the 1900s, symbolic approaches dominated AI and received most of the attention and funding. In the 2000s, thanks to increases in computing power and the success of algorithms like convolutional neural networks, connectionist architectures have come to dominate. Despite what anyone tells you, both approaches have merit.

[–]timthebaker 2 points3 points  (12 children)

Saw the results on twitter a few weeks ago and thought NLE was a neat challenge for AI. Not only was the best approach (yours) symbolic, but in general the symbolic entries took the top 3 spots over "neural" approaches, which was cool. Congrats on winning. I haven't had time to go through the results in detail yet, but I'm hoping to pop into the discussions on this thread.

Michel, why do you think symbolic approaches outperformed in this competition, what is deep RL missing?

[–]procedural_only[S] 2 points3 points  (0 children)

Michel, why do you think symbolic approaches outperformed in this competition, what is deep RL missing?

I think there are actually multiple reasons for that, and even after eliminating some of them, symbolic methods may still be more applicable. Here are some initial reasons/ideas we came up with:

1. lack of some innate human priors:

a) objectness -- a NN needs to create the abstraction of an object just by looking at the ASCII characters. Objects are items, monsters, walls, doors, etc., and they all share some common properties (e.g. you can kick all of them). This applies only if we feed the network somewhat "raw" observations without any action space transformation (see the sketch after this list).

b) priors about how physics works -- like what happens if you throw something in a direction, or when you drop something

c) innate notions about natural numbers -- NNs notoriously struggle to learn arithmetic properly

d) priors about orientation and navigation in a somewhat 2D/3D space (non-euclidean though)

2. lack of some human-acquired priors:

a) generic ones like: what a weapon is, how many hands you (usually) have, what you can possibly do with a potion/fluid (i.e. drink it, dip something in it, throw it?), etc.

b) lack of knowledge present on e.g. the NetHack Wiki -- though in theory one could try to incorporate this knowledge by e.g. using a pre-trained NLP model on it for feature extraction.

3. Problems that make this environment hard from the perspective of currently known RL algorithms:

a) highly partial observations -- the agent needs to build a complex game-state representation during an episode

b) sparse rewards -- the score comes mostly from killing monsters

c) long episodes
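To illustrate 1a: a symbolic agent effectively gets "objectness" for free through a hand-written glyph table, while a network fed raw observations has to discover that abstraction on its own. A minimal toy sketch (the glyph meanings below are illustrative, not the exact NLE encoding):

```python
# Hypothetical glyph table: a symbolic bot is handed this mapping, while a NN
# working from raw characters has to learn the object abstraction itself.
GLYPH_TYPES = {
    "@": "player",
    "d": "monster",   # e.g. a dog/jackal-class glyph
    "!": "item",      # potion
    "|": "wall",
    "-": "wall",
    "+": "door",
    ".": "floor",
}

def parse_map(ascii_rows):
    """Turn a raw character grid into a list of (type, glyph, row, col) objects."""
    objects = []
    for r, row in enumerate(ascii_rows):
        for c, ch in enumerate(row):
            kind = GLYPH_TYPES.get(ch)
            if kind and kind != "floor":
                objects.append((kind, ch, r, c))
    return objects

screen = [
    "----+",
    "|@.d|",
    "|.!.|",
    "-----",
]
print(parse_map(screen))
```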

We have actually tried an experiment with training MuZero on a simplified action space, but we couldn't improve our score.

[–]moschles 1 point2 points  (9 children)

Why do you think symbolic approaches outperformed in this competition, what is deep RL missing?

You can always code up a bot for a specific game. And that bot will out-compete those agents required to learn it from scratch. The reason is not mystical -- it is because a coded bot is endowed with all the cognitive heavy lifting already done for it by a human programmer.

[–]timthebaker 1 point2 points  (8 children)

Well, to be fair, Alpha Zero learns from scratch and outperforms all traditional game-specific chess AI, which seems like a counterexample to your point. A bot with hand-crafted features will always serve as a good baseline, though, and I guess I am most curious about what NN-based agents fail to learn in the NetHack setting.

[–]moschles 1 point2 points  (0 children)

Alpha Zero

Alpha Zero was trained by a gigantic research outfit called Deepmind London. Those researchers have something like 2000 TPUs, and the models cost over a million dollars to train. NLE is a NetHack competition among 'teams' with a prize of $20,000. (If one of those 'teams' had that kind of resources, I'm convinced that their 795-million-parameter model would trounce the competition.) But it seems to me the more provincial answer is probably correct: the "symbolic" approaches take the top 3 slots because they are hand-coded.

[–]gor-ren 1 point2 points  (6 children)

Alpha Zero learns from scratch and outperforms all traditional game-specific chess AI

Yes, and it was hailed as a major breakthrough exactly because of this. It also required an ungodly amount of training to get to that performance, despite chess' relatively simple premise (8x8 grid, handful of pieces with different movement rules, deterministic, perfect state observations).

NetHack is vastly more complex than chess... large maps, different behaviour on different levels, weird/obtuse rules that will kill you or make you powerful, non-determinism, very limited observations, and so on.

I will guesstimate (with the caveat I'm not on the cutting edge of RL/ML by any means) that the RL agents used for this competition could not be given enough training to learn the idiosyncrasies of NetHack well enough to beat the symbolic bots. This touches on a weakness of current RL algorithms: poor sample efficiency.

Anyway, I think your real point is that symbolic approaches, whose good behaviour is hand-encoded from domain knowledge, aren't inherently better than a general RL agent that learns optimal behaviour through training. But you can appreciate that in a domain where current RL approaches can't learn well enough, the symbolic approaches win... for now :)

[–]timthebaker 1 point2 points  (4 children)

For sure, I'm really curious to see when/if NN-based approaches ever overtake the symbolic ones. Honestly, I could see a hybrid approach being attractive. Let a NN learn to make decisions, but code in a lot of the game's interactions symbolically.

[–]gor-ren 0 points1 point  (3 children)

You might be interested in "reward shaping", a way to encode human domain knowledge into an RL reward function to give agents a trail of breadcrumbs to follow.
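As a made-up NetHack-flavoured example, a shaped reward might add small breadcrumbs on top of the sparse score delta:

```python
# Hypothetical shaped reward: the environment's sparse score delta plus small
# hand-designed bonuses (breadcrumbs) reflecting human knowledge of the game.
def shaped_reward(score_delta, went_downstairs, newly_explored_tiles):
    reward = score_delta                     # the original, sparse signal
    reward += 5.0 if went_downstairs else 0  # descending is usually progress
    reward += 0.01 * newly_explored_tiles    # mild incentive to explore
    return reward

print(shaped_reward(score_delta=0, went_downstairs=True, newly_explored_tiles=12))
```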

[–]timthebaker 0 points1 point  (2 children)

Oh, neat. I'd bet some super hardcore folk probably hate this idea, but I'm game.

Is there a specific paper?

[–]gor-ren 1 point2 points  (1 child)

The classic paper is Policy invariance under reward transformations (more plainly: how to modify the reward function while ensuring the optimal policy doesn't change). It's a very formal and rigorous paper, though, and you might get further by reading the intro sections of papers that apply reward shaping instead.
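The paper's central trick is potential-based shaping: add F(s, s') = γ·Φ(s') − Φ(s) to the reward and the optimal policy provably stays the same. A minimal sketch (the potential function Φ below is a made-up NetHack-style example):

```python
# Potential-based shaping (Ng, Harada & Russell, 1999): adding
# F(s, s') = gamma * phi(s') - phi(s) to the reward leaves the optimal
# policy unchanged. phi() here is a made-up NetHack-style potential
# (deeper dungeon level = more promising state).
GAMMA = 0.99

def phi(state):
    return 10.0 * state["dungeon_level"]

def shaped_reward(env_reward, state, next_state):
    return env_reward + GAMMA * phi(next_state) - phi(state)

s, s_next = {"dungeon_level": 1}, {"dungeon_level": 2}
print(shaped_reward(0.0, s, s_next))  # 0 + 0.99 * 20 - 10 = 9.8
```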

e: I remember finding this YouTube video useful https://www.youtube.com/watch?v=0R3PnJEisqk

[–]timthebaker 1 point2 points  (0 children)

Great, thank you for the pointers

[–]jms4607 0 points1 point  (0 children)

The “ungodly” amount of training will seem laughable in 20 years

[–]moschles 2 points3 points  (3 children)

The winning agent isn't based on reinforcement learning in the end, but the victory of symbolic methods in this competition shows what RL is still missing to some extent -- so I believe this subreddit is a good place to discuss it.

No, RL is not "missing" something provided by symbolic methods. The symbolic methods are specifically tweaked to the game itself, using what researchers call "domain knowledge". Domain knowledge is the whole crux of the matter with Deepmind's Atari-playing agents: those agents learned the games starting only from raw pixels, without the aid of human beings pre-labelling the entities that appear on the screen. In the case of NetHack, you can come along and hand-code symbols that correspond to the primary entities that appear in the game world. Such software systems will necessarily outperform the deep learning agents, which have to create all the "entities" from scratch by uncovering their invariant features.

In short: you can always code up a bot for a specific game. And that bot will out-compete those agents required to learn it from scratch. The reason is not mystical -- it is because a coded bot is endowed with all the cognitive heavy lifting already done for it by a human being.

[–]timthebaker 1 point2 points  (2 children)

I posted in another comment, but I'll reiterate here on this top-level comment.

you can always code up a bot for a specific game. And that bot will out-compete those agents required to learn it from scratch. The reason is not mystical -- it is because a coded bot is endowed with all the cognitive heavy lifting already done for it by a human being.

This is incorrect. Alpha Zero is given nothing more than the rules of chess, and it learns, from scratch, to play better than any other bot. The reasoning is that a set of rules hand-crafted by a human is likely to be incomplete and biased. For example, in chess, sacrificing a piece goes against the standard rule of "trading even" or "trading up." That's a shallow example which you can argue against, but it captures the fact that many rules have exceptions, and exceptions to rules have exceptions of their own, etc.

It is hard to come up with a set of rules because so many rules have exceptions and because we often aren't even aware of what we humans are doing subconsciously when we make decisions. That being said, a bot with hand-crafted rules will always be a good baseline to measure against.

[–]moschles 1 point2 points  (1 child)

That's a shallow example which you can argue against, but it captures the fact that many rules have exceptions, and exceptions to rules have exceptions of their own, etc.

Your Alpha Zero example is shallow for other reasons. Those agents are trained by expensive research outfits, not by 'teams' with access to maybe a few PCs. Centers like Deepmind and OpenAI are training models that cost millions.

I stick to my original assertion: the "symbolic" NetHack-playing agents are in the top 3 of the competition because they are hand-coded bots.

[–]timthebaker 1 point2 points  (0 children)

I stick to my original assertion: the "symbolic" NetHack-playing agents are in the top 3 of the competition because they are hand-coded bots.

I agree with the above.

you can always code up a bot for a specific game. And that bot will out-compete those agents required to learn it from scratch.

I disagree with the above from your original comment. I agree the resources used for Alpha Zero are absurd, but it is a direct counterexample to the quote.

The NetHack competition was built in part by Facebook AI and hosted at NeurIPS, the most prominent machine learning conference. I think there was plenty of incentive for the big spenders in AI to throw some money at winning the competition, for the good PR and for the love of solving hard problems.

[–]rogal_the_stubborn 0 points1 point  (0 children)

Hey u/procedural_only, congrats on winning the challenge! Great result!

I was wondering if your agent is available online; I'm writing a paper and would like to benchmark an (inferior) learned agent against it. Thanks!