‘Nethack AI’ directory
- See Also
- Links
- “Zork-Bench: An LLM Reasoning Eval Based on Text Adventure Games; a Tale As Old As Time, or at Least As Old As Computers”, Aiken 2026
- “Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?”, Gerrits 2026
- “My First NetHack Ascension, and Insights into the AI Capabilities It Requires”, Henaff 2025
- “BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games”, Paglieri et al 2024
- “Playing NetHack With LLMs: Potential & Limitations As Zero-Shot Agents (NetPlay)”, Jeurissen et al 2024
- “Diff History for Neural Language Agents”, Piterbarg et al 2023
- “Motif: Intrinsic Motivation from Artificial Intelligence Feedback”, Klissarov et al 2023
- “Dungeons and Data: A Large-Scale NetHack Dataset”, Hambro et al 2022
- “E3B: Exploration via Elliptical Episodic Bonuses”, Henaff et al 2022
- “MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, Samvelyan et al 2021
- “The NetHack Learning Environment”, Küttler et al 2020
- “The Tactical Amulet Extraction Bot: Predicting and Controlling NetHack’s Randomness”
- “BALROG”
- “You Have a Sad Feeling for a Moment, Then It Passes”
- “SWAGGINZZZ”
- Wikipedia (1)
- Miscellaneous
- Bibliography
See Also
Links
“Zork-Bench: An LLM Reasoning Eval Based on Text Adventure Games; a Tale As Old As Time, or at Least As Old As Computers”, Aiken 2026
“Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?”, Gerrits 2026
“My First NetHack Ascension, and Insights into the AI Capabilities It Requires”, Henaff 2025
My First NetHack ascension, and insights into the AI capabilities it requires
“BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games”, Paglieri et al 2024
“Playing NetHack With LLMs: Potential & Limitations As Zero-Shot Agents (NetPlay)”, Jeurissen et al 2024
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents (NetPlay)
“Diff History for Neural Language Agents”, Piterbarg et al 2023
“Motif: Intrinsic Motivation from Artificial Intelligence Feedback”, Klissarov et al 2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
“Dungeons and Data: A Large-Scale NetHack Dataset”, Hambro et al 2022
“E3B: Exploration via Elliptical Episodic Bonuses”, Henaff et al 2022
“MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, Samvelyan et al 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
“The NetHack Learning Environment”, Küttler et al 2020
“The Tactical Amulet Extraction Bot: Predicting and Controlling NetHack’s Randomness”
The Tactical Amulet Extraction Bot: Predicting and controlling NetHack’s randomness
“BALROG”
“You Have a Sad Feeling for a Moment, Then It Passes”
“SWAGGINZZZ”
Wikipedia (1)
Miscellaneous
https://ai.meta.com/blog/launching-the-nethack-challenge-at-neurips-2021/https://ai.meta.com/blog/minihack-a-new-sandbox-for-open-ended-reinforcement-learninghttps://www.aicrowd.com/challenges/neurips-2021-the-nethack-challengehttps://www.reddit.com/r/nethack/comments/2tluxv/yaap_fullauto_bot_ascension_bothack
Bibliography
https://www.lowimpactfruit.com/p/zork-bench-an-llm-reasoning-eval: “Zork-Bench: An LLM Reasoning Eval Based on Text Adventure Games; a Tale As Old As Time, or at Least As Old As Computers”,https://arxiv.org/abs/2411.13543: “BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games”,