NetHack: Fast & Complex Learning Environment For Testing RL Agent Robustness & Generalization

NLE is a procedurally generated environment for testing the robustness and systematic generalization of RL agents.

2020-07-01

Reinforcement Learning (RL) agents seem to get smarter and more powerful every day. But while it is relatively easy to compare a given agent’s performance against other agents or human experts for example in video gameplay, it is more difficult to objectively evaluate RL agents’ robustness and their all-important ability to generalize to other environments.

The NetHack Learning Environment (NLE) aims to solve this problem. Introduced by a team of researchers from Facebook AI, University of Oxford, New York University, Imperial College London, and University College London, NLE is a procedurally generated environment for testing the robustness and systematic generalization of RL agents. The name “NetHack” is taken from a popular, procedurally generated dungeon exploration role-playing video game that helped inspire the new environment.

Most of the recent advances in RL and Deep Reinforcement Learning have been driven by the development of novel simulation environments. However, the researchers found that while existing RL environments tend to be either sufficiently complex or based on fast simulation, they are rarely both.

NetHack, the researchers say, combines lightning-fast simulation with very complex game dynamics that are difficult even for humans to master — enabling RL agents to experience billions of steps in the environment in a reasonable time frame while still challenging the limits of what current methods can achieve. NetHack is thus sufficiently complex to drive long-term research on problems such as exploration, planning, skill acquisition and language-conditioned RL; while also capable of dramatically reducing the computational resources required to gather a large amount of experience.

The NetHack Learning Environment (NLE) is built on NetHack 3.6.6, the latest available version of the game, and is designed to provide a standard, turn-based RL interface around the standard NetHack terminal interface.

NetHack’s extremely large number of possible states and environment dynamics present interesting challenges for exploration methods. In order to master NetHack gameplay, even human players often need to consult external resources to identify critical strategies or discover new paths forward. This makes it more challenging for an agent to acquire required game skills such as collecting resources, fighting monsters, eating, manipulating objects, casting spells, taking care of their pets, etc.

The researchers say NLE can make it easy for researchers to probe the behaviour of their agents by defining new tasks with just a few lines of code. To demonstrate NetHack as a suitable testbed for advancing RL, the team released a set of initial tasks in the environment that include navigating to a staircase while being accompanied by a pet, locating and consuming edibles, collecting gold, scouting to discover unseen parts of the dungeon, and finding an oracle.

They also introduced baseline models trained using IMPALA and Random Network Distillation — a popular exploration bonus that enables agents to learn diverse policies for early stages of NetHack — and performed a qualitative analysis of their trained agents to demonstrate the benefit of NetHack’s symbolic observation space.

Currently built on NetHack 3.6.6, the researchers plan to base future NLE versions on NetHack 3.7 and beyond. The 3.7 version, once released, will add a new “Themed Rooms” feature to increase the variability of observations. Researchers will also introduce scripting in the Lua language to enable users to create custom sandbox tasks.

The paper The NetHack Learning Environment is on arXiv.

Journalist: Yuan Yuan | Editor: Michael Sarazen

45 comments on “NetHack: Fast & Complex Learning Environment For Testing RL Agent Robustness & Generalization”

Pingback: [R] NetHack: Fast & Complex Learning Environment For Testing RL Agent Robustness & Generalization – tensor.io
zianezo

2020-07-02

Beautiful article, Thank you!

Loading...

Reply
zianez

2020-07-02

Thank you!

Loading...

Reply
Ziane

2020-12-08

Thank you so much for writing this. Beautiful articil

Loading...

Reply
Ziane

2021-05-21

http://virtuelcampus.univ-msila.dz/facdroitsp/
Merci pour ces document très bien faits.
Thank you ever so for you article.
Nice topic. Thank you
Merci pour ce partage et ce travail.beautiful articile.

Loading...

Reply
زوليخة

2021-05-21

http://virtuelcampus.univ-msila.dz/facdroitsp/
Merci pour ces document très bien faits
Nice topic. Thank you
Merci pour ce partage et ce travail.beautiful articile.

Loading...

Reply
zoli zoli

2021-05-21

http://virtuelcampus.univ-msila.dz/facdroitsp
Thank you ever so for you article.
Nice topic. Thank you
Merci pour ce partage et ce travail.beautiful articile.

Loading...

Reply
Ziane

2021-05-26

http://virtuelcampus.univ-msila.dz/facdroitsp/?p=13565
Much thanks again.
Thank you ever so for you article.
Merci pour ces document très bien faits.

Loading...

Reply
زوليخة

2021-05-26

http://virtuelcampus.univ-msila.dz/facdroitsp/?p=13565
Much thanks again.
Thank you ever so for you article.
Merci pour ces document très bien faits.

Loading...

Reply
Ziane

2021-10-02

Much thanks again.
Your article
Thank you

Loading...

Reply
زوليخة

2021-10-02

Much thanks again.
Your article
Thank you

Loading...

Reply
زوليخة

2021-11-17

Merci pour ce partage
Nice topic. Thank you
very much.

Loading...

Reply
zoli zoli

2021-11-17

Merci pour ce partage
Nice topic. Thank you
very much.

Loading...

Reply
Ziane

2021-11-17

Merci pour ce partage
Nice topic. Thank you
very much.

Loading...

Reply
zoli zoli

2021-11-20

great information;
Merci pour cet article très complet.

thank you

Loading...

Reply
زوليخة

2021-11-23

Your article is very useful
Thank you
very much.

Loading...

Reply
zoli zoli

2021-11-23

Your article is very useful
Thank you
very much.

Loading...

Reply
Ziane

2021-11-23

Your article is very useful
Thank you
very much.

Loading...

Reply
زوليخة

2021-11-24

Merci pour tout travail
Thank you so much for writing this. Beautiful articil

Loading...

Reply
zoli zoli

2021-11-24

Merci pour tout travail
Thank you so much for writing this. Beautiful articil

Loading...

Reply
زوليخة

2021-11-27

Thank you ever so for you article.
Much thanks again.
Your article is very useful

Loading...

Reply
zoli zoli

2021-11-27

Thank you ever so for you article.
Much thanks again.
Your article is very useful

Loading...

Reply
زوليخة

2021-11-30

Much thanks again.
Your article is very useful
very much.

Loading...

Reply
zoli zoli

2021-11-30

Much thanks again.
Your article is very useful
very much.

Loading...

Reply
Ziane

2021-11-30

Much thanks again.
Your article is very useful
very much.

Loading...

Reply
زوليخة

2021-12-02

Merci pour cet article très complet.
very much.
thank you

Loading...

Reply
zoli zoli

2021-12-02

Merci pour cet article très complet.
very much.
thank you

Loading...

Reply
Ziane

2021-12-02

Merci pour cet article très complet.
very much.
thank you

Loading...

Reply
Ziane

2022-01-23

Thank you very much.
Blog really helpful for us .

Loading...

Reply
Ziane

2022-01-24

Thank you very much.
Your article is very useful

Loading...

Reply
Ziane

2022-01-24

Thank you very much.
Your ery useful

Loading...

Reply
Ziane

2022-01-24

Thank you very much.
s very useful

Loading...

Reply
Ziane

2022-01-24

Thank ry much.
Your a very useful

Loading...

Reply
Ziane

2022-01-24

Tha very much.
Your a very useful

Loading...

Reply
Ziane

2022-01-24

Thankery much.
Your very useful

Loading...

Reply
Ziane

2022-01-24

Thank you very much.
Yos very useful

Loading...

Reply
Ziane

2022-02-28

Thank you helpful for us .

Loading...

Reply
zoli zoli

2022-02-28

Thank you vpful for us .

Loading...

Reply
Larry Martin

2023-11-06

Reinforcement Learning agents continue to impress with their performance, but the challenge lies in assessing their robustness and adaptability to diverse environments beyond specific benchmarks.
Hull bottom cleaning in Fort Lauderdale FL

Loading...

Reply
jewelgalore

2023-12-28

Jewelgalore is your premier destination for online Pakistani jewelry . Explore their exquisite collection and adorn yourself with intricately designed pieces that reflect the cultural richness and artistry of Pakistan.

Loading...

Reply
OSH UNIVERSITY

2023-12-28

Osh University and national state university are committed to shaping the future of medical professionals in Kyrgyzstan. Their partnership provides an excellent platform for students to excel in the healthcare field.

Loading...

Reply
Shalamar Hospital

2023-12-28

Shalamar Hospital is your destination for top neurological care in Lahore, with the best neurologist in Lahore providing expert consultations and treatments.

Loading...

Reply
OSH UNIVERSITY

2024-02-24

Secure your path to a medical career with Osh University, where admission in mbbs opens doors to excellence. Join a community of aspiring medical professionals, guided by expert faculty, and embark on a transformative journey.

Loading...

Reply
shalamarhospital02@gmail.com

2024-02-24

Shalamar Hospital is home to a dedicated child specialist . Our expert child specialist provides compassionate and top-notch pediatric care, ensuring the well-being of your little ones is in capable hands.

Loading...

Reply
Tempo Garments

2024-02-24

The short sleeves and vest from Tempo Garments combine for effortless style. Our line, which is made from high-quality fabrics, is adaptable and comfortable for every setting. Whether you are dressing for a formal event or a casual get-together, you can count on Tempo Garments to deliver the ideal balance of style and utility, making you feel and look your best.

Loading...

Reply