DeepMind’s MEME Agent Achieves Human-level Atari Game Performance 200x Faster Than Agent57

In the new paper Human-level Atari 200x Faster, a DeepMind research team applies a set of diverse strategies to Agent57, with their resulting MEME (Efficient Memory-based Exploration) agent surpassing the human baseline on all 57 Atari games in just 390 million frames — two orders of magnitude faster than Agent57.

The Atari57 suite of classic video games is a popular benchmark used in the reinforcement learning (RL) community to test the general competency of RL algorithms. As Synced reported in 2020, DeepMind researchers created Agent57, the first deep RL agent to achieve above-human performance on all 57 games. While Agent57’s performance was heralded as a breakthrough moment in RL, it came at the cost of very poor data efficiency, requiring nearly 80 billion frames of experience.
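
For readers unfamiliar with how "above-human" is usually measured on Atari, the sketch below shows the human-normalized score commonly used in the literature: 0 corresponds to random play and 1 to the human baseline. The article itself does not spell out the metric, so treating it as the criterion here is an assumption, and the game names and raw scores are hypothetical placeholders rather than reported results.

```python
def human_normalized_score(agent: float, random_score: float, human: float) -> float:
    """Return 0.0 for random-level play and 1.0 for human-level play."""
    return (agent - random_score) / (human - random_score)


# Hypothetical raw scores, for illustration only (not taken from any paper).
scores = {
    "game_a": {"agent": 1200.0, "random_score": 100.0, "human": 900.0},
    "game_b": {"agent": 40.0, "random_score": 10.0, "human": 60.0},
}

# Normalize every game, then list the ones where the agent beats the human baseline.
hns = {name: human_normalized_score(**s) for name, s in scores.items()}
above_human = [name for name, value in hns.items() if value > 1.0]
```

Under this metric, an agent is "above human" on the whole suite only if every one of the 57 games ends up in a list like `above_human`.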

The team summarizes their work’s main contributions as follows:

  1. Building off Agent57, we carefully examine bottlenecks that slow down learning and address instabilities that arise when these bottlenecks are removed.
  2. We propose a novel agent that we call MEME (Efficient Memory-based Exploration agent), which introduces solutions to enable taking advantage of three approaches that would otherwise lead to instabilities: training the value functions of the whole family of policies from Agent57 in parallel, on all policies’ transitions (instead of just the behaviour policy transitions), bootstrapping from the online network, and using high replay ratios.
  3. We explore several recent advances in deep learning and determine which of them are beneficial for non-stationary problems like the ones considered in this work.
  4. We examine approaches to robustify performance by introducing a policy distillation mechanism that learns a policy head based on the actions obtained from the value network without being sensitive to value magnitudes.

The DeepMind researchers’ goal was the development of an agent as general as Agent57 and capable of reaching human-level performance across the entire Atari57 game suite but with much higher sample efficiency. The paper details the novel techniques used to achieve this:

  • An approximate trust region method for stable bootstrapping from the online network to enable faster propagation of learning signals for rare events
  • A normalization scheme for the loss and priorities to improve the robustness of value function learning and stabilize learning under differing value scales
  • An improved neural network architecture based on NFNets, which does not require normalization layers
  • A policy distillation method to smooth out the instantaneous greedy policy over time and enable more robust updates under a rapidly-changing policy
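
To make the first technique in the list above more concrete, here is a minimal sketch of how an approximate trust region might gate value updates when bootstrapping from the online network. It is an illustration under stated assumptions, not the paper's implementation: the function names, the scalar per-transition form, and the parameters `alpha` (trust region width) and `sigma` (an assumed running estimate of the TD-error scale) are all hypothetical. The idea shown is that an update is dropped only when the online estimate has already drifted far from the target network's estimate and the update would push it further away.

```python
def trust_region_mask(q_online, q_target, td_target, alpha, sigma):
    """Return True if the update should be kept, False if it should be masked."""
    drift = q_online - q_target               # how far the online estimate has moved
    outside = abs(drift) > alpha * sigma      # beyond the assumed trust region width
    update_direction = td_target - q_online   # direction a squared-loss step moves q_online
    moves_further_away = drift * update_direction > 0
    return not (outside and moves_further_away)


def masked_squared_loss(batch, alpha, sigma):
    """Average 0.5 * TD-error^2 over the batch, with masked transitions contributing zero."""
    total = 0.0
    for q_online, q_target, td_target in batch:
        if trust_region_mask(q_online, q_target, td_target, alpha, sigma):
            total += 0.5 * (q_online - td_target) ** 2
    return total / len(batch)


# Hypothetical (q_online, q_target, td_target) triples, for illustration only.
batch = [
    (3.0, 1.0, 5.0),  # drifted up and the target pushes further up: masked
    (3.0, 1.0, 2.0),  # drifted up but the target pulls back: kept
    (1.5, 1.0, 5.0),  # still inside the trust region: kept
]
loss = masked_squared_loss(batch, alpha=1.0, sigma=1.0)
```

Masking rather than clipping keeps updates that pull the estimate back toward the target network, which is one plausible way to keep learning stable without a slowly updated bootstrap target.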

In their empirical study, the researchers evaluated MEME on all 57 Atari games, where it surpassed the human baseline on every game within 390M frames, roughly 200 times fewer than Agent57 required.
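
The headline ratio can be checked with a few lines of arithmetic. The sketch below uses the article's own rounded figures ("nearly 80 billion" frames for Agent57 and 390 million for MEME), so the result is approximate; the variable names are illustrative.

```python
import math

agent57_frames = 80e9   # "nearly 80 billion" frames, as quoted in the article (rounded)
meme_frames = 390e6     # 390 million frames

# Ratio of environment frames needed to clear the human baseline on all 57 games.
speedup = agent57_frames / meme_frames

# Express the same ratio as a base-10 order of magnitude.
orders_of_magnitude = math.log10(speedup)

print(f"speedup: {speedup:.0f}x, about {orders_of_magnitude:.1f} orders of magnitude")
```

With these rounded inputs the ratio comes out slightly above 200, which is consistent with both the "200x" and the "two orders of magnitude" descriptions.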

The team notes that despite MEME’s success, there remains room for improvement in its generality, and envisions applying MEME to additional challenges such as more complex observation spaces (e.g. 3D navigation, multi-modal inputs), complex action spaces, and longer-term credit assignment.

The paper Human-level Atari 200x Faster is on arXiv.


Author: Hecate He | Editor: Michael Sarazen

