Google's DeepMind Agent57 AI Masters Classic Atari Games Like Pitfall And Solaris

atari deepmind
We’ve seen Alphabet/Google’s DeepMind AI crush human opponents in games like StarCraft II, but now the team is going back in time to tackle games from 4 decades ago. Dubbed Agent57, this new AI is gaining attention for learning how to play a total of – you guessed it – 57 Atari 2600 games.

The DeepMind team often looks to existing games to help improve its AI routines, and it explains that they provide “excellent testing ground for building adaptive algorithms” that can encompass a “rich suite of tasks which players must develop sophisticated behavioral strategies to master.” More importantly, each game has its own high score which can be used as a benchmark for improved performance in the future.

A number of well-known titles were included in the grouping of 57 like Solaris, Skiing, Montezuma’s Revenge, and Pitfall. These four games in particular were chosen because previous attempts to tackle these games with AI have failed miserably at being successful. In the case of Montezuma’s Revenge and Pitfall, the aspect of exploring the game environment was the pitfall with previous attempts.

In the case of Skiing and Solaris, the DeepMind team writes:

[These games] are long-term credit assignment problems: in these games, it’s challenging to match the consequences of an agents’ actions to the rewards it receives. Agents must collect information over long time scales to get the feedback necessary to learn.

However, the DeepMind team was able to make optimizations to the Deep-Q network to improve decision-making skills and take better advantage of the reward systems (using a time horizon) of games like Solaris. They then leveraged exploration options in Pitfall, which are then funneled through an adaptive meta-controller.

In the end, Agent57 was able to learn how to play all 57 games, one at a time, retraining before moving on to the next title. This “limitation” is also inherent in DeepMind’s AlphaZero AI, and is something that the team hopes to be able to move beyond in the future to make such systems more human. "True versatility, which comes so easily to a human infant, is still far beyond AIs' reach,” the team added.

Brandon Hill

Brandon Hill

Brandon received his first PC, an IBM Aptiva 310, in 1994 and hasn’t looked back since. He cut his teeth on computer building/repair working at a mom and pop computer shop as a plucky teen in the mid 90s and went on to join AnandTech as the Senior News Editor in 1999. Brandon would later help to form DailyTech where he served as Editor-in-Chief from 2008 until 2014. Brandon is a tech geek at heart, and family members always know where to turn when they need free tech support. When he isn’t writing about the tech hardware or studying up on the latest in mobile gadgets, you’ll find him browsing forums that cater to his long-running passion: automobiles.

Opinions and content posted by HotHardware contributors are their own.