
Facebook and Intel reign supreme in 'Doom' AI deathmatch

But the competition was full of interesting entrants.


On the island of Santorini, Greece, a group of AIs has been facing off in an epic battle of Doom.

This is VizDoom, a contest born from one man's idea: to improve the state of artificial intelligence by teaching computers the art of fragging. That simple notion then spiraled into a battle between tech giants, universities and coders. Over the past few months, they've all been honing their bots (known as "agents"), building up to one final death match.

OK, it was a lot more than one match. But that doesn't sound nearly as dramatic.

The competition is all about machine visual learning. Just like when you or I play Doom, the agents can make decisions based only on what they "see" and have no access to information within the game's code.
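To get a sense of what that constraint looks like in practice, here's a minimal sketch of a ViZDoom perception-action loop in Python. The config file is a placeholder and a real agent would replace the random choice with a learned policy; the point is that the only input the agent ever gets is the raw screen buffer.

```python
# Minimal ViZDoom perception-action loop (illustrative sketch).
# The agent sees only raw pixels via screen_buffer -- no access to game internals.
import random
from vizdoom import DoomGame

game = DoomGame()
game.load_config("basic.cfg")   # placeholder scenario config
game.init()

# Each action is a list of button states defined by the scenario config,
# e.g. [MOVE_LEFT, MOVE_RIGHT, ATTACK].
actions = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

game.new_episode()
while not game.is_episode_finished():
    state = game.get_state()
    frame = state.screen_buffer        # raw pixels: everything the agent "sees"
    action = random.choice(actions)    # a real agent maps frame -> action
    reward = game.make_action(action)  # advance the game, collect the reward
game.close()
```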

There were two "tracks" for agents to compete on, offering very different challenges. Track 1 featured a map known to the teams, and rocket launchers were the only weapons. The agents started with a weapon but were able to collect ammo and health kits.

Track 2 was a far harder challenge. It featured three maps, unknown to the teams, and a full array of weapons and items. While Track 1 agents could learn by repeating a single map over and over, agents competing in Track 2 needed more general AI capabilities to navigate their unknown environments. Both tracks were played for a total of two hours, with Track 1 consisting of 12 10-minute matches and Track 2 consisting of three sets of four 10-minute matches (one set for each map).

As you might have expected, the winners for both categories came from the private sector. The agent F1, programmed by Facebook AI researchers Yuxin Wu and Yuandong Tian, won Track 1 overall, besting its opponents in 10 of 12 rounds. For Track 2, IntelAct, programmed by Intel Labs researchers Alexey Dosovitskiy and Vladlen Koltun, put in a similarly dominating performance, taking the victory and winning 10 of 12 rounds. But while Intel and Facebook may have won the overall prizes, there were other impressive performances. Three standout bots -- Arnold, Clyde and Tuho -- came from students.

Arnold

Arnold is the product of Devendra Singh Chaplot and Guillaume Lample, two master's students from Carnegie Mellon University's School of Computer Science. Their team, The Terminators, competed on Tracks 1 and 2, and saw success on both. In fact, Arnold was the only agent outside Facebook and Intel to win rounds. On Track 1, each bot had to skip one round, and F1's departure gave round three to Arnold. In round six, though, Arnold won outright, besting F1 by two frags. The result never looked in doubt, though, and Arnold ended in second place, 146 frags behind F1.

Track 2 was where things got interesting. Arnold was competitive in the first map, but IntelAct already had a 19-frag lead heading into map two. On the second map, however, Arnold suddenly came alive. It won the first two rounds, closing the gap to just 11 frags at one point and ending the map 15 behind. But it wasn't to be. IntelAct excelled at the final map, scoring 130 frags in just four rounds and destroying the plucky underdog's hopes of pulling off an upset. Arnold lost the overall count 256 to 164, again ending in second place.

Behind the scenes, though, all the work was finished months ago. Arnold is one of the more ambitious efforts in the VizDoom competition, combining multiple techniques. It's actually the result of two distinct networks. The first is a deep Q-network (DQN), a technique Google DeepMind pioneered to master 49 Atari 2600 games. The second is a deep recurrent Q-network (DRQN), similar to a DQN but with a recurrent loop that lets it use its memory of previous frames when deciding what to do next. Arnold's DRQN has also been augmented to help the agent detect when an enemy is visible in the frame.
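To make that concrete, here's a minimal PyTorch sketch of a DRQN with an auxiliary enemy-detection head, in the spirit of what's described above. The framework, layer sizes and the exact detection target are assumptions for illustration, not the team's actual code.

```python
# Sketch of a DRQN with an auxiliary "enemy visible?" head (assumed architecture).
import torch
import torch.nn as nn

class DRQN(nn.Module):
    def __init__(self, n_actions, hidden=512):
        super().__init__()
        # Convolutional encoder turns each game frame into a feature vector.
        self.conv = nn.Sequential(
            nn.Conv2d(3, 32, 8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        self.fc = nn.LazyLinear(hidden)
        # Recurrent core: memory of previous frames informs the next decision.
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        # Q-values for each action, plus an auxiliary enemy-detection head.
        self.q_head = nn.Linear(hidden, n_actions)
        self.enemy_head = nn.Linear(hidden, 1)

    def forward(self, frames, hidden_state=None):
        # frames: (batch, time, channels, height, width)
        b, t = frames.shape[:2]
        feats = self.fc(self.conv(frames.flatten(0, 1))).view(b, t, -1)
        out, hidden_state = self.lstm(feats, hidden_state)
        q_values = self.q_head(out)           # per-step action values
        enemy_logit = self.enemy_head(out)    # per-step "enemy visible" logit
        return q_values, enemy_logit, hidden_state
```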

In a death match, Arnold can be in one of two states: navigation (exploring the map to pick up items and find enemies) or action (fighting enemies), with a separate neural network handling each. The DQN handles navigation. It's responsible for moving the agent around the level when nothing much is happening, hunting down items and other players. As soon as an enemy shows up on the screen, however, it hands control to the DRQN, which sets about shooting things. Combining these two methods, which can be trained independently and in parallel, is the key to Arnold's success.
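The hand-off between the two modes might look something like the sketch below. The function names, threshold and interfaces are assumptions layered on top of the DRQN sketch above, not Arnold's implementation.

```python
# Sketch of the navigation/action switch: the combat DRQN's enemy detector
# decides which network picks the action this step.
import torch

ENEMY_THRESHOLD = 0.5  # assumed confidence at which we switch to combat mode

def choose_action(frame_seq, nav_net, combat_net, hidden):
    """Pick an action given the recent frame sequence (batch, time, C, H, W)."""
    q_combat, enemy_logit, hidden = combat_net(frame_seq, hidden)
    enemy_visible = torch.sigmoid(enemy_logit[:, -1]) > ENEMY_THRESHOLD
    if enemy_visible.any():
        # Action mode: the DRQN handles aiming and shooting.
        action = q_combat[:, -1].argmax(dim=-1)
    else:
        # Navigation mode: the DQN hunts for items and other players.
        q_nav = nav_net(frame_seq[:, -1])
        action = q_nav.argmax(dim=-1)
    return action, hidden
```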

But Arnold's creators aren't interested in pursuing an unbeatable Doom agent. Instead, they saw VizDoom as a convenient testbed for their ideas about reinforcement learning. Speaking by phone, Chaplot explained that the networks deployed in Arnold can be applied to robotics in the real world. Navigation and self-localization are a real challenge for machines, and the team is now focused on solving those issues. They've published their initial findings from Arnold and VizDoom, and are using what they've learned to try to create better robots.

Clyde

Clyde was created by Dino Ratcliffe, a Ph.D. candidate at the University of Essex in the Intelligent Games and Game Intelligence program. A one-person effort, the AI competed on Track 1 only. Though Clyde never won a round, it was extremely competitive throughout, besting Arnold in five rounds and, in one match, losing to F1 by only one frag. It ended the competition in third place with 393 frags, putting it 20 behind Arnold and 166 behind F1.

It could have gone so differently for Clyde. Ratcliffe began development in order to understand "what the state of the art in general video-game-playing" is for AI right now. He used asynchronous advantage actor-critic (A3C), an advance on the DQN approach in which multiple worker networks learn in parallel and asynchronously update a shared global network.
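The core A3C idea can be sketched in a few lines. This is illustrative only, not Ratcliffe's code: make_env, make_net and compute_loss are placeholder callables (compute_loss would run a rollout and return the combined policy, value and entropy loss), and the shared-optimizer details are simplified.

```python
# Sketch of A3C: several workers, one shared global network updated asynchronously.
import torch
import torch.multiprocessing as mp

def worker(global_net, optimizer, make_env, make_net, compute_loss, steps=10_000):
    env, local_net = make_env(), make_net()
    for _ in range(steps):
        local_net.load_state_dict(global_net.state_dict())  # sync with the global net
        loss = compute_loss(local_net, env)                  # run a rollout, get the A3C loss
        optimizer.zero_grad()
        loss.backward()
        # Copy the local gradients onto the shared global parameters, then step.
        for lp, gp in zip(local_net.parameters(), global_net.parameters()):
            gp._grad = lp.grad
        optimizer.step()

def train(make_env, make_net, compute_loss, n_workers=8):
    global_net = make_net()
    global_net.share_memory()  # expose parameters to every worker process
    optimizer = torch.optim.Adam(global_net.parameters(), lr=1e-4)
    procs = [mp.Process(target=worker,
                        args=(global_net, optimizer, make_env, make_net, compute_loss))
             for _ in range(n_workers)]
    for p in procs:
        p.start()
    for p in procs:
        p.join()
```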

Ratcliffe told me he took a hands-off approach to training, preferring the agent to learn by itself what enemies are, what death is, what health packs are and so on. "I think it's dangerous to start encoding your own domain knowledge into these agents as it inhibits their ability to generalize across games," he explained. "I simply gave it a reward for killing opponents and increasing its health, ammo or armor."
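In ViZDoom terms, that kind of hands-off reward shaping can be expressed as a simple delta check on the counters the game already exposes. A rough sketch follows; the exact variable list and weights are assumptions, not Clyde's actual values.

```python
# Sketch of reward shaping: reward only increases in kills, health, ammo and armor.
from vizdoom import GameVariable

TRACKED = [GameVariable.KILLCOUNT, GameVariable.HEALTH,
           GameVariable.AMMO2, GameVariable.ARMOR]
WEIGHTS = [1.0, 0.02, 0.01, 0.01]   # assumed relative weights

def shaped_reward(game, prev_values):
    """Return a weighted sum of positive deltas in the tracked variables."""
    values = [game.get_game_variable(v) for v in TRACKED]
    reward = sum(w * max(0.0, new - old)
                 for w, new, old in zip(WEIGHTS, values, prev_values))
    return reward, values
```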

But a catastrophic failure -- Ratcliffe's PC power supply blew up 24 hours before the competition deadline -- meant Clyde completed only around 40 percent of its training regimen: it had learned from 30 million frames rather than the necessary 80 million. The biggest downside of this incomplete training, Ratcliffe explained, is that the agent still occasionally commits suicide. It's for this reason that Clyde got its moniker -- it's named after the weakest ghost in Pac-Man, which, rather than pursuing or holding position, just wanders around at random.

Clyde learned a simple form of spawn camping

The fully trained Clyde, which wasn't submitted, is far stronger. Ratcliffe said he's observed it using a simple form of "spawn camping," a much-maligned tactic in multiplayer shooters in which you wait at strategic points on a map and kill players as they spawn in. "It notices certain corridors that have spawn points close by and shoots more," he explained. This behavior is apparently present in the competition version of Clyde too, just less pronounced.

Before the results were published, Ratcliffe said he didn't think Clyde would be competitive, so a third-place rosette was definitely above expectations. Ratcliffe has already moved on to a new project: 2D platformers. "I had only started looking into deep reinforcement learning around one week before the competition was announced," he said. "I pretty much had to learn the whole field in the process of competing, and that was the point of me taking part. So I now have a solid foundation to start my own research this year." While other agents have mastered 2D platformers, he wants to teach one to learn Mario and then apply what it has learned to other games with minimal retraining.

Tuho

The final prize-winning spot was taken by Anssi "Miffyli" Kanervisto, a master of science student at the University of Eastern Finland's School of Computing. His agent, Tuho (Finnish for "doom"), is a one-person effort, created with oversight from Ville Hautamäki, Ph.D., at the same university.

Some of Tuho's best performances came on Track 1, where it managed to finish in second place, behind F1, in three rounds. It ultimately placed fourth, just outside the prize rankings. On Track 2, it didn't get close to challenging F1 or Arnold, but it put in a solid performance on the first and last maps, which was enough to balance out a disastrous showing on the second. Tuho ended up in third place with 51 frags -- despite spending the four middle rounds killing itself more often than it killed opponents.

Kanervisto built a complex agent in Tuho, with a navigation system based on multiple techniques. The most important is a dueling DQN -- an architecture that splits the network into two streams, one estimating the value of the current state and the other the relative advantage of each action, then combines them for a better end result. Tuho's aiming and firing system is largely based on image recognition, matching potential enemies against a manually recorded library of images.
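The dueling part of such a network fits in a few lines of PyTorch. The framework choice, layer sizes and names here are assumptions for illustration, not Kanervisto's code.

```python
# Sketch of a dueling DQN head: separate value and advantage streams, combined into Q-values.
import torch
import torch.nn as nn

class DuelingHead(nn.Module):
    def __init__(self, in_features, n_actions, hidden=256):
        super().__init__()
        self.value = nn.Sequential(nn.Linear(in_features, hidden), nn.ReLU(),
                                   nn.Linear(hidden, 1))
        self.advantage = nn.Sequential(nn.Linear(in_features, hidden), nn.ReLU(),
                                       nn.Linear(hidden, n_actions))

    def forward(self, features):
        v = self.value(features)        # how good is this state overall?
        a = self.advantage(features)    # how good is each action, relatively?
        # Subtract the mean advantage so the value/advantage split is identifiable.
        return v + a - a.mean(dim=-1, keepdim=True)
```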

It was trained to prioritize movement speed in order to get it running in straight lines, and the result, Kanervisto says, is a "well-behaving model that was able to move around and not get stuck, although it struggled with doorways." But the entire training regimen took place on his personal computer, with an Ivy Bridge i7 processor and a GTX 760 graphics card. You typically need a very powerful computer, or better yet several, to train an AI at a reasonable speed. Because of this, he was limited in both the size of the network and the size of the input images.

Everyone's a winner

It's usually an empty cliché, but with VizDoom it feels like everyone here really is a winner. Arnold's creators will receive €300 for their agent's performance on Track 1 and €1,000 for Track 2, leaving them with around $1,450 to share. Ratcliffe earned €200 ($222) for Clyde's third place. Tuho bagged Kanervisto €500 ($558) for its exploits.

Some are going home with prizes, but all the teams I've spoken to have gained a lot from the experience. Take Oliver Dressler and his agent, Abyss II. Dressler is a Ph.D. candidate in microfluidics (bioengineering) at ETH in Switzerland and had no previous experience in AI. I asked him what he'd learned from participating in VizDoom. "Literally all my machine-learning knowledge" was the answer.

Dressler based Abyss II on the A3C algorithm and had to learn everything as he went along. That led to some big mistakes, but also a lot of hard-won knowledge. One such lesson came in training. "Shooting is required to win," he explained, "but shooting at the wrong moment (which is nearly every moment) will result in suicide." The map was full of narrow corridors, and any nearby explosion would kill the agent. Just overcoming that is a challenge in itself.

Abyss II placed seventh on Track 1, but from speaking to Dressler before the contest, it was apparent he would be happy regardless of the result. "Given the short time frame, I really don't expect my bot to perform particularly well, but it has been an amazing challenge," he added. "It has even paid off more than I expected, and I can use this knowledge very well in my current work."

VizDoom will have knock-on effects, too. Google DeepMind and other leaders in machine learning, despite not formally entering the competition, will have learned a few things from it. Doom is a highly complex title, and various DQN-, DRQN- and A3C-based agents performed with great success.

I don't know what methods Facebook and Intel employees used to win the top prizes in their categories, but it's likely we'll see papers published from them soon. Regardless, as is often the case with AI, the innovative techniques used to win VizDoom will serve to strengthen every researcher's knowledge of vision-based machine learning.