↓ Skip to main content

Mastering the game of Go with deep neural networks and tree search

Overview of attention for article published in Nature, January 2016
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • Among the highest-scoring outputs from this source (#25 of 49,258)
  • High Attention Score compared to outputs of the same age (99th percentile)
  • High Attention Score compared to outputs of the same age and source (99th percentile)

Readers on

mendeley
5395 Mendeley
citeulike
27 CiteULike
Title
Mastering the game of Go with deep neural networks and tree search
Published in
Nature, January 2016
DOI 10.1038/nature16961
Pubmed ID
Authors

David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, Demis Hassabis, Silver, David, Huang, Aja, Maddison, Chris J, Guez, Arthur, Sifre, Laurent, van den Driessche, George, Schrittwieser, Julian, Antonoglou, Ioannis, Panneershelvam, Veda, Lanctot, Marc, Dieleman, Sander, Grewe, Dominik, Nham, John, Kalchbrenner, Nal, Sutskever, Ilya, Lillicrap, Timothy, Leach, Madeleine, Kavukcuoglu, Koray, Graepel, Thore, Hassabis, Demis, Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

Twitter Demographics

The data shown below were collected from the profiles of 1,912 tweeters who shared this research output. Click here to find out more about how the information was compiled.

Mendeley readers

The data shown below were compiled from readership statistics for 5,395 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 122 2%
United Kingdom 60 1%
Germany 43 <1%
Japan 33 <1%
China 27 <1%
Spain 16 <1%
Netherlands 14 <1%
France 13 <1%
Canada 12 <1%
Other 129 2%
Unknown 4926 91%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 1589 29%
Student > Master 1136 21%
Researcher 911 17%
Student > Bachelor 639 12%
Other 275 5%
Other 845 16%
Readers by discipline Count As %
Computer Science 2625 49%
Engineering 817 15%
Agricultural and Biological Sciences 345 6%
Physics and Astronomy 342 6%
Unspecified 207 4%
Other 1059 20%

Attention Score in Context

This research output has an Altmetric Attention Score of 3211. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 21 January 2018.
All research outputs
#96
of 8,940,263 outputs
Outputs from Nature
#25
of 49,258 outputs
Outputs of similar age
#6
of 338,632 outputs
Outputs of similar age from Nature
#2
of 966 outputs
Altmetric has tracked 8,940,263 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 99th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 49,258 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 76.4. This one has done particularly well, scoring higher than 99% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 338,632 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 99% of its contemporaries.
We're also able to compare this research output to 966 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 99% of its contemporaries.