IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v529y2016i7587d10.1038_nature16961.html
   My bibliography  Save this article

Mastering the game of Go with deep neural networks and tree search

Author

Listed:
  • David Silver

    (Google DeepMind)

  • Aja Huang

    (Google DeepMind)

  • Chris J. Maddison

    (Google DeepMind)

  • Arthur Guez

    (Google DeepMind)

  • Laurent Sifre

    (Google DeepMind)

  • George van den Driessche

    (Google DeepMind)

  • Julian Schrittwieser

    (Google DeepMind)

  • Ioannis Antonoglou

    (Google DeepMind)

  • Veda Panneershelvam

    (Google DeepMind)

  • Marc Lanctot

    (Google DeepMind)

  • Sander Dieleman

    (Google DeepMind)

  • Dominik Grewe

    (Google DeepMind)

  • John Nham

    (Google, 1600 Amphitheatre Parkway, Mountain View)

  • Nal Kalchbrenner

    (Google DeepMind)

  • Ilya Sutskever

    (Google, 1600 Amphitheatre Parkway, Mountain View)

  • Timothy Lillicrap

    (Google DeepMind)

  • Madeleine Leach

    (Google DeepMind)

  • Koray Kavukcuoglu

    (Google DeepMind)

  • Thore Graepel

    (Google DeepMind)

  • Demis Hassabis

    (Google DeepMind)

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses ‘value networks’ to evaluate board positions and ‘policy networks’ to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

Suggested Citation

  • David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
  • Handle: RePEc:nat:nature:v:529:y:2016:i:7587:d:10.1038_nature16961
    DOI: 10.1038/nature16961
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/nature16961
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/nature16961?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:529:y:2016:i:7587:d:10.1038_nature16961. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.