
Mastering the game of Go without human knowledge

Author

Listed:
  • David Silver

    (DeepMind)

  • Julian Schrittwieser

    (DeepMind)

  • Karen Simonyan

    (DeepMind)

  • Ioannis Antonoglou

    (DeepMind)

  • Aja Huang

    (DeepMind)

  • Arthur Guez

    (DeepMind)

  • Thomas Hubert

    (DeepMind)

  • Lucas Baker

    (DeepMind)

  • Matthew Lai

    (DeepMind)

  • Adrian Bolton

    (DeepMind)

  • Yutian Chen

    (DeepMind)

  • Timothy Lillicrap

    (DeepMind)

  • Fan Hui

    (DeepMind)

  • Laurent Sifre

    (DeepMind)

  • George van den Driessche

    (DeepMind)

  • Thore Graepel

    (DeepMind)

  • Demis Hassabis

    (DeepMind)

Abstract

A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo’s own move selections and also the winner of AlphaGo’s games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.
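The abstract says the network is trained to predict both the program's own move selections and the winner of its games, but it does not state the training objective. As a minimal sketch, assuming the network outputs a move-probability vector `p` and a scalar value `v`, one plausible per-position objective adds a squared error on the game outcome to a cross-entropy against the search-derived move probabilities. The function name `combined_loss`, the `eps` guard, and the three-move position are hypothetical and chosen only for illustration.

```python
import math

def combined_loss(p, v, pi, z, eps=1e-12):
    """Value squared error plus policy cross-entropy for one position.

    p  : predicted move probabilities (assumed to sum to 1)
    v  : predicted outcome in [-1, 1]
    pi : target move probabilities taken from the tree search
    z  : actual game outcome for the player to move (+1 win, -1 loss)
    """
    value_loss = (z - v) ** 2
    # eps guards against log(0) when the network assigns zero probability.
    policy_loss = -sum(t * math.log(q + eps) for t, q in zip(pi, p))
    return value_loss + policy_loss

# Hypothetical 3-move position in a game the current player went on to win.
search_probs = [0.7, 0.2, 0.1]
good = combined_loss(p=[0.7, 0.2, 0.1], v=0.9, pi=search_probs, z=1.0)
bad = combined_loss(p=[0.1, 0.2, 0.7], v=-0.9, pi=search_probs, z=1.0)
print(good < bad)
```

A prediction that agrees with the search and with the eventual winner scores a lower loss than one that disagrees with both, which is the sense in which the search output and the game result act as the teacher.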

Suggested Citation

  • David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan Hui & Laurent Sifre & George van den Driessche & Thore Graepel & Demis Hassabis, 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
  • Handle: RePEc:nat:nature:v:550:y:2017:i:7676:d:10.1038_nature24270
    DOI: 10.1038/nature24270

    Download full text from publisher

    File URL: https://www.nature.com/articles/nature24270
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/nature24270?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

