IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-45305-z.html
   My bibliography  Save this article

High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit

Author

Listed:
  • Xuan-Kun Li

    (Shanghai Jiao Tong University
    Hefei National Laboratory)

  • Jian-Xu Ma

    (TuringQ Co., Ltd.)

  • Xiang-Yu Li

    (TuringQ Co., Ltd.)

  • Jun-Jie Hu

    (Shanghai Jiao Tong University
    Hefei National Laboratory)

  • Chuan-Yang Ding

    (Shanghai Jiao Tong University
    Hefei National Laboratory)

  • Feng-Kai Han

    (Shanghai Jiao Tong University
    Hefei National Laboratory)

  • Xiao-Min Guo

    (TuringQ Co., Ltd.)

  • Xi Tan

    (Shanghai Jiao Tong University
    Hefei National Laboratory)

  • Xian-Min Jin

    (Shanghai Jiao Tong University
    Hefei National Laboratory
    TuringQ Co., Ltd.
    Shanghai Jiao Tong University)

Abstract

Reinforcement learning (RL) stands as one of the three fundamental paradigms within machine learning and has made a substantial leap to build general-purpose learning systems. However, using traditional electrical computers to simulate agent-environment interactions in RL models consumes tremendous computing resources, posing a significant challenge to the efficiency of RL. Here, we propose a universal framework that utilizes a photonic integrated circuit (PIC) to simulate the interactions in RL for improving the algorithm efficiency. High parallelism and precision on-chip optical interaction calculations are implemented with the assistance of link calibration in the hybrid architecture PIC. By introducing similarity information into the reward function of the RL model, PIC-RL successfully accomplishes perovskite materials synthesis task within a 3472-dimensional state space, resulting in a notable 56% improvement in efficiency. Our results validate the effectiveness of simulating RL algorithm interactions on the PIC platform, highlighting its potential to boost computing power in large-scale and sophisticated RL tasks.

Suggested Citation

  • Xuan-Kun Li & Jian-Xu Ma & Xiang-Yu Li & Jun-Jie Hu & Chuan-Yang Ding & Feng-Kai Han & Xiao-Min Guo & Xi Tan & Xian-Min Jin, 2024. "High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45305-z
    DOI: 10.1038/s41467-024-45305-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-45305-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-45305-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kathryn Tunyasuvunakool & Jonas Adler & Zachary Wu & Tim Green & Michal Zielinski & Augustin Žídek & Alex Bridgland & Andrew Cowie & Clemens Meyer & Agata Laydon & Sameer Velankar & Gerard J. Kleywegt, 2021. "Highly accurate protein structure prediction for the human proteome," Nature, Nature, vol. 596(7873), pages 590-596, August.
    2. Guo-Wei Lu & Jianxun Hong & Feng Qiu & Andrew M. Spring & Tsubasa Kashino & Juro Oshima & Masa-aki Ozawa & Hideyuki Nawata & Shiyoshi Yokoyama, 2020. "High-temperature-resistant silicon-polymer hybrid modulator operating at up to 200 Gbit s−1 for energy-efficient datacentres and harsh-environment applications," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
    3. Farshid Ashtiani & Alexander J. Geers & Firooz Aflatouni, 2022. "An on-chip photonic deep neural network for image classification," Nature, Nature, vol. 606(7914), pages 501-506, June.
    4. John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
    5. Keith T. Butler & Daniel W. Davies & Hugh Cartwright & Olexandr Isayev & Aron Walsh, 2018. "Machine learning for molecular and materials science," Nature, Nature, vol. 559(7715), pages 547-555, July.
    6. J. M. Arrazola & V. Bergholm & K. Brádler & T. R. Bromley & M. J. Collins & I. Dhand & A. Fumagalli & T. Gerrits & A. Goussev & L. G. Helt & J. Hundal & T. Isacsson & R. B. Israel & J. Izaac & S. Jaha, 2021. "Quantum circuits with many photons on a programmable nanophotonic chip," Nature, Nature, vol. 591(7848), pages 54-60, March.
    7. G. Mourgias-Alexandris & M. Moralis-Pegios & A. Tsakyridis & S. Simos & G. Dabos & A. Totovic & N. Passalis & M. Kirtas & T. Rutirawut & F. Y. Gardes & A. Tefas & N. Pleros, 2022. "Noise-resilient and high-speed deep learning with coherent silicon photonics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    8. Yang Shi & Junyu Ren & Guanyu Chen & Wei Liu & Chuqi Jin & Xiangyu Guo & Yu Yu & Xinliang Zhang, 2022. "Nonlinear germanium-silicon photodiode for activation and monitoring in photonic neuromorphic networks," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    9. V. Saggio & B. E. Asenbeck & A. Hamann & T. Strömberg & P. Schiansky & V. Dunjko & N. Friis & N. C. Harris & M. Hochberg & D. Englund & S. Wölk & H. J. Briegel & P. Walther, 2021. "Experimental quantum speed-up in reinforcement learning agents," Nature, Nature, vol. 591(7849), pages 229-233, March.
    10. J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 589(7840), pages 52-58, January.
    11. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    12. Cheng Wang & Mian Zhang & Xi Chen & Maxime Bertrand & Amirhassan Shams-Ansari & Sethumadhavan Chandrasekhar & Peter Winzer & Marko Lončar, 2018. "Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages," Nature, Nature, vol. 562(7725), pages 101-104, October.
    13. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    14. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    15. Hsinhan Tsai & Wanyi Nie & Jean-Christophe Blancon & Constantinos C. Stoumpos & Reza Asadpour & Boris Harutyunyan & Amanda J. Neukirch & Rafael Verduzco & Jared J. Crochet & Sergei Tretiak & Laurent P, 2016. "High-efficiency two-dimensional Ruddlesden–Popper perovskite solar cells," Nature, Nature, vol. 536(7616), pages 312-316, August.
    16. J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Publisher Correction: Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 591(7849), pages 13-13, March.
    17. Xingyuan Xu & Mengxi Tan & Bill Corcoran & Jiayang Wu & Andreas Boes & Thach G. Nguyen & Sai T. Chu & Brent E. Little & Damien G. Hicks & Roberto Morandotti & Arnan Mitchell & David J. Moss, 2021. "11 TOPS photonic convolutional accelerator for optical neural networks," Nature, Nature, vol. 589(7840), pages 44-51, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang, Kaiyuan & Huang, Houjing & Vandans, Olafs & Murali, Adithya & Tian, Fujia & Yap, Roland H.C. & Dai, Liang, 2023. "Applying deep reinforcement learning to the HP model for protein structure prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
    2. Han Zhao & Bingzhao Li & Huan Li & Mo Li, 2022. "Enabling scalable optical computing in synthetic frequency dimension using integrated cavity acousto-optics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    3. Betz, Ulrich A.K. & Arora, Loukik & Assal, Reem A. & Azevedo, Hatylas & Baldwin, Jeremy & Becker, Michael S. & Bostock, Stefan & Cheng, Vinton & Egle, Tobias & Ferrari, Nicola & Schneider-Futschik, El, 2023. "Game changers in science and technology - now and beyond," Technological Forecasting and Social Change, Elsevier, vol. 193(C).
    4. Shaofu Xu & Jing Wang & Sicheng Yi & Weiwen Zou, 2022. "High-order tensor flow processing using integrated photonic circuits," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    5. Wenting Wang & Ping-Keng Lu & Abhinav Kumar Vinod & Deniz Turan & James F. McMillan & Hao Liu & Mingbin Yu & Dim-Lee Kwong & Mona Jarrahi & Chee Wei Wong, 2022. "Coherent terahertz radiation with 2.8-octave tunability through chip-scale photomixed microresonator optical parametric oscillation," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    6. Niklas W. A. Gebauer & Michael Gastegger & Stefaan S. P. Hessmann & Klaus-Robert Müller & Kristof T. Schütt, 2022. "Inverse design of 3d molecular structures with conditional generative neural networks," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    7. Weifan Long & Taixian Hou & Xiaoyi Wei & Shichao Yan & Peng Zhai & Lihua Zhang, 2023. "A Survey on Population-Based Deep Reinforcement Learning," Mathematics, MDPI, vol. 11(10), pages 1-17, May.
    8. Bitao Shen & Haowen Shu & Weiqiang Xie & Ruixuan Chen & Zhi Liu & Zhangfeng Ge & Xuguang Zhang & Yimeng Wang & Yunhao Zhang & Buwen Cheng & Shaohua Yu & Lin Chang & Xingjun Wang, 2023. "Harnessing microcomb-based parallel chaos for random number generation and optical decision making," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    9. Bowen Bai & Qipeng Yang & Haowen Shu & Lin Chang & Fenghe Yang & Bitao Shen & Zihan Tao & Jing Wang & Shaofu Xu & Weiqiang Xie & Weiwen Zou & Weiwei Hu & John E. Bowers & Xingjun Wang, 2023. "Microcomb-based integrated photonic processing unit," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    10. Xiangyan Meng & Guojie Zhang & Nuannuan Shi & Guangyi Li & José Azaña & José Capmany & Jianping Yao & Yichen Shen & Wei Li & Ninghua Zhu & Ming Li, 2023. "Compact optical convolution processing unit based on multimode interference," Nature Communications, Nature, vol. 14(1), pages 1-9, December.
    11. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    12. Yiwei Li & Ning An & Zheyi Lu & Yuchen Wang & Bing Chang & Teng Tan & Xuhan Guo & Xizhen Xu & Jun He & Handing Xia & Zhaohui Wu & Yikai Su & Yuan Liu & Yunjiang Rao & Giancarlo Soavi & Baicheng Yao, 2022. "Nonlinear co-generation of graphene plasmons for optoelectronic logic operations," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    13. Mitsumasa Nakajima & Katsuma Inoue & Kenji Tanaka & Yasuo Kuniyoshi & Toshikazu Hashimoto & Kohei Nakajima, 2022. "Physical deep learning with biologically inspired training method: gradient-free approach for physical hardware," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    14. Jingzhao Zhang & Yanan Wang & Benben Jiang & Haowei He & Shaobo Huang & Chen Wang & Yang Zhang & Xuebing Han & Dongxu Guo & Guannan He & Minggao Ouyang, 2023. "Realistic fault detection of li-ion battery via dynamical deep learning," Nature Communications, Nature, vol. 14(1), pages 1-8, December.
    15. Minkyu Shin & Jin Kim & Bas van Opheusden & Thomas L. Griffiths, 2023. "Superhuman Artificial Intelligence Can Improve Human Decision Making by Increasing Novelty," Papers 2303.07462, arXiv.org, revised Apr 2023.
    16. Ming Deng & Michele Cotrufo & Jian Wang & Jianji Dong & Zhichao Ruan & Andrea Alù & Lin Chen, 2024. "Broadband angular spectrum differentiation using dielectric metasurfaces," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    17. Guangwei Cong & Noritsugu Yamamoto & Takashi Inoue & Yuriko Maegami & Morifumi Ohno & Shota Kita & Shu Namiki & Koji Yamada, 2022. "On-chip bacterial foraging training in silicon photonic circuits for projection-enabled nonlinear classification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    18. Dmitry Kazakov & Theodore P. Letsou & Maximilian Beiser & Yiyang Zhi & Nikola Opačak & Marco Piccardo & Benedikt Schwarz & Federico Capasso, 2024. "Active mid-infrared ring resonators," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
    19. Yang He & Raymond Lopez-Rios & Usman A. Javid & Jingwei Ling & Mingxiao Li & Shixin Xue & Kerry Vahala & Qiang Lin, 2023. "High-speed tunable microwave-rate soliton microcomb," Nature Communications, Nature, vol. 14(1), pages 1-7, December.
    20. Liuting Shan & Qizhen Chen & Rengjian Yu & Changsong Gao & Lujian Liu & Tailiang Guo & Huipeng Chen, 2023. "A sensory memory processing system with multi-wavelength synaptic-polychromatic light emission for multi-modal information recognition," Nature Communications, Nature, vol. 14(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45305-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.