High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit

My bibliography Save this article

High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit

Author

Listed:

Xuan-Kun Li
(Shanghai Jiao Tong University
Hefei National Laboratory)
Jian-Xu Ma
(TuringQ Co., Ltd.)
Xiang-Yu Li
(TuringQ Co., Ltd.)
Jun-Jie Hu
(Shanghai Jiao Tong University
Hefei National Laboratory)
Chuan-Yang Ding
(Shanghai Jiao Tong University
Hefei National Laboratory)
Feng-Kai Han
(Shanghai Jiao Tong University
Hefei National Laboratory)
Xiao-Min Guo
(TuringQ Co., Ltd.)
Xi Tan
(Shanghai Jiao Tong University
Hefei National Laboratory)
Xian-Min Jin
(Shanghai Jiao Tong University
Hefei National Laboratory
TuringQ Co., Ltd.
Shanghai Jiao Tong University)

Registered:

Abstract

Reinforcement learning (RL) stands as one of the three fundamental paradigms within machine learning and has made a substantial leap to build general-purpose learning systems. However, using traditional electrical computers to simulate agent-environment interactions in RL models consumes tremendous computing resources, posing a significant challenge to the efficiency of RL. Here, we propose a universal framework that utilizes a photonic integrated circuit (PIC) to simulate the interactions in RL for improving the algorithm efficiency. High parallelism and precision on-chip optical interaction calculations are implemented with the assistance of link calibration in the hybrid architecture PIC. By introducing similarity information into the reward function of the RL model, PIC-RL successfully accomplishes perovskite materials synthesis task within a 3472-dimensional state space, resulting in a notable 56% improvement in efficiency. Our results validate the effectiveness of simulating RL algorithm interactions on the PIC platform, highlighting its potential to boost computing power in large-scale and sophisticated RL tasks.

Suggested Citation

Xuan-Kun Li & Jian-Xu Ma & Xiang-Yu Li & Jun-Jie Hu & Chuan-Yang Ding & Feng-Kai Han & Xiao-Min Guo & Xi Tan & Xian-Min Jin, 2024. "High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit," Nature Communications, Nature, vol. 15(1), pages 1-10, December.

Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45305-z
DOI: 10.1038/s41467-024-45305-z

Download full text from publisher

References listed on IDEAS

Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
G. Mourgias-Alexandris & M. Moralis-Pegios & A. Tsakyridis & S. Simos & G. Dabos & A. Totovic & N. Passalis & M. Kirtas & T. Rutirawut & F. Y. Gardes & A. Tefas & N. Pleros, 2022. "Noise-resilient and high-speed deep learning with coherent silicon photonics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
Kathryn Tunyasuvunakool & Jonas Adler & Zachary Wu & Tim Green & Michal Zielinski & Augustin Žídek & Alex Bridgland & Andrew Cowie & Clemens Meyer & Agata Laydon & Sameer Velankar & Gerard J. Kleywegt, 2021. "Highly accurate protein structure prediction for the human proteome," Nature, Nature, vol. 596(7873), pages 590-596, August.
Hsinhan Tsai & Wanyi Nie & Jean-Christophe Blancon & Constantinos C. Stoumpos & Reza Asadpour & Boris Harutyunyan & Amanda J. Neukirch & Rafael Verduzco & Jared J. Crochet & Sergei Tretiak & Laurent P, 2016. "High-efficiency two-dimensional Ruddlesden–Popper perovskite solar cells," Nature, Nature, vol. 536(7616), pages 312-316, August.
J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Publisher Correction: Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 591(7849), pages 13-13, March.
Yang Shi & Junyu Ren & Guanyu Chen & Wei Liu & Chuqi Jin & Xiangyu Guo & Yu Yu & Xinliang Zhang, 2022. "Nonlinear germanium-silicon photodiode for activation and monitoring in photonic neuromorphic networks," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
Guo-Wei Lu & Jianxun Hong & Feng Qiu & Andrew M. Spring & Tsubasa Kashino & Juro Oshima & Masa-aki Ozawa & Hideyuki Nawata & Shiyoshi Yokoyama, 2020. "High-temperature-resistant silicon-polymer hybrid modulator operating at up to 200 Gbit s−1 for energy-efficient datacentres and harsh-environment applications," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
V. Saggio & B. E. Asenbeck & A. Hamann & T. Strömberg & P. Schiansky & V. Dunjko & N. Friis & N. C. Harris & M. Hochberg & D. Englund & S. Wölk & H. J. Briegel & P. Walther, 2021. "Experimental quantum speed-up in reinforcement learning agents," Nature, Nature, vol. 591(7849), pages 229-233, March.
Farshid Ashtiani & Alexander J. Geers & Firooz Aflatouni, 2022. "An on-chip photonic deep neural network for image classification," Nature, Nature, vol. 606(7914), pages 501-506, June.
Xingyuan Xu & Mengxi Tan & Bill Corcoran & Jiayang Wu & Andreas Boes & Thach G. Nguyen & Sai T. Chu & Brent E. Little & Damien G. Hicks & Roberto Morandotti & Arnan Mitchell & David J. Moss, 2021. "11 TOPS photonic convolutional accelerator for optical neural networks," Nature, Nature, vol. 589(7840), pages 44-51, January.
J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 589(7840), pages 52-58, January.
David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Cheng Wang & Mian Zhang & Xi Chen & Maxime Bertrand & Amirhassan Shams-Ansari & Sethumadhavan Chandrasekhar & Peter Winzer & Marko Lončar, 2018. "Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages," Nature, Nature, vol. 562(7725), pages 101-104, October.
Keith T. Butler & Daniel W. Davies & Hugh Cartwright & Olexandr Isayev & Aron Walsh, 2018. "Machine learning for molecular and materials science," Nature, Nature, vol. 559(7715), pages 547-555, July.
J. M. Arrazola & V. Bergholm & K. Brádler & T. R. Bromley & M. J. Collins & I. Dhand & A. Fumagalli & T. Gerrits & A. Goussev & L. G. Helt & J. Hundal & T. Isacsson & R. B. Israel & J. Izaac & S. Jaha, 2021. "Quantum circuits with many photons on a programmable nanophotonic chip," Nature, Nature, vol. 591(7848), pages 54-60, March.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Wang, Zixuan & Chen, Zijian & Wang, Boyuan & Wu, Chuang & Zhou, Chao & Peng, Yang & Zhang, Xinyu & Ni, Zongming & Chung, Chi-yung & Chan, Ching-chuen & Yang, Jian & Zhao, Haitao, 2025. "Digital manufacturing of perovskite materials and solar cells," Applied Energy, Elsevier, vol. 377(PB).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Yunping Bai & Yifu Xu & Shifan Chen & Xiaotian Zhu & Shuai Wang & Sirui Huang & Yuhang Song & Yixuan Zheng & Zhihui Liu & Sim Tan & Roberto Morandotti & Sai T. Chu & Brent E. Little & David J. Moss & , 2025. "TOPS-speed complex-valued convolutional accelerator for feature extraction and inference," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
Zhongjin Lin & Bhavin J. Shastri & Shangxuan Yu & Jingxiang Song & Yuntao Zhu & Arman Safarnejadian & Wangning Cai & Yanmei Lin & Wei Ke & Mustafa Hammood & Tianye Wang & Mengyue Xu & Zibo Zheng & Moh, 2024. "120 GOPS Photonic tensor core in thin-film lithium niobate for inference and in situ training," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
Dimitrios C. Tzarouchis & Brian Edwards & Nader Engheta, 2025. "Programmable wave-based analog computing machine: a metastructure that designs metastructures," Nature Communications, Nature, vol. 16(1), pages 1-7, December.
Yang, Kaiyuan & Huang, Houjing & Vandans, Olafs & Murali, Adithya & Tian, Fujia & Yap, Roland H.C. & Dai, Liang, 2023. "Applying deep reinforcement learning to the HP model for protein structure prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
Dongliang Wang & Yikun Nie & Gaolei Hu & Hon Ki Tsang & Chaoran Huang, 2024. "Ultrafast silicon photonic reservoir computing engine delivering over 200 TOPS," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
Junwei Cheng & Chaoran Huang & Jialong Zhang & Bo Wu & Wenkai Zhang & Xinyu Liu & Jiahui Zhang & Yiyi Tang & Hailong Zhou & Qiming Zhang & Min Gu & Jianji Dong & Xinliang Zhang, 2024. "Multimodal deep learning using on-chip diffractive optics with in situ training capability," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
Chenduan Chen & Zhan Yang & Tao Wang & Yalun Wang & Kai Gao & Jiajia Wu & Jun Wang & Jianrong Qiu & Dezhi Tan, 2024. "Ultra-broadband all-optical nonlinear activation function enabled by MoTe2/optical waveguide integrated devices," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
Jingwei Ling & Zhengdong Gao & Shixin Xue & Qili Hu & Mingxiao Li & Kaibo Zhang & Usman A. Javid & Raymond Lopez-Rios & Jeremy Staffa & Qiang Lin, 2024. "Electrically empowered microcomb laser," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
Han Zhao & Bingzhao Li & Huan Li & Mo Li, 2022. "Enabling scalable optical computing in synthetic frequency dimension using integrated cavity acousto-optics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
Betz, Ulrich A.K. & Arora, Loukik & Assal, Reem A. & Azevedo, Hatylas & Baldwin, Jeremy & Becker, Michael S. & Bostock, Stefan & Cheng, Vinton & Egle, Tobias & Ferrari, Nicola & Schneider-Futschik, El, 2023. "Game changers in science and technology - now and beyond," Technological Forecasting and Social Change, Elsevier, vol. 193(C).
Shaofu Xu & Jing Wang & Sicheng Yi & Weiwen Zou, 2022. "High-order tensor flow processing using integrated photonic circuits," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
Cui, Tianxiang & Du, Nanjiang & Yang, Xiaoying & Ding, Shusheng, 2024. "Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
Yaowen Hu & Yunxiang Song & Xinrui Zhu & Xiangwen Guo & Shengyuan Lu & Qihang Zhang & Lingyan He & Cornelis A. A. Franken & Keith Powell & Hana Warner & Daniel Assumpcao & Dylan Renaud & Ying Wang & L, 2025. "Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
Chen-Guang Wang & Wuyue Xu & Chong Li & Lili Shi & Junliang Jiang & Tingting Guo & Wen-Cheng Yue & Tianyu Li & Ping Zhang & Yang-Yang Lyu & Jiazheng Pan & Xiuhao Deng & Ying Dong & Xuecou Tu & Sining , 2024. "Integrated and DC-powered superconducting microcomb," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
Wenting Wang & Ping-Keng Lu & Abhinav Kumar Vinod & Deniz Turan & James F. McMillan & Hao Liu & Mingbin Yu & Dim-Lee Kwong & Mona Jarrahi & Chee Wei Wong, 2022. "Coherent terahertz radiation with 2.8-octave tunability through chip-scale photomixed microresonator optical parametric oscillation," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
Chenlei Li & Hongyan Yu & Tao Shu & Yueyang Zhang & Chengfeng Wen & Hengzhen Cao & Jin Xie & Hanwen Li & Zixu Xu & Gong Zhang & Zejie Yu & Huan Li & Liu Liu & Yaocheng Shi & Feng Qiu & Daoxin Dai, 2025. "PZT optical memristors," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
Niklas W. A. Gebauer & Michael Gastegger & Stefaan S. P. Hessmann & Klaus-Robert Müller & Kristof T. Schütt, 2022. "Inverse design of 3d molecular structures with conditional generative neural networks," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
Weifan Long & Taixian Hou & Xiaoyi Wei & Shichao Yan & Peng Zhai & Lihua Zhang, 2023. "A Survey on Population-Based Deep Reinforcement Learning," Mathematics, MDPI, vol. 11(10), pages 1-17, May.
Bitao Shen & Haowen Shu & Weiqiang Xie & Ruixuan Chen & Zhi Liu & Zhangfeng Ge & Xuguang Zhang & Yimeng Wang & Yunhao Zhang & Buwen Cheng & Shaohua Yu & Lin Chang & Xingjun Wang, 2023. "Harnessing microcomb-based parallel chaos for random number generation and optical decision making," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
Bowen Bai & Qipeng Yang & Haowen Shu & Lin Chang & Fenghe Yang & Bitao Shen & Zihan Tao & Jing Wang & Shaofu Xu & Weiqiang Xie & Weiwen Zou & Weiwei Hu & John E. Bowers & Xingjun Wang, 2023. "Microcomb-based integrated photonic processing unit," Nature Communications, Nature, vol. 14(1), pages 1-10, December.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45305-z. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data