A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor

A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor

Author

Listed:

Jie Chen
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)
Kai Xiao
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)
Ke Huang
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)
Zhen Yang
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)
Qing Chu
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)
Guanfu Jiang
(National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

Abstract

The reactor system has multivariate, nonlinear, and strongly coupled dynamic characteristics, which puts high demands on the robustness, real-time demand, and accuracy of the control strategy. Conventional control approaches depend on the mathematical model of the system being controlled, making it challenging to handle the reactor system’s dynamic complexity and uncertainties. This paper proposes a multi-variable coupled control strategy for a nuclear reactor steam supply system based on a Deep Deterministic Policy Gradient reinforcement learning algorithm, designs and trains a multi-variable coupled intelligent controller to simultaneously realize the coordinated control of multiple parameters, such as the reactor power, average coolant temperature, steam pressure, etc., and performs a simulation validation of the control strategy under the typical transient variable load working conditions. Simulation results show that the reinforcement learning control effect is better than the PID control effect under a ±10% FP step variable load condition, a linear variable load condition, and a load dumping condition, and that the reactor power overshooting amount and regulation time, the maximum deviation of the coolant average temperature, the steam pressure, the pressure of pressurizer and relative liquid level, and the regulation time are improved by at least 15.5% compared with the traditional control method. Therefore, this study offers a theoretical framework for utilizing reinforcement learning in the field of nuclear reactor control.

Suggested Citation

Jie Chen & Kai Xiao & Ke Huang & Zhen Yang & Qing Chu & Guanfu Jiang, 2025. "A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor," Energies, MDPI, vol. 18(6), pages 1-26, March.

Handle: RePEc:gam:jeners:v:18:y:2025:i:6:p:1517-:d:1615560

Download full text from publisher

References listed on IDEAS

Yi, Zonggen & Luo, Yusheng & Westover, Tyler & Katikaneni, Sravya & Ponkiya, Binaka & Sah, Suba & Mahmud, Sadab & Raker, David & Javaid, Ahmad & Heben, Michael J. & Khanna, Raghav, 2022. "Deep reinforcement learning based optimization for a tightly coupled nuclear renewable integrated energy system," Applied Energy, Elsevier, vol. 328(C).
Zhang, Tianhao & Dong, Zhe & Huang, Xiaojin, 2024. "Multi-objective optimization of thermal power and outlet steam temperature for a nuclear steam supply system with deep reinforcement learning," Energy, Elsevier, vol. 286(C).
Dong, Zhe & Huang, Xiaojin & Dong, Yujie & Zhang, Zuoyi, 2020. "Multilayer perception based reinforcement learning supervisory control of energy systems with application to a nuclear steam supply system," Applied Energy, Elsevier, vol. 259(C).

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Zhang, Qi & Xiao, Longhao & Wei, Xinyu & Sun, Peiwei, 2025. "Study on the automatic procedure startup/shutdown control of a PWR nuclear power plant with advanced control," Energy, Elsevier, vol. 334(C).
Becerra-Fernandez, Mauricio & Sarmiento, Alfonso T. & Cardenas, Laura M., 2023. "Sustainability assessment of the solar energy supply chain in Colombia," Energy, Elsevier, vol. 282(C).
Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
Hui, Jiuwu & Lee, Yi-Kuen & Yuan, Jingqi, 2023. "Load following control of a PWR with load-dependent parameters and perturbations via fixed-time fractional-order sliding mode and disturbance observer techniques," Renewable and Sustainable Energy Reviews, Elsevier, vol. 184(C).
Zhe Dong & Zhonghua Cheng & Yunlong Zhu & Xiaojin Huang & Yujie Dong & Zuoyi Zhang, 2023. "Review on the Recent Progress in Nuclear Plant Dynamical Modeling and Control," Energies, MDPI, vol. 16(3), pages 1-19, February.
Chen, Yan & Zhang, Ruiqian & Lyu, Jiayi & Hou, Yuqi, 2024. "AI and Nuclear: A perfect intersection of danger and potential?," Energy Economics, Elsevier, vol. 133(C).
Daeil Lee & Seoryong Koo & Inseok Jang & Jonghyun Kim, 2022. "Comparison of Deep Reinforcement Learning and PID Controllers for Automatic Cold Shutdown Operation," Energies, MDPI, vol. 15(8), pages 1-25, April.
Yang, Zhixue & Ren, Zhouyang & Li, Hui & Sun, Zhiyuan & Feng, Jianbing & Xia, Weiyi, 2024. "A multi-stage stochastic dispatching method for electricity‑hydrogen integrated energy systems driven by model and data," Applied Energy, Elsevier, vol. 371(C).
Song, Houde & Song, Meiqi & Liu, Xiaojing, 2022. "Online autonomous calibration of digital twins using machine learning with application to nuclear power plants," Applied Energy, Elsevier, vol. 326(C).
Sushanta Gautam & Austin Szczublewski & Aidan Fox & Sadab Mahmud & Ahmad Javaid & Temitayo O. Olowu & Tyler Westover & Raghav Khanna, 2025. "Digital Real-Time Simulation and Power Quality Analysis of a Hydrogen-Generating Nuclear-Renewable Integrated Energy System," Energies, MDPI, vol. 18(4), pages 1-22, February.
Gong, Lanxin & Peng, Changhong & Huang, Qingyu & Lin, Yuanfeng, 2025. "EKF-MCMC data assimilation framework for real-time state estimation and uncertainty quantification in reactor thermal-hydraulic analysis," Energy, Elsevier, vol. 340(C).
Mahmud, Sakib & Sayed, Aya Nabil & Himeur, Yassine & Nhlabatsi, Armstrong & Bensaali, Faycal, 2026. "A comprehensive review of deep reinforcement learning applications from centralized power generation to modern energy internet frameworks," Renewable and Sustainable Energy Reviews, Elsevier, vol. 226(PE).
Zheng, Qiankang & Lu, Le & Chen, Zhaofeng & Wu, Qiong & Yang, Mengmeng & Hou, Bin & Chen, Shijie & Zhang, Zhuoke & Yang, Lixia & Cui, Sheng, 2024. "The real-time detection of defects in nuclear power pipeline thermal insulation glass fiber by deep-learning," Energy, Elsevier, vol. 313(C).
Cui, Feifei & An, Dou & Xi, Huan & Ren, Zhigang, 2025. "Collaborative scheduling optimization of hydrogen-enhanced integrated energy system via goal-conditioned hierarchical reinforcement learning," Energy, Elsevier, vol. 338(C).
Wu, Shifa & Ma, Xiaolong & Liu, Junfeng & Wan, Jiashuang & Wang, Pengfei & Su, G.H., 2023. "A load following control strategy for Chinese Modular High-Temperature Gas-Cooled Reactor HTR-PM," Energy, Elsevier, vol. 263(PA).
Cui, Feifei & An, Dou & Xi, Huan, 2024. "Integrated energy hub dispatch with a multi-mode CAES–BESS hybrid system: An option-based hierarchical reinforcement learning approach," Applied Energy, Elsevier, vol. 374(C).
Huang, Qingyu & Zeng, Wei & Liu, Jia & Zhang, Zhuo & Deng, Jian & Qiu, Zhifang & Xu, Le & Wei, Zonglan & Lu, Qi & Gong, Lanxin & Shi, Chunsen & Zhong, Xianping, 2025. "Shaping the future of nuclear reactors with digital twins: Current developments and perspectives," Applied Energy, Elsevier, vol. 402(PA).
Islam, Md. Monirul & Shahbaz, Muhammad & Samargandi, Nahla, 2024. "The nexus between Russian uranium exports and US nuclear-energy consumption: Do the spillover effects of geopolitical risks matter?," Energy, Elsevier, vol. 293(C).
Hui, Jiuwu, 2024. "Discrete-time integral terminal sliding mode load following controller coupled with disturbance observer for a modular high-temperature gas-cooled reactor," Energy, Elsevier, vol. 292(C).
Yang, Ting & Wang, Qiancheng & Wang, Xudong & Wang, Lin & Geng, Yinan, 2025. "Low-carbon economic distributed dispatch for district-level integrated energy system considering privacy protection and demand response," Applied Energy, Elsevier, vol. 383(C).

More about this item

Keywords

; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:6:p:1517-:d:1615560. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data