IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v18y2025i6p1517-d1615560.html
   My bibliography  Save this article

A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor

Author

Listed:
  • Jie Chen

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

  • Kai Xiao

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

  • Ke Huang

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

  • Zhen Yang

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

  • Qing Chu

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

  • Guanfu Jiang

    (National Key Laboratory of Nuclear Reactor Technology, Nuclear Power Institute of China, Chengdu 610213, China)

Abstract

The reactor system has multivariate, nonlinear, and strongly coupled dynamic characteristics, which puts high demands on the robustness, real-time demand, and accuracy of the control strategy. Conventional control approaches depend on the mathematical model of the system being controlled, making it challenging to handle the reactor system’s dynamic complexity and uncertainties. This paper proposes a multi-variable coupled control strategy for a nuclear reactor steam supply system based on a Deep Deterministic Policy Gradient reinforcement learning algorithm, designs and trains a multi-variable coupled intelligent controller to simultaneously realize the coordinated control of multiple parameters, such as the reactor power, average coolant temperature, steam pressure, etc., and performs a simulation validation of the control strategy under the typical transient variable load working conditions. Simulation results show that the reinforcement learning control effect is better than the PID control effect under a ±10% FP step variable load condition, a linear variable load condition, and a load dumping condition, and that the reactor power overshooting amount and regulation time, the maximum deviation of the coolant average temperature, the steam pressure, the pressure of pressurizer and relative liquid level, and the regulation time are improved by at least 15.5% compared with the traditional control method. Therefore, this study offers a theoretical framework for utilizing reinforcement learning in the field of nuclear reactor control.

Suggested Citation

  • Jie Chen & Kai Xiao & Ke Huang & Zhen Yang & Qing Chu & Guanfu Jiang, 2025. "A Multi-Variable Coupled Control Strategy Based on a Deep Deterministic Policy Gradient Reinforcement Learning Algorithm for a Small Pressurized Water Reactor," Energies, MDPI, vol. 18(6), pages 1-26, March.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:6:p:1517-:d:1615560
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/6/1517/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/6/1517/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Yi, Zonggen & Luo, Yusheng & Westover, Tyler & Katikaneni, Sravya & Ponkiya, Binaka & Sah, Suba & Mahmud, Sadab & Raker, David & Javaid, Ahmad & Heben, Michael J. & Khanna, Raghav, 2022. "Deep reinforcement learning based optimization for a tightly coupled nuclear renewable integrated energy system," Applied Energy, Elsevier, vol. 328(C).
    2. Zhang, Tianhao & Dong, Zhe & Huang, Xiaojin, 2024. "Multi-objective optimization of thermal power and outlet steam temperature for a nuclear steam supply system with deep reinforcement learning," Energy, Elsevier, vol. 286(C).
    3. Dong, Zhe & Huang, Xiaojin & Dong, Yujie & Zhang, Zuoyi, 2020. "Multilayer perception based reinforcement learning supervisory control of energy systems with application to a nuclear steam supply system," Applied Energy, Elsevier, vol. 259(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Becerra-Fernandez, Mauricio & Sarmiento, Alfonso T. & Cardenas, Laura M., 2023. "Sustainability assessment of the solar energy supply chain in Colombia," Energy, Elsevier, vol. 282(C).
    2. Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
    3. Hui, Jiuwu & Lee, Yi-Kuen & Yuan, Jingqi, 2023. "Load following control of a PWR with load-dependent parameters and perturbations via fixed-time fractional-order sliding mode and disturbance observer techniques," Renewable and Sustainable Energy Reviews, Elsevier, vol. 184(C).
    4. Zhe Dong & Zhonghua Cheng & Yunlong Zhu & Xiaojin Huang & Yujie Dong & Zuoyi Zhang, 2023. "Review on the Recent Progress in Nuclear Plant Dynamical Modeling and Control," Energies, MDPI, vol. 16(3), pages 1-19, February.
    5. Chen, Yan & Zhang, Ruiqian & Lyu, Jiayi & Hou, Yuqi, 2024. "AI and Nuclear: A perfect intersection of danger and potential?," Energy Economics, Elsevier, vol. 133(C).
    6. Daeil Lee & Seoryong Koo & Inseok Jang & Jonghyun Kim, 2022. "Comparison of Deep Reinforcement Learning and PID Controllers for Automatic Cold Shutdown Operation," Energies, MDPI, vol. 15(8), pages 1-25, April.
    7. Yang, Zhixue & Ren, Zhouyang & Li, Hui & Sun, Zhiyuan & Feng, Jianbing & Xia, Weiyi, 2024. "A multi-stage stochastic dispatching method for electricity‑hydrogen integrated energy systems driven by model and data," Applied Energy, Elsevier, vol. 371(C).
    8. Song, Houde & Song, Meiqi & Liu, Xiaojing, 2022. "Online autonomous calibration of digital twins using machine learning with application to nuclear power plants," Applied Energy, Elsevier, vol. 326(C).
    9. Sushanta Gautam & Austin Szczublewski & Aidan Fox & Sadab Mahmud & Ahmad Javaid & Temitayo O. Olowu & Tyler Westover & Raghav Khanna, 2025. "Digital Real-Time Simulation and Power Quality Analysis of a Hydrogen-Generating Nuclear-Renewable Integrated Energy System," Energies, MDPI, vol. 18(4), pages 1-22, February.
    10. Zheng, Qiankang & Lu, Le & Chen, Zhaofeng & Wu, Qiong & Yang, Mengmeng & Hou, Bin & Chen, Shijie & Zhang, Zhuoke & Yang, Lixia & Cui, Sheng, 2024. "The real-time detection of defects in nuclear power pipeline thermal insulation glass fiber by deep-learning," Energy, Elsevier, vol. 313(C).
    11. Wu, Shifa & Ma, Xiaolong & Liu, Junfeng & Wan, Jiashuang & Wang, Pengfei & Su, G.H., 2023. "A load following control strategy for Chinese Modular High-Temperature Gas-Cooled Reactor HTR-PM," Energy, Elsevier, vol. 263(PA).
    12. Cui, Feifei & An, Dou & Xi, Huan, 2024. "Integrated energy hub dispatch with a multi-mode CAES–BESS hybrid system: An option-based hierarchical reinforcement learning approach," Applied Energy, Elsevier, vol. 374(C).
    13. Islam, Md. Monirul & Shahbaz, Muhammad & Samargandi, Nahla, 2024. "The nexus between Russian uranium exports and US nuclear-energy consumption: Do the spillover effects of geopolitical risks matter?," Energy, Elsevier, vol. 293(C).
    14. Hui, Jiuwu, 2024. "Discrete-time integral terminal sliding mode load following controller coupled with disturbance observer for a modular high-temperature gas-cooled reactor," Energy, Elsevier, vol. 292(C).
    15. Mahmud, Sadab & Ponkiya, Binaka & Katikaneni, Sravya & Pandey, Srijana & Mattimadugu, Kranthikiran & Yi, Zonggen & Walker, Victor & Wang, Congjian & Westover, Tyler & Javaid, Ahmad Y. & Heben, Michael, 2024. "Design and optimization of a modular hydrogen-based integrated energy system to maximize revenue via nuclear-renewable sources," Energy, Elsevier, vol. 313(C).
    16. Jiang, Qingfeng & Wang, Pengfei, 2025. "NSGA-II algorithm based control parameters optimization strategy for megawatt novel nuclear power systems," Energy, Elsevier, vol. 316(C).
    17. Hui, Jiuwu, 2024. "Coordinated discrete-time super-twisting sliding mode controller coupled with time-delay estimator for PWR-based nuclear steam supply system," Energy, Elsevier, vol. 301(C).
    18. Prabawa, Panggah & Choi, Dae-Hyun, 2024. "Safe deep reinforcement learning-assisted two-stage energy management for active power distribution networks with hydrogen fueling stations," Applied Energy, Elsevier, vol. 375(C).
    19. Gao, Yuan & Tahir, Mustafa & Siano, Pierluigi & Bi, Yue & Hu, Sile & Yang, Jiaqiang, 2025. "Optimization of renewable energy-based integrated energy systems: A three-stage stochastic robust model," Applied Energy, Elsevier, vol. 377(PD).
    20. Wu, Qingyang & Li, Gen & Liu, Ming & Zhang, Yufeng & Yan, Junjie & Deguchi, Yoshihiro, 2024. "The enhancement of primary frequency regulation ability of combined water and power plant based on nuclear energy: Dynamic modelling and control strategy optimization," Energy, Elsevier, vol. 313(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:6:p:1517-:d:1615560. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.