A bi-level optimization strategy for flexible and economic operation of the CHP units based on reinforcement learning and multi-objective MPC

A bi-level optimization strategy for flexible and economic operation of the CHP units based on reinforcement learning and multi-objective MPC

Author

Listed:

Zhu, Keyan
Zhang, Guangming
Zhu, Chen
Niu, Yuguang
Liu, Jizhen

Abstract

Enhancing the comprehensive performance of the combined heat and power (CHP) units is crucial for accommodating renewable energy and achieving energy conservation. To this end, a bi-level optimization strategy based on reinforcement learning (RL) and multi-objective model predictive control (MOMPC) is proposed to enhance the CHP units flexibility and economic performance. Firstly, a CHP unit model is constructed, and its various parameters are incorporated into the rolling optimization of the MOMPC, serving as the lower-level follower to solve the fundamental control. Secondly, a bi-level optimization strategy integrating the twin delayed deep deterministic policy gradient (TD3) algorithm with MOMPC (TD3-MOMPC) is proposed. The TD3 agent is designated as the upper-level leader. By decomposing the complex flexibility requirements and the optimization control sequence of the CHP unit, tasks are assigned to both the upper-level leader and the lower-level follower for bi-level interactive optimization. Thirdly, with power flexibility, heating quality, and operational economy serving as leader guidance, a multi-criterion optimization reward function is designed for the upper-level. Then, the actions of the upper-level TD3 agent are designed as dynamic weights and time-varying prediction horizons for the rolling optimization of MOMPC, serving as a bridge to connect and guide the bi-level optimization. Finally, to verify the effectiveness of the bi-level optimization strategy, extensive tests on load variation and disturbance rejection were conducted on a 300 MW CHP unit. The results show that the proposed strategy enhances the unit's load flexibility, heating quality, and operational economy.

Suggested Citation

Zhu, Keyan & Zhang, Guangming & Zhu, Chen & Niu, Yuguang & Liu, Jizhen, 2025. "A bi-level optimization strategy for flexible and economic operation of the CHP units based on reinforcement learning and multi-objective MPC," Applied Energy, Elsevier, vol. 391(C).

Handle: RePEc:eee:appene:v:391:y:2025:i:c:s030626192500580x
DOI: 10.1016/j.apenergy.2025.125850

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Kong, Xiaobing & Abdelbaky, Mohamed Abdelkarim & Liu, Xiangjie & Lee, Kwang Y., 2023. "Stable feedback linearization-based economic MPC scheme for thermal power plant," Energy, Elsevier, vol. 268(C).
Yan, Rujing & Wang, Jiangjiang & Huo, Shuojie & Zhang, Jing & Tang, Saiqiu & Yang, Mei, 2023. "Comparative study for four technologies on flexibility improvement and renewable energy accommodation of combined heat and power system," Energy, Elsevier, vol. 263(PE).
Hou, Guolian & Gong, Linjuan & Hu, Bo & Su, Huilin & Huang, Ting & Huang, Congzhi & Fan, Wei & Zhao, Yuanzhu, 2022. "Application of fast adaptive moth-flame optimization in flexible operation modeling for supercritical unit," Energy, Elsevier, vol. 239(PA).
Kortela, J. & Jämsä-Jounela, S.-L., 2014. "Model predictive control utilizing fuel and moisture soft-sensors for the BioPower 5 combined heat and power (CHP) plant," Applied Energy, Elsevier, vol. 131(C), pages 189-200.
Wang, Zhu & Liu, Ming & Yan, Hui & Yan, Junjie, 2022. "Optimization on coordinate control strategy assisted by high-pressure extraction steam throttling to achieve flexible and efficient operation of thermal power plants," Energy, Elsevier, vol. 244(PA).
Zhao, Yongliang & Liu, Ming & Wang, Chaoyang & Li, Xin & Chong, Daotong & Yan, Junjie, 2018. "Increasing operational flexibility of supercritical coal-fired power plants by regulating thermal system configuration during transient processes," Applied Energy, Elsevier, vol. 228(C), pages 2375-2386.
Li, Jiawen & Yu, Tao & Zhang, Xiaoshun & Li, Fusheng & Lin, Dan & Zhu, Hanxin, 2021. "Efficient experience replay based deep deterministic policy gradient for AGC dispatch in integrated energy system," Applied Energy, Elsevier, vol. 285(C).
Zhang, Guangming & Zhang, Chao & Wang, Wei & Cao, Huan & Chen, Zhenyu & Niu, Yuguang, 2023. "Offline reinforcement learning control for electricity and heat coordination in a supercritical CHP unit," Energy, Elsevier, vol. 266(C).
Bao, Zhejing & Ye, Yangli & Liu, Ruijie & Cheng, Weidong & Zhao, Qiang & Wu, Ting, 2022. "Scheduling coordination of back pressure CHP coupled electricity-heat energy system with adaptive constraint strategy to accommodate uncertain wind power," Energy, Elsevier, vol. 240(C).
Lv, Chaoxian & Yu, Hao & Li, Peng & Wang, Chengshan & Xu, Xiandong & Li, Shuquan & Wu, Jianzhong, 2019. "Model predictive control based robust scheduling of community integrated energy system with operational flexibility," Applied Energy, Elsevier, vol. 243(C), pages 250-265.
Hou, Guolian & Huang, Ting & Zheng, Fumeng & Huang, Congzhi, 2024. "A hierarchical reinforcement learning GPC for flexible operation of ultra-supercritical unit considering economy," Energy, Elsevier, vol. 289(C).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Hou, Guolian & Ye, Lingling & Cao, Huan, 2025. "Data-driven wide-load modeling and electricity-heat coordinated control for the supercritical combined heat and power unit," Energy, Elsevier, vol. 332(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Hou, Guolian & Huang, Ting & Zheng, Fumeng & Huang, Congzhi, 2024. "A hierarchical reinforcement learning GPC for flexible operation of ultra-supercritical unit considering economy," Energy, Elsevier, vol. 289(C).
Hou, Guolian & Huang, Ting & Zheng, Fumeng & Gong, Linjuan & Huang, Congzhi & Zhang, Jianhua, 2023. "Application of multi-agent EADRC in flexible operation of combined heat and power plant considering carbon emission and economy," Energy, Elsevier, vol. 263(PB).
Hou, Guolian & Huang, Wenchuan & Ye, Lingling, 2025. "A flexible operation scheme for ultra-supercritical unit under wide load variations based on improved EADRC and modified northern goshawk optimizer," Energy, Elsevier, vol. 334(C).
Hou, Guolian & Liu, Zeyu, 2025. "Data-driven modeling for ultra-supercritical unit based on bidirectional test-time training and improved temporal convolutional network," Energy, Elsevier, vol. 326(C).
Liu, Zefeng & Wang, Chaoyang & Fan, Jianlin & Liu, Ming & Xing, Yong & Yan, Junjie, 2024. "Enhancing the flexibility and stability of coal-fired power plants by optimizing control schemes of throttling high-pressure extraction steam," Energy, Elsevier, vol. 288(C).
Liu, Xiang & Wu, Fengyongkang & Lv, Laiquan & Wei, Lijia & Zhou, Hao, 2024. "Performance of solid particles as thermal storage media in thermal power flexibility retrofits: Effects of charging and discharging flow rates on single piece stacking bed," Energy, Elsevier, vol. 308(C).
Wang, Peng & Wang, Chaoyang & Xiao, Qi & Huang, Chonghai & Liu, Ming & Chen, Weixiong & Yan, Junjie, 2025. "Enhancing the ramp-up flexibility of the coal-fired power plant under deep peak shaving work conditions by adopting a sliding steam temperature Scheme," Energy, Elsevier, vol. 335(C).
Hou, Guolian & Ye, Lingling & Huang, Ting & Huang, Congzhi, 2024. "Intelligent modeling of combined heat and power unit under full operating conditions via improved crossformer and precise sparrow search algorithm," Energy, Elsevier, vol. 308(C).
Dong, Zhe & Cheng, Zhonghua & Zhu, Yunlong & Huang, Xiaojin & Dong, Yujie & Zhang, Zuoyi, 2023. "Coordinated control of mHTGR-based nuclear steam supply systems considering cold helium temperature," Energy, Elsevier, vol. 284(C).
Du, Zeyu & Liu, Ming & Wang, Yang & Zhou, Yu & Zhao, Yongliang & Yan, Junjie, 2025. "Energy consumption characteristics and energy saving potential of thermal power plants under ultra-low power load ratio conditions," Energy, Elsevier, vol. 330(C).
Chen, Chen & Zhao, Chenyu & Liu, Ming & Wang, Chaoyang & Yan, Junjie, 2024. "Enhancing the load cycling rate of subcritical coal-fired power plants: A novel control strategy based on data-driven feedwater active regulation," Energy, Elsevier, vol. 312(C).
Pang, Dawei & Niu, Yuguang & Du, Ming, 2025. "Phase lead error-based active disturbance rejection control for 1000 MW ultra-supercritical unit under flexible operation," Energy, Elsevier, vol. 319(C).
Hou, Guolian & Huang, Ting & Huang, Congzhi, 2023. "Flexibility improvement of 1000 MW ultra-supercritical unit under full operating conditions by error-based ADRC and fast pigeon-inspired optimizer," Energy, Elsevier, vol. 270(C).
Wang, Zhenpu & Xu, Jing & Ma, Suxia & Zhao, Guanjia & Wang, Jianfei & Gu, Yujiong, 2025. "Comparative investigation on heat pump solutions for peak shaving and heat-power decoupling in combined heat and power plants," Renewable and Sustainable Energy Reviews, Elsevier, vol. 216(C).
Wang, Pengfei & Liang, Wenlong & Gong, Huijun & Chen, Jie, 2024. "Decoupling control of core power and axial power distribution for large pressurized water reactors based on reinforcement learning," Energy, Elsevier, vol. 313(C).
Xiang, Yue & Guo, Yongtao & Wu, Gang & Liu, Junyong & Sun, Wei & Lei, Yutian & Zeng, Pingliang, 2022. "Low-carbon economic planning of integrated electricity-gas energy systems," Energy, Elsevier, vol. 249(C).
Wu, Chunying & Sun, Lingfang & Piao, Heng & Yao, Lijia, 2024. "Adaptive fuzzy finite time integral sliding mode control of the coordinated system for 350 MW supercritical once-through boiler unit to enhance flexibility," Energy, Elsevier, vol. 302(C).
Shi, Jie & Wang, Luhao & Lee, Wei-Jen & Cheng, Xingong & Zong, Xiju, 2019. "Hybrid Energy Storage System (HESS) optimization enabling very short-term wind power generation scheduling based on output feature extraction," Applied Energy, Elsevier, vol. 256(C).
Zhao, Yongliang & Song, Jian & Liu, Ming & Zhao, Yao & Olympios, Andreas V. & Sapin, Paul & Yan, Junjie & Markides, Christos N., 2022. "Thermo-economic assessments of pumped-thermal electricity storage systems employing sensible heat storage materials," Renewable Energy, Elsevier, vol. 186(C), pages 431-456.
Zheng, Ling & Zhou, Bin & Cao, Yijia & Wing Or, Siu & Li, Yong & Wing Chan, Ka, 2022. "Hierarchical distributed multi-energy demand response for coordinated operation of building clusters," Applied Energy, Elsevier, vol. 308(C).

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:391:y:2025:i:c:s030626192500580x. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A bi-level optimization strategy for flexible and economic operation of the CHP units based on reinforcement learning and multi-objective MPC

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data