IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v18y2025i16p4272-d1722134.html
   My bibliography  Save this article

CCPP Power Prediction Using CatBoost with Domain Knowledge and Recursive Feature Elimination

Author

Listed:
  • Baicun Guo

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Bowen Yang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Weizhan Shi

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Fengye Yang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Dong Wang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Shuhong Wang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

Abstract

Combined cycle power plants are modern power generation systems that provide an efficient and environmentally friendly way of generating electricity. With the development of smart grids, higher requirements have been put forward for their power prediction. Using a dataset comprising 9568 observations from a combined cycle power plant operating at full load for 6 years, a high-precision power prediction model integrating CatBoost and domain knowledge is proposed. Twenty new features were designed based on domain expertise, and Recursive Feature Elimination was applied to select the most informative features, optimizing model performance. Experimental results demonstrate that CatBoost outperformed six commonly used machine learning algorithms, both with and without domain knowledge integration. And the incorporation of domain knowledge improved the predictive performance of all evaluated models, underscoring the effectiveness and general applicability of the proposed features. Moreover, Recursive Feature Elimination was applied to select 11 features. The optimized CatBoost model achieved the best predictive accuracy with a root mean square error of 2.8545, a mean absolute error of 1.9645, and an R-squared of 0.9702. A comparative analysis with existing literature methods further validated the superior performance of the proposed approach. These findings highlight the effectiveness of integrating domain knowledge with machine learning and its potential for improving power output prediction in combined cycle power plants.

Suggested Citation

  • Baicun Guo & Bowen Yang & Weizhan Shi & Fengye Yang & Dong Wang & Shuhong Wang, 2025. "CCPP Power Prediction Using CatBoost with Domain Knowledge and Recursive Feature Elimination," Energies, MDPI, vol. 18(16), pages 1-23, August.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/16/4272/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/16/4272/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Qu, Zhijian & Xu, Juan & Wang, Zixiao & Chi, Rui & Liu, Hanxin, 2021. "Prediction of electricity generation from a combined cycle power plant based on a stacking ensemble and its hyperparameter optimization with a grid-search method," Energy, Elsevier, vol. 227(C).
    2. Shaker, Hamid & Manfre, Daniel & Zareipour, Hamidreza, 2020. "Forecasting the aggregated output of a large fleet of small behind-the-meter solar photovoltaic sites," Renewable Energy, Elsevier, vol. 147(P1), pages 1861-1869.
    3. Wang, Yun & Wang, Haibo & Srinivasan, Dipti & Hu, Qinghua, 2019. "Robust functional regression for wind speed forecasting based on Sparse Bayesian learning," Renewable Energy, Elsevier, vol. 132(C), pages 43-60.
    4. Ivan Lorencin & Nikola Anđelić & Vedran Mrzljak & Zlatan Car, 2019. "Genetic Algorithm Approach to Design of Multi-Layer Perceptron for Combined Cycle Power Plant Electrical Power Output Estimation," Energies, MDPI, vol. 12(22), pages 1-26, November.
    5. Aasim, & Singh, S.N. & Mohapatra, Abheejeet, 2019. "Repeated wavelet transform based ARIMA model for very short-term wind speed forecasting," Renewable Energy, Elsevier, vol. 136(C), pages 758-768.
    6. Asif Afzal & Saad Alshahrani & Abdulrahman Alrobaian & Abdulrajak Buradi & Sher Afghan Khan, 2021. "Power Plant Energy Predictions Based on Thermal Factors Using Ridge and Support Vector Regressor Algorithms," Energies, MDPI, vol. 14(21), pages 1-22, November.
    7. Hundi, Prabhas & Shahsavari, Rouzbeh, 2020. "Comparative studies among machine learning models for performance estimation and health monitoring of thermal power plants," Applied Energy, Elsevier, vol. 265(C).
    8. Wang, Yun & Wang, Jianzhou & Wei, Xiang, 2015. "A hybrid wind speed forecasting model based on phase space reconstruction theory and Markov model: A case study of wind farms in northwest China," Energy, Elsevier, vol. 91(C), pages 556-572.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Huaizhi & Xue, Wenli & Liu, Yitao & Peng, Jianchun & Jiang, Hui, 2020. "Probabilistic wind power forecasting based on spiking neural network," Energy, Elsevier, vol. 196(C).
    2. Wang, Yun & Zou, Runmin & Liu, Fang & Zhang, Lingjun & Liu, Qianyi, 2021. "A review of wind speed and wind power forecasting with deep neural networks," Applied Energy, Elsevier, vol. 304(C).
    3. Xiang Ying & Keke Zhao & Zhiqiang Liu & Jie Gao & Dongxiao He & Xuewei Li & Wei Xiong, 2022. "Wind Speed Prediction via Collaborative Filtering on Virtual Edge Expanding Graphs," Mathematics, MDPI, vol. 10(11), pages 1-16, June.
    4. Qu, Zhijian & Xu, Juan & Wang, Zixiao & Chi, Rui & Liu, Hanxin, 2021. "Prediction of electricity generation from a combined cycle power plant based on a stacking ensemble and its hyperparameter optimization with a grid-search method," Energy, Elsevier, vol. 227(C).
    5. Asif Afzal & Saad Alshahrani & Abdulrahman Alrobaian & Abdulrajak Buradi & Sher Afghan Khan, 2021. "Power Plant Energy Predictions Based on Thermal Factors Using Ridge and Support Vector Regressor Algorithms," Energies, MDPI, vol. 14(21), pages 1-22, November.
    6. Qu, Zhijian & Hou, Xinxing & Li, Jian & Hu, Wenbo, 2024. "Short-term wind farm cluster power prediction based on dual feature extraction and quadratic decomposition aggregation," Energy, Elsevier, vol. 290(C).
    7. Siavash Asiaban & Nezmin Kayedpour & Arash E. Samani & Dimitar Bozalakov & Jeroen D. M. De Kooning & Guillaume Crevecoeur & Lieven Vandevelde, 2021. "Wind and Solar Intermittency and the Associated Integration Challenges: A Comprehensive Review Including the Status in the Belgian Power System," Energies, MDPI, vol. 14(9), pages 1-41, May.
    8. Noman Khan & Fath U Min Ullah & Ijaz Ul Haq & Samee Ullah Khan & Mi Young Lee & Sung Wook Baik, 2021. "AB-Net: A Novel Deep Learning Assisted Framework for Renewable Energy Generation Forecasting," Mathematics, MDPI, vol. 9(19), pages 1-18, October.
    9. Liu, Hui & Duan, Zhu, 2020. "A vanishing moment ensemble model for wind speed multi-step prediction with multi-objective base model selection," Applied Energy, Elsevier, vol. 261(C).
    10. Liu, Hui & Chen, Chao, 2019. "Data processing strategies in wind energy forecasting models and applications: A comprehensive review," Applied Energy, Elsevier, vol. 249(C), pages 392-408.
    11. Zhang, Jinliang & Wei, Yiming & Tan, Zhongfu, 2020. "An adaptive hybrid model for short term wind speed forecasting," Energy, Elsevier, vol. 190(C).
    12. Sun, Jingbo & Wang, Yang & He, Yuan & Cui, Wenrui & Chao, Qingchen & Shan, Baoguo & Wang, Zheng & Yang, Xiaofan, 2024. "The energy security risk assessment of inefficient wind and solar resources under carbon neutrality in China," Applied Energy, Elsevier, vol. 360(C).
    13. Konstantinos Blazakis & Yiannis Katsigiannis & Georgios Stavrakakis, 2022. "One-Day-Ahead Solar Irradiation and Windspeed Forecasting with Advanced Deep Learning Techniques," Energies, MDPI, vol. 15(12), pages 1-25, June.
    14. Wei, Danxiang & Wang, Jianzhou & Niu, Xinsong & Li, Zhiwu, 2021. "Wind speed forecasting system based on gated recurrent units and convolutional spiking neural networks," Applied Energy, Elsevier, vol. 292(C).
    15. Shang, Zhihao & He, Zhaoshuang & Chen, Yao & Chen, Yanhua & Xu, MingLiang, 2022. "Short-term wind speed forecasting system based on multivariate time series and multi-objective optimization," Energy, Elsevier, vol. 238(PC).
    16. Li, Min & Yang, Yi & He, Zhaoshuang & Guo, Xinbo & Zhang, Ruisheng & Huang, Bingqing, 2023. "A wind speed forecasting model based on multi-objective algorithm and interpretability learning," Energy, Elsevier, vol. 269(C).
    17. Liu, Xiaolei & Lin, Zi & Feng, Ziming, 2021. "Short-term offshore wind speed forecast by seasonal ARIMA - A comparison against GRU and LSTM," Energy, Elsevier, vol. 227(C).
    18. Zhang, Zongbin & Huang, Xiaoqiao & Li, Chengli & Cheng, Feiyan & Tai, Yonghang, 2025. "CRAformer: A cross-residual attention transformer for solar irradiation multistep forecasting," Energy, Elsevier, vol. 320(C).
    19. Qingyuan Wang & Longnv Huang & Jiehui Huang & Qiaoan Liu & Limin Chen & Yin Liang & Peter X. Liu & Chunquan Li, 2022. "A Hybrid Generative Adversarial Network Model for Ultra Short-Term Wind Speed Prediction," Sustainability, MDPI, vol. 14(15), pages 1-16, July.
    20. Yin, Sihua & Yang, Haidong & Xu, Kangkang & Zhu, Chengjiu & Zhang, Shaqing & Liu, Guosheng, 2022. "Dynamic real–time abnormal energy consumption detection and energy efficiency optimization analysis considering uncertainty," Applied Energy, Elsevier, vol. 307(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.