IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v18y2025i16p4272-d1722134.html
   My bibliography  Save this article

CCPP Power Prediction Using CatBoost with Domain Knowledge and Recursive Feature Elimination

Author

Listed:
  • Baicun Guo

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Bowen Yang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Weizhan Shi

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Fengye Yang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Dong Wang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

  • Shuhong Wang

    (School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China)

Abstract

Combined cycle power plants are modern power generation systems that provide an efficient and environmentally friendly way of generating electricity. With the development of smart grids, higher requirements have been put forward for their power prediction. Using a dataset comprising 9568 observations from a combined cycle power plant operating at full load for 6 years, a high-precision power prediction model integrating CatBoost and domain knowledge is proposed. Twenty new features were designed based on domain expertise, and Recursive Feature Elimination was applied to select the most informative features, optimizing model performance. Experimental results demonstrate that CatBoost outperformed six commonly used machine learning algorithms, both with and without domain knowledge integration. And the incorporation of domain knowledge improved the predictive performance of all evaluated models, underscoring the effectiveness and general applicability of the proposed features. Moreover, Recursive Feature Elimination was applied to select 11 features. The optimized CatBoost model achieved the best predictive accuracy with a root mean square error of 2.8545, a mean absolute error of 1.9645, and an R-squared of 0.9702. A comparative analysis with existing literature methods further validated the superior performance of the proposed approach. These findings highlight the effectiveness of integrating domain knowledge with machine learning and its potential for improving power output prediction in combined cycle power plants.

Suggested Citation

  • Baicun Guo & Bowen Yang & Weizhan Shi & Fengye Yang & Dong Wang & Shuhong Wang, 2025. "CCPP Power Prediction Using CatBoost with Domain Knowledge and Recursive Feature Elimination," Energies, MDPI, vol. 18(16), pages 1-23, August.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/16/4272/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/16/4272/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.