IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v18y2026i12p6366-d1972810.html

Hybrid Machine Learning Models for Predicting Gross CO 2 e Balance in Polish Forest Stands: A Tool for Sustainable Forest Carbon Assessment in the Circular Economy

Author

Listed:
  • Krzysztof Przybył

    (Department of Biosystems Engineering, Poznań University of Life Sciences, Wojska Polskiego 50, 60-627 Poznań, Poland)

  • Agnieszka A. Pilarska

    (Department of Hydraulic and Sanitary Engineering, Poznań University of Life Sciences, Piątkowska 94A, 60-649 Poznań, Poland)

  • Krzysztof Pilarski

    (Department of Biosystems Engineering, Poznań University of Life Sciences, Wojska Polskiego 50, 60-627 Poznań, Poland)

Abstract

Forest carbon assessment requires methods that capture the combined effects of stand structure, site conditions, carbon pools, operational emissions, and circular-economy processes. This study aimed to develop and optimize hybrid machine learning models for predicting the gross CO 2 e (carbon dioxide equivalent) balance of Polish forest stands using measurable stand- and site-related variables. The research was based on a primary dataset describing forest management in major Polish macroregions in 2020–2024. After data cleaning and preprocessing, multiple machine learning algorithms, including ensemble, boosting, neural, and hybrid models, were trained, validated, and tested. Model performance was assessed using standard regression metrics, overfitting diagnostics, learning curves, and SHAP (Shapley Additive Explanations). Most models achieved high predictive accuracy, with six of ten algorithms reaching R 2 values above 0.90 on the test set. The reduction in strongly correlated variables helped limit multicollinearity and excessive overlap between predictors and the target variable, supporting a more reliable interpretation of model performance. The CatBoost algorithm achieved the highest predictive performance on the test set (R 2 = 0.948), while also recording the lowest root mean squared error (RMSE = 152.242). However, the Decision Tree demonstrated the weakest generalization performance (R 2 = 0.806) on the test set. SHAP analysis identified tree height as the most influential predictor, followed by tree age, number of trees, species composition, and selected habitat features. The novelty of the study lies in integrating hybrid machine learning, interpretable modelling, and circular-economy-related carbon balance components into a single framework for rapid and operational forest carbon assessment in Polish forest stands.

Suggested Citation

  • Krzysztof Przybył & Agnieszka A. Pilarska & Krzysztof Pilarski, 2026. "Hybrid Machine Learning Models for Predicting Gross CO 2 e Balance in Polish Forest Stands: A Tool for Sustainable Forest Carbon Assessment in the Circular Economy," Sustainability, MDPI, vol. 18(12), pages 1-34, June.
  • Handle: RePEc:gam:jsusta:v:18:y:2026:i:12:p:6366-:d:1972810
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/18/12/6366/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/18/12/6366/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2026:i:12:p:6366-:d:1972810. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.