Author
Listed:
- Bibars Amangeldy
(LLP “DigitAlem”, Almaty 050042, Kazakhstan
Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan)
- Nurdaulet Tasmurzayev
(LLP “DigitAlem”, Almaty 050042, Kazakhstan
Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan)
- Timur Imankulov
(LLP “DigitAlem”, Almaty 050042, Kazakhstan
Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan)
- Baglan Imanbek
(Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan)
- Waldemar Wójcik
(Institute of Electronics and Information Technology, Politechnika Lubelska, 20-618 Lublin, Poland)
- Yedil Nurakhov
(LLP “DigitAlem”, Almaty 050042, Kazakhstan
Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan)
Abstract
Accurate forecasting of energy consumption in buildings is essential for achieving energy efficiency and reducing carbon emissions. However, many existing models rely on limited input variables and overlook the complex influence of indoor environmental quality (IEQ). In this study, we assess the performance of hybrid machine learning ensembles for predicting hourly energy demand in a smart office environment using high-frequency IEQ sensor data. Environmental variables including carbon dioxide concentration (CO 2 ), particulate matter (PM 2.5 ), total volatile organic compounds (TVOCs), noise levels, humidity, and temperature were recorded over a four-month period. We evaluated two ensemble configurations combining support vector regression (SVR) with either Random Forest or LightGBM as base learners and Ridge regression as a meta-learner, alongside single-model baselines such as SVR and artificial neural networks (ANN). The SVR combined with Random Forest and Ridge regression demonstrated the highest predictive performance, achieving a mean absolute error (MAE) of 1.20, a mean absolute percentage error (MAPE) of 8.92%, and a coefficient of determination (R 2 ) of 0.82. Feature importance analysis using SHAP values, together with non-parametric statistical testing, identified TVOCs, humidity, and PM 2.5 as the most influential predictors of energy use. These findings highlight the value of integrating high-resolution IEQ data into predictive frameworks and demonstrate that such data can significantly improve forecasting accuracy. This effect is attributed to the direct link between these IEQ variables and the activation of energy-intensive systems; fluctuations in humidity drive HVAC energy use for dehumidification, while elevated pollutant levels (TVOCs, PM 2.5 ) trigger increased ventilation to maintain indoor air quality, thus raising the total energy load.
Suggested Citation
Bibars Amangeldy & Nurdaulet Tasmurzayev & Timur Imankulov & Baglan Imanbek & Waldemar Wójcik & Yedil Nurakhov, 2025.
"A Hybrid Machine Learning Approach for High-Accuracy Energy Consumption Prediction Using Indoor Environmental Quality Sensors,"
Energies, MDPI, vol. 18(15), pages 1-31, August.
Handle:
RePEc:gam:jeners:v:18:y:2025:i:15:p:4164-:d:1718416
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:15:p:4164-:d:1718416. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.