IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v340y2025ics0360544225048376.html

Deep learning-driven heat load prediction: Investigating the impacts of optimizer and learning rate scheduler strategies

Author

Listed:
  • Zhu, Junda
  • Yang, Junhong
  • Peng, Mengbo
  • Liang, Xinyue
  • Ben, Chaoran

Abstract

Heat load prediction in district heating systems (DHS) presents challenges such as large-scale systems, complex dynamic characteristics with significant time delays, and the inherent randomness and uncertainty in end-user demand. Traditional models often fail to capture the intricate patterns in historical data. In contrast, deep learning models can identify complex nonlinear relationships within extensive historical data, facilitating more accurate predictions. The optimizer fine-tunes model parameters to minimize the loss function, thereby supporting effective learning and generalization, while the scheduler dynamically adjusts hyperparameters (e.g., learning rate) during training to ensure both stability and fast convergence. This study develops seven heat load prediction models based on CNN, LSTM, and KAN architectures and investigates the performance of different combinations of three optimizers (SGD, Adam, AdamW) and three learning rate schedulers (ExponentialLR, StepLR, CosineAnnealingLR). The results indicate that Adam and AdamW outperform SGD, with SGD suffering from underfitting. Compared to SGD, the models under the Adam and AdamW frameworks exhibit significantly lower mean MAE, RMSE, and MAPE. Specifically, for Adam and AdamW, the mean MAE values are 0.9668 and 1.1746, the mean RMSE values are 1.4053 and 1.6154, and the mean MAPE values are 0.0047 and 0.0057, respectively. CosineAnnealingLR performs exceptionally well by smoothly adjusting the learning rate, enabling the model to effectively capture both periodic and non-stationary features, leading to stable and precise predictions. The CNN-KAN model stands out, demonstrating remarkable stability and robustness, with R2 values greater than 0.95 and 0.97 under the Adam and AdamW frameworks, respectively. Therefore, in the DHS heat load forecasting, the combination of Adam/AdamW and CosineAnnealingLR enables the model to maintain high-precision predictions, while the CNN-KAN architecture demonstrates stable and effective performance on complex heat load forecasting tasks. This study provides actionable guidance for the selection of optimizers and learning-rate schedulers in DHS heat-load forecasting.

Suggested Citation

  • Zhu, Junda & Yang, Junhong & Peng, Mengbo & Liang, Xinyue & Ben, Chaoran, 2025. "Deep learning-driven heat load prediction: Investigating the impacts of optimizer and learning rate scheduler strategies," Energy, Elsevier, vol. 340(C).
  • Handle: RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376
    DOI: 10.1016/j.energy.2025.139195
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544225048376
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2025.139195?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Sun, Chunhua & Yan, Hao & Cao, Shanshan & Xia, Guoqiang & Liu, Yanan & Wu, Xiangdong, 2024. "A control strategy considering buildings’ thermal characteristics to mitigate heat supply-demand mismatches in district heating systems," Energy, Elsevier, vol. 307(C).
    2. Yanqiao Deng & Minda Ma & Nan Zhou & Chenchen Zou & Zhili Ma & Ran Yan & Xin Ma, 2024. "Provincial allocation of China's commercial building operational carbon towards carbon neutrality," Papers 2412.14523, arXiv.org, revised Jan 2025.
    3. Shufan Zhang & Minda Ma & Nan Zhou & Jinyue Yan & Wei Feng & Ran Yan & Kairui You & Jingjing Zhang & Jing Ke, 2024. "Estimation of Global Building Stocks by 2070: Unlocking Renovation Potential," Papers 2406.04074, arXiv.org.
    4. Lizhen Wu & Chun Kong & Xiaohong Hao & Wei Chen, 2020. "A Short-Term Load Forecasting Method Based on GRU-CNN Hybrid Neural Network Model," Mathematical Problems in Engineering, Hindawi, vol. 2020, pages 1-10, March.
    5. Wan, Anping & Chang, Qing & AL-Bukhaiti, Khalil & He, Jiabo, 2023. "Short-term power load forecasting for combined heat and power using CNN-LSTM enhanced by attention mechanism," Energy, Elsevier, vol. 282(C).
    6. Deb, Chirag & Dai, Zhonghao & Schlueter, Arno, 2021. "A machine learning-based framework for cost-optimal building retrofit," Applied Energy, Elsevier, vol. 294(C).
    7. Cui, Mianshan, 2022. "District heating load prediction algorithm based on bidirectional long short-term memory network model," Energy, Elsevier, vol. 254(PA).
    8. Song, Jiancai & Zhu, Shuo & Li, Wen & Xue, Guixiang & Gao, Xiaoyu, 2025. "A novel robust heating load prediction algorithm based on hybrid residual network and temporal fusion transformer model," Energy, Elsevier, vol. 318(C).
    9. Robinson, Caleb & Dilkina, Bistra & Hubbs, Jeffrey & Zhang, Wenwen & Guhathakurta, Subhrajit & Brown, Marilyn A. & Pendyala, Ram M., 2017. "Machine learning approaches for estimating commercial building energy consumption," Applied Energy, Elsevier, vol. 208(C), pages 889-904.
    10. Duong Tran Anh & Dat Vi Thanh & Hoang Minh Le & Bang Tran Sy & Ahad Hasan Tanim & Quoc Bao Pham & Thanh Duc Dang & Son T. Mai & Nguyen Mai Dang, 2023. "Effect of Gradient Descent Optimizers and Dropout Technique on Deep Learning LSTM Performance in Rainfall-runoff Modeling," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(2), pages 639-657, January.
    11. Ziwei Mei & Zhentao Shi, 2022. "On LASSO for High Dimensional Predictive Regression," Papers 2212.07052, arXiv.org, revised Jan 2024.
    12. Huang, Yaohui & Zhao, Yuan & Wang, Zhijin & Liu, Xiufeng & Liu, Hanjing & Fu, Yonggang, 2023. "Explainable district heat load forecasting with active deep learning," Applied Energy, Elsevier, vol. 350(C).
    13. Wang, Tianyi & Ma, Minda & Zhou, Nan & Ma, Zhili, 2025. "Toward net zero: Assessing the decarbonization impact of global commercial building electrification," Applied Energy, Elsevier, vol. 383(C).
    14. Minda Ma & Shufan Zhang & Junhong Liu & Ran Yan & Weiguang Cai & Nan Zhou & Jinyue Yan, 2025. "Building floorspace and stock measurement: A review of global efforts, knowledge gaps, and research priorities," Papers 2503.05824, arXiv.org, revised May 2025.
    15. Deng, Yanqiao & Ma, Minda & Zhou, Nan & Zou, Chenchen & Ma, Zhili & Yan, Ran & Ma, Xin, 2025. "Provincial allocation of China's commercial building operational carbon toward carbon neutrality," Applied Energy, Elsevier, vol. 384(C).
    16. Huiting Zheng & Jiabin Yuan & Long Chen, 2017. "Short-Term Load Forecasting Using EMD-LSTM Neural Networks with a Xgboost Algorithm for Feature Importance Evaluation," Energies, MDPI, vol. 10(8), pages 1-20, August.
    17. Ran Yan & Nan Zhou & Minda Ma & Chao Mao, 2024. "India's residential space cooling transition: Decarbonization ambitions since the turn of millennium," Papers 2412.06360, arXiv.org, revised Mar 2025.
    18. Shi, Jiaqi & Li, Chenxi & Yan, Xiaohe, 2023. "Artificial intelligence for load forecasting: A stacking learning approach based on ensemble diversity regularization," Energy, Elsevier, vol. 262(PB).
    19. Chung, Won Hee & Gu, Yeong Hyeon & Yoo, Seong Joon, 2022. "District heater load forecasting based on machine learning and parallel CNN-LSTM attention," Energy, Elsevier, vol. 246(C).
    20. Mei, Ziwei & Shi, Zhentao, 2024. "On LASSO for high dimensional predictive regression," Journal of Econometrics, Elsevier, vol. 242(2).
    21. Zhang, Liang & Wen, Jin & Li, Yanfei & Chen, Jianli & Ye, Yunyang & Fu, Yangyang & Livingood, William, 2021. "A review of machine learning in building load prediction," Applied Energy, Elsevier, vol. 285(C).
    22. Esen, Hikmet & Inalli, Mustafa & Sengur, Abdulkadir & Esen, Mehmet, 2008. "Modeling a ground-coupled heat pump system by a support vector machine," Renewable Energy, Elsevier, vol. 33(8), pages 1814-1823.
    23. Chen, Yunxiao & Lin, Chaojing & Zhang, Yilan & Liu, Jinfu & Yu, Daren, 2024. "Day-ahead load forecast based on Conv2D-GRU_SC aimed to adapt to steep changes in load," Energy, Elsevier, vol. 302(C).
    24. Jebli, Imane & Belouadha, Fatima-Zahra & Kabbaj, Mohammed Issam & Tilioua, Amine, 2021. "Prediction of solar energy guided by pearson correlation using machine learning," Energy, Elsevier, vol. 224(C).
    25. Yan, Ran & Zhou, Nan & Ma, Minda & Mao, Chao, 2025. "India's residential space cooling transition: Decarbonization ambitions since the turn of millennium," Applied Energy, Elsevier, vol. 391(C).
    26. Verhelst J., & Van Ham G., & Saelens D., & Helsen L.,, 2017. "Model selection for continuous commissioning of HVAC-systems in office buildings: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 76(C), pages 673-686.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bergougui, Brahim, 2025. "Institutional adaptability, skill-bias technological shifts, and energy efficiency in global decarbonization pathways: Exploring the role of artificial intelligence patents," Technology in Society, Elsevier, vol. 83(C).
    2. Zhang, Huijin & Hu, Wenbo, 2025. "Unveiling the reality of carbon reduction: Is the Paris Agreement turning the world green or just painting it green?," Energy Economics, Elsevier, vol. 148(C).
    3. Yan, Ran & Zhou, Nan & Ma, Minda & Mao, Chao, 2025. "India's residential space cooling transition: Decarbonization ambitions since the turn of millennium," Applied Energy, Elsevier, vol. 391(C).
    4. Song, Jiancai & Zhu, Shuo & Li, Wen & Xue, Guixiang & Gao, Xiaoyu, 2025. "A novel robust heating load prediction algorithm based on hybrid residual network and temporal fusion transformer model," Energy, Elsevier, vol. 318(C).
    5. Guanghua Zheng & Yifan He & Zhaohan Lu & Yuping Wu, 2025. "Research on Spatial and Temporal Divergence and Influencing Factors of the Coal Industry Transformation and Development Under Energy Security and Dual-Carbon Target," Sustainability, MDPI, vol. 17(6), pages 1-28, March.
    6. Cui, Xuyang & Zhu, Junda & Jia, Lifu & Wang, Jiahui & Wu, Yusen, 2024. "A novel heat load prediction model of district heating system based on hybrid whale optimization algorithm (WOA) and CNN-LSTM with attention mechanism," Energy, Elsevier, vol. 312(C).
    7. Hong Yuan & Minda Ma & Nan Zhou & Zhili Ma & Chunbo Zhang, 2025. "Assessing provincial carbon budgets for residential buildings to advance net-zero ambitions," Papers 2503.05795, arXiv.org, revised Oct 2025.
    8. Farid Moazzen & M. J. Hossain, 2024. "Multivariate Deep Learning Long Short-Term Memory-Based Forecasting for Microgrid Energy Management Systems," Energies, MDPI, vol. 17(17), pages 1-16, August.
    9. Wu, Yihui & Zha, Donglan & Cao, Yang & Yang, Yuting & Tiong, Robert Lee Kong, 2025. "Analyzing the city-level carbon peaking in China's residential building sector with explainable machine learning," Energy, Elsevier, vol. 332(C).
    10. Tan, Quanwei & Zhu, Jiebei & Xue, Guijun & Xie, Wenju, 2025. "A hybrid heat load forecasting model based on multistage decomposition and dynamic adaptive loss function," Energy, Elsevier, vol. 335(C).
    11. Gao, Jiti & Peng, Bin & Yan, Yayi, 2025. "Time-varying vector error-correction models: Estimation and inference," Journal of Econometrics, Elsevier, vol. 251(C).
    12. Yu, Solui & Hur, Jin, 2025. "An enhanced critical operating constraint forecasting (COCF) for power grids with large scale wind generating resources," Energy, Elsevier, vol. 331(C).
    13. Ling, Jihong & Zhang, Bingyang & Dai, Na & Xing, Jincheng, 2023. "Coupling input feature construction methods and machine learning algorithms for hourly secondary supply temperature prediction," Energy, Elsevier, vol. 278(C).
    14. Zhao, Xiaoyu & Duan, Pengfei & Cao, Xiaodong & Xue, Qingwen & Zhao, Bingxu & Hu, Jinxue & Zhang, Chenyang & Yuan, Xiaoyang, 2025. "A probabilistic load forecasting method for multi-energy loads based on inflection point optimization and integrated feature screening," Energy, Elsevier, vol. 327(C).
    15. Xu, Huifeng & Hu, Feihu & Liang, Xinhao & Zhao, Guoqing & Abugunmi, Mohammad, 2024. "A framework for electricity load forecasting based on attention mechanism time series depthwise separable convolutional neural network," Energy, Elsevier, vol. 299(C).
    16. Wang, Lining & Mao, Mingxuan & Xie, Jili & Liao, Zheng & Zhang, Hao & Li, Huanxin, 2023. "Accurate solar PV power prediction interval method based on frequency-domain decomposition and LSTM model," Energy, Elsevier, vol. 262(PB).
    17. Song, Jiancai & Wang, Kangning & Bian, Tianxiang & Li, Wen & Dong, Qianxing & Chen, Lei & Xue, Guixiang & Wu, Xiangdong, 2025. "A novel heat load prediction algorithm based on fuzzy C-mean clustering and mixed positional encoding informer," Applied Energy, Elsevier, vol. 388(C).
    18. Chengwang Liao & Ziwei Mei & Zhentao Shi, 2024. "Nickell Meets Stambaugh: A Tale of Two Biases in Panel Predictive Regressions," Papers 2410.09825, arXiv.org.
    19. Yu, Binbin & Li, Jianjing & Liu, Che & Sun, Bo, 2022. "A novel short-term electrical load forecasting framework with intelligent feature engineering," Applied Energy, Elsevier, vol. 327(C).
    20. Wang, Danhao & Peng, Daogang & Huang, Dongmei & Zhao, Huirong & Qu, Bogang, 2025. "MMEMformer: A multi-scale memory-enhanced transformer framework for short-term load forecasting in integrated energy systems," Energy, Elsevier, vol. 322(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.