Author
Listed:
- Zhu, Junda
- Yang, Junhong
- Peng, Mengbo
- Liang, Xinyue
- Ben, Chaoran
Abstract
Heat load prediction in district heating systems (DHS) presents challenges such as large-scale systems, complex dynamic characteristics with significant time delays, and the inherent randomness and uncertainty in end-user demand. Traditional models often fail to capture the intricate patterns in historical data. In contrast, deep learning models can identify complex nonlinear relationships within extensive historical data, facilitating more accurate predictions. The optimizer fine-tunes model parameters to minimize the loss function, thereby supporting effective learning and generalization, while the scheduler dynamically adjusts hyperparameters (e.g., learning rate) during training to ensure both stability and fast convergence. This study develops seven heat load prediction models based on CNN, LSTM, and KAN architectures and investigates the performance of different combinations of three optimizers (SGD, Adam, AdamW) and three learning rate schedulers (ExponentialLR, StepLR, CosineAnnealingLR). The results indicate that Adam and AdamW outperform SGD, with SGD suffering from underfitting. Compared to SGD, the models under the Adam and AdamW frameworks exhibit significantly lower mean MAE, RMSE, and MAPE. Specifically, for Adam and AdamW, the mean MAE values are 0.9668 and 1.1746, the mean RMSE values are 1.4053 and 1.6154, and the mean MAPE values are 0.0047 and 0.0057, respectively. CosineAnnealingLR performs exceptionally well by smoothly adjusting the learning rate, enabling the model to effectively capture both periodic and non-stationary features, leading to stable and precise predictions. The CNN-KAN model stands out, demonstrating remarkable stability and robustness, with R2 values greater than 0.95 and 0.97 under the Adam and AdamW frameworks, respectively. Therefore, in the DHS heat load forecasting, the combination of Adam/AdamW and CosineAnnealingLR enables the model to maintain high-precision predictions, while the CNN-KAN architecture demonstrates stable and effective performance on complex heat load forecasting tasks. This study provides actionable guidance for the selection of optimizers and learning-rate schedulers in DHS heat-load forecasting.
Suggested Citation
Zhu, Junda & Yang, Junhong & Peng, Mengbo & Liang, Xinyue & Ben, Chaoran, 2025.
"Deep learning-driven heat load prediction: Investigating the impacts of optimizer and learning rate scheduler strategies,"
Energy, Elsevier, vol. 340(C).
Handle:
RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376
DOI: 10.1016/j.energy.2025.139195
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.