IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v340y2025ics0360544225048376.html

Deep learning-driven heat load prediction: Investigating the impacts of optimizer and learning rate scheduler strategies

Author

Listed:
  • Zhu, Junda
  • Yang, Junhong
  • Peng, Mengbo
  • Liang, Xinyue
  • Ben, Chaoran

Abstract

Heat load prediction in district heating systems (DHS) presents challenges such as large-scale systems, complex dynamic characteristics with significant time delays, and the inherent randomness and uncertainty in end-user demand. Traditional models often fail to capture the intricate patterns in historical data. In contrast, deep learning models can identify complex nonlinear relationships within extensive historical data, facilitating more accurate predictions. The optimizer fine-tunes model parameters to minimize the loss function, thereby supporting effective learning and generalization, while the scheduler dynamically adjusts hyperparameters (e.g., learning rate) during training to ensure both stability and fast convergence. This study develops seven heat load prediction models based on CNN, LSTM, and KAN architectures and investigates the performance of different combinations of three optimizers (SGD, Adam, AdamW) and three learning rate schedulers (ExponentialLR, StepLR, CosineAnnealingLR). The results indicate that Adam and AdamW outperform SGD, with SGD suffering from underfitting. Compared to SGD, the models under the Adam and AdamW frameworks exhibit significantly lower mean MAE, RMSE, and MAPE. Specifically, for Adam and AdamW, the mean MAE values are 0.9668 and 1.1746, the mean RMSE values are 1.4053 and 1.6154, and the mean MAPE values are 0.0047 and 0.0057, respectively. CosineAnnealingLR performs exceptionally well by smoothly adjusting the learning rate, enabling the model to effectively capture both periodic and non-stationary features, leading to stable and precise predictions. The CNN-KAN model stands out, demonstrating remarkable stability and robustness, with R2 values greater than 0.95 and 0.97 under the Adam and AdamW frameworks, respectively. Therefore, in the DHS heat load forecasting, the combination of Adam/AdamW and CosineAnnealingLR enables the model to maintain high-precision predictions, while the CNN-KAN architecture demonstrates stable and effective performance on complex heat load forecasting tasks. This study provides actionable guidance for the selection of optimizers and learning-rate schedulers in DHS heat-load forecasting.

Suggested Citation

  • Zhu, Junda & Yang, Junhong & Peng, Mengbo & Liang, Xinyue & Ben, Chaoran, 2025. "Deep learning-driven heat load prediction: Investigating the impacts of optimizer and learning rate scheduler strategies," Energy, Elsevier, vol. 340(C).
  • Handle: RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376
    DOI: 10.1016/j.energy.2025.139195
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544225048376
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2025.139195?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:340:y:2025:i:c:s0360544225048376. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.