IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v384y2025ics0306261925001503.html
   My bibliography  Save this article

Improving the learning process of deep reinforcement learning agents operating in collective heating environments

Author

Listed:
  • Jacobs, Stef
  • Ghane, Sara
  • Houben, Pieter Jan
  • Kabbara, Zakarya
  • Huybrechts, Thomas
  • Hellinckx, Peter
  • Verhaert, Ivan

Abstract

Deep reinforcement learning (DRL) can be used to optimise the performance of Collective Heating Systems (CHS) by reducing operational costs while ensuring thermal comfort. However, heating systems often exhibit slow responsiveness to control inputs due to thermal inertia, which delays the effects of actions such as adapting temperature set points. This delayed feedback complicates the learning process for DRL agents, as it becomes more difficult to associate specific control actions with their outcomes. To address this challenge, this study evaluates four hyperparameter schemes during training. The focus lies on schemes with varying learning rate (the rate at which weights in neural networks are adapted) and/or discount factor (the importance the DRL agent attaches to future rewards). In this respect, we introduce the GALER approach, which combines the progressive increase of the discount factor with the reduction of the learning rate throughout the training process. The effectiveness of the four learning schemes is evaluated using the actor-critic Proximal Policy Optimization (PPO) algorithm for three types of CHS with a multi-objective reward function balancing thermal comfort and energy use or operational costs. The results demonstrate that energy-based reward functions allow for limited optimisation possibilities, while the GALER scheme yields the highest potential for price-based optimisation across all considered concepts. It achieved a 3%–15% performance improvement over other successful training schemes. DRL agents trained with GALER schemes strategically anticipate on high-price times by lowering the supply temperature and vice versa. This research highlights the advantage of varying both learning rates and discount factors when training DRL agents to operate in complex multi-objective environments with slow responsiveness.

Suggested Citation

  • Jacobs, Stef & Ghane, Sara & Houben, Pieter Jan & Kabbara, Zakarya & Huybrechts, Thomas & Hellinckx, Peter & Verhaert, Ivan, 2025. "Improving the learning process of deep reinforcement learning agents operating in collective heating environments," Applied Energy, Elsevier, vol. 384(C).
  • Handle: RePEc:eee:appene:v:384:y:2025:i:c:s0306261925001503
    DOI: 10.1016/j.apenergy.2025.125420
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925001503
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.125420?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Lund, Henrik & Østergaard, Poul Alberg & Chang, Miguel & Werner, Sven & Svendsen, Svend & Sorknæs, Peter & Thorsen, Jan Eric & Hvelplund, Frede & Mortensen, Bent Ole Gram & Mathiesen, Brian Vad & Boje, 2018. "The status of 4th generation district heating: Research and results," Energy, Elsevier, vol. 164(C), pages 147-159.
    2. Fuentes, E. & Arce, L. & Salom, J., 2018. "A review of domestic hot water consumption profiles for application in systems and buildings energy performance analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1530-1547.
    3. Jansen, Jelger & Jorissen, Filip & Helsen, Lieve, 2024. "Mixed-integer non-linear model predictive control of district heating networks," Applied Energy, Elsevier, vol. 361(C).
    4. Pinto, Giuseppe & Piscitelli, Marco Savino & Vázquez-Canteli, José Ramón & Nagy, Zoltán & Capozzoli, Alfonso, 2021. "Coordinated energy management for a cluster of buildings through deep reinforcement learning," Energy, Elsevier, vol. 229(C).
    5. Meireles, I. & Sousa, V. & Bleys, B. & Poncelet, B., 2022. "Domestic hot water consumption pattern: Relation with total water consumption and air temperature," Renewable and Sustainable Energy Reviews, Elsevier, vol. 157(C).
    6. Shaikh, Pervez Hameed & Nor, Nursyarizal Bin Mohd & Nallagownden, Perumal & Elamvazuthi, Irraivan & Ibrahim, Taib, 2014. "A review on optimized control systems for building energy and comfort management of smart sustainable buildings," Renewable and Sustainable Energy Reviews, Elsevier, vol. 34(C), pages 409-429.
    7. Arroyo, Javier & Manna, Carlo & Spiessens, Fred & Helsen, Lieve, 2022. "Reinforced model predictive control (RL-MPC) for building energy management," Applied Energy, Elsevier, vol. 309(C).
    8. Xie, Jiahan & Ajagekar, Akshay & You, Fengqi, 2023. "Multi-Agent attention-based deep reinforcement learning for demand response in grid-responsive buildings," Applied Energy, Elsevier, vol. 342(C).
    9. Heidari, Amirreza & Maréchal, François & Khovalyg, Dolaana, 2022. "An occupant-centric control framework for balancing comfort, energy use and hygiene in hot water systems: A model-free reinforcement learning approach," Applied Energy, Elsevier, vol. 312(C).
    10. Stef Jacobs & Margot De Pauw & Senne Van Minnebruggen & Sara Ghane & Thomas Huybrechts & Peter Hellinckx & Ivan Verhaert, 2023. "Grouped Charging of Decentralised Storage to Efficiently Control Collective Heating Systems: Limitations and Opportunities," Energies, MDPI, vol. 16(8), pages 1-28, April.
    11. Lund, H. & Möller, B. & Mathiesen, B.V. & Dyrelund, A., 2010. "The role of district heating in future renewable energy systems," Energy, Elsevier, vol. 35(3), pages 1381-1390.
    12. Zhong, Wei & Chen, Jiaying & Zhou, Yi & Li, Zhongbo & Lin, Xiaojie, 2019. "Network flexibility study of urban centralized heating system: Concept, modeling and evaluation," Energy, Elsevier, vol. 177(C), pages 334-346.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Østergaard, P.A. & Lund, H. & Thellufsen, J.Z. & Sorknæs, P. & Mathiesen, B.V., 2022. "Review and validation of EnergyPLAN," Renewable and Sustainable Energy Reviews, Elsevier, vol. 168(C).
    2. Jing, Mengke & Zhang, Shujie & Fu, Lisong & Cao, Guoquan & Wang, Rui, 2023. "Reducing heat losses from aging district heating pipes by using cured-in-place pipe liners," Energy, Elsevier, vol. 273(C).
    3. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    4. Nweye, Kingsley & Sankaranarayanan, Siva & Nagy, Zoltan, 2023. "MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities," Applied Energy, Elsevier, vol. 346(C).
    5. Schaffer, Markus & Vera-Valdés, J. Eduardo & Marszal-Pomianowska, Anna, 2024. "Exploring smart heat meter data: A co-clustering driven approach to analyse the energy use of single-family houses," Applied Energy, Elsevier, vol. 371(C).
    6. Wirtz, Marco & Kivilip, Lukas & Remmen, Peter & Müller, Dirk, 2020. "5th Generation District Heating: A novel design approach based on mathematical optimization," Applied Energy, Elsevier, vol. 260(C).
    7. Mitridati, Lesia & Kazempour, Jalal & Pinson, Pierre, 2021. "Design and game-Theoretic analysis of community-Based market mechanisms in heat and electricity systems," Omega, Elsevier, vol. 99(C).
    8. Golmohamadi, Hessam & Larsen, Kim Guldstrand & Jensen, Peter Gjøl & Hasrat, Imran Riaz, 2022. "Integration of flexibility potentials of district heating systems into electricity markets: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 159(C).
    9. Hall, Rebecca & Kenway, Steven & O'Brien, Katherine & Memon, Fayyaz, 2025. "Quantification of residential water-related energy needs cohesion, validation and global representation to unlock efficiency gains," Renewable and Sustainable Energy Reviews, Elsevier, vol. 207(C).
    10. Tahiri, Abdelkarim & Smith, Kevin Michael & Thorsen, Jan Eric & Hviid, Christian Anker & Svendsen, Svend, 2023. "Staged control of domestic hot water storage tanks to support district heating efficiency," Energy, Elsevier, vol. 263(PB).
    11. Atienza-Márquez, Antonio & Domínguez-Muñoz, Fernando & Fernández Hernández, Francisco & Cejudo López, José Manuel, 2024. "Solar thermal hot water system in hospitals: Robust design methodology considering uncertainties," Renewable Energy, Elsevier, vol. 234(C).
    12. Moser, Simon & Puschnigg, Stefan & Rodin, Valerie, 2020. "Designing the Heat Merit Order to determine the value of industrial waste heat for district heating systems," Energy, Elsevier, vol. 200(C).
    13. Chen, Wei-Han & You, Fengqi, 2024. "Sustainable energy management and control for Decarbonization of complex multi-zone buildings with renewable solar and geothermal energies using machine learning, robust optimization, and predictive c," Applied Energy, Elsevier, vol. 372(C).
    14. Petrović, Stefan & Bühler, Fabian & Radoman, Uroš & McKenna, Russell, 2022. "Power transformers as excess heat sources – a case study for Denmark," Energy, Elsevier, vol. 239(PE).
    15. Nis Bertelsen & Brian Vad Mathiesen, 2020. "EU-28 Residential Heat Supply and Consumption: Historical Development and Status," Energies, MDPI, vol. 13(8), pages 1-21, April.
    16. Toffanin, Riccardo & Curti, Vinicio & Barbato, Maurizio C., 2021. "Impact of Legionella regulation on a 4th generation district heating substation energy use and cost: the case of a Swiss single-family household," Energy, Elsevier, vol. 228(C).
    17. Ayas Shaqour & Aya Hagishima, 2022. "Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types," Energies, MDPI, vol. 15(22), pages 1-27, November.
    18. Calikus, Ece & Nowaczyk, Sławomir & Sant'Anna, Anita & Gadd, Henrik & Werner, Sven, 2019. "A data-driven approach for discovering heat load patterns in district heating," Applied Energy, Elsevier, vol. 252(C), pages 1-1.
    19. Fester, Jakob & Østergaard, Peter Friis & Bentsen, Fredrik & Nielsen, Brian Kongsgaard, 2023. "A data-driven method for heat loss estimation from district heating service pipes using heat meter- and GIS data," Energy, Elsevier, vol. 277(C).
    20. Han, Gwangwoo & Joo, Hong-Jin & Lim, Hee-Won & An, Young-Sub & Lee, Wang-Je & Lee, Kyoung-Ho, 2023. "Data-driven heat pump operation strategy using rainbow deep reinforcement learning for significant reduction of electricity cost," Energy, Elsevier, vol. 270(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:384:y:2025:i:c:s0306261925001503. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.