IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v384y2025ics0306261925001503.html
   My bibliography  Save this article

Improving the learning process of deep reinforcement learning agents operating in collective heating environments

Author

Listed:
  • Jacobs, Stef
  • Ghane, Sara
  • Houben, Pieter Jan
  • Kabbara, Zakarya
  • Huybrechts, Thomas
  • Hellinckx, Peter
  • Verhaert, Ivan

Abstract

Deep reinforcement learning (DRL) can be used to optimise the performance of Collective Heating Systems (CHS) by reducing operational costs while ensuring thermal comfort. However, heating systems often exhibit slow responsiveness to control inputs due to thermal inertia, which delays the effects of actions such as adapting temperature set points. This delayed feedback complicates the learning process for DRL agents, as it becomes more difficult to associate specific control actions with their outcomes. To address this challenge, this study evaluates four hyperparameter schemes during training. The focus lies on schemes with varying learning rate (the rate at which weights in neural networks are adapted) and/or discount factor (the importance the DRL agent attaches to future rewards). In this respect, we introduce the GALER approach, which combines the progressive increase of the discount factor with the reduction of the learning rate throughout the training process. The effectiveness of the four learning schemes is evaluated using the actor-critic Proximal Policy Optimization (PPO) algorithm for three types of CHS with a multi-objective reward function balancing thermal comfort and energy use or operational costs. The results demonstrate that energy-based reward functions allow for limited optimisation possibilities, while the GALER scheme yields the highest potential for price-based optimisation across all considered concepts. It achieved a 3%–15% performance improvement over other successful training schemes. DRL agents trained with GALER schemes strategically anticipate on high-price times by lowering the supply temperature and vice versa. This research highlights the advantage of varying both learning rates and discount factors when training DRL agents to operate in complex multi-objective environments with slow responsiveness.

Suggested Citation

  • Jacobs, Stef & Ghane, Sara & Houben, Pieter Jan & Kabbara, Zakarya & Huybrechts, Thomas & Hellinckx, Peter & Verhaert, Ivan, 2025. "Improving the learning process of deep reinforcement learning agents operating in collective heating environments," Applied Energy, Elsevier, vol. 384(C).
  • Handle: RePEc:eee:appene:v:384:y:2025:i:c:s0306261925001503
    DOI: 10.1016/j.apenergy.2025.125420
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925001503
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.125420?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stef Jacobs & Margot De Pauw & Senne Van Minnebruggen & Sara Ghane & Thomas Huybrechts & Peter Hellinckx & Ivan Verhaert, 2023. "Grouped Charging of Decentralised Storage to Efficiently Control Collective Heating Systems: Limitations and Opportunities," Energies, MDPI, vol. 16(8), pages 1-28, April.
    2. Lund, H. & Möller, B. & Mathiesen, B.V. & Dyrelund, A., 2010. "The role of district heating in future renewable energy systems," Energy, Elsevier, vol. 35(3), pages 1381-1390.
    3. Lund, Henrik & Østergaard, Poul Alberg & Chang, Miguel & Werner, Sven & Svendsen, Svend & Sorknæs, Peter & Thorsen, Jan Eric & Hvelplund, Frede & Mortensen, Bent Ole Gram & Mathiesen, Brian Vad & Boje, 2018. "The status of 4th generation district heating: Research and results," Energy, Elsevier, vol. 164(C), pages 147-159.
    4. Xie, Jiahan & Ajagekar, Akshay & You, Fengqi, 2023. "Multi-Agent attention-based deep reinforcement learning for demand response in grid-responsive buildings," Applied Energy, Elsevier, vol. 342(C).
    5. Jansen, Jelger & Jorissen, Filip & Helsen, Lieve, 2024. "Mixed-integer non-linear model predictive control of district heating networks," Applied Energy, Elsevier, vol. 361(C).
    6. Pinto, Giuseppe & Piscitelli, Marco Savino & Vázquez-Canteli, José Ramón & Nagy, Zoltán & Capozzoli, Alfonso, 2021. "Coordinated energy management for a cluster of buildings through deep reinforcement learning," Energy, Elsevier, vol. 229(C).
    7. Meireles, I. & Sousa, V. & Bleys, B. & Poncelet, B., 2022. "Domestic hot water consumption pattern: Relation with total water consumption and air temperature," Renewable and Sustainable Energy Reviews, Elsevier, vol. 157(C).
    8. Shaikh, Pervez Hameed & Nor, Nursyarizal Bin Mohd & Nallagownden, Perumal & Elamvazuthi, Irraivan & Ibrahim, Taib, 2014. "A review on optimized control systems for building energy and comfort management of smart sustainable buildings," Renewable and Sustainable Energy Reviews, Elsevier, vol. 34(C), pages 409-429.
    9. Heidari, Amirreza & Maréchal, François & Khovalyg, Dolaana, 2022. "An occupant-centric control framework for balancing comfort, energy use and hygiene in hot water systems: A model-free reinforcement learning approach," Applied Energy, Elsevier, vol. 312(C).
    10. Zhong, Wei & Chen, Jiaying & Zhou, Yi & Li, Zhongbo & Lin, Xiaojie, 2019. "Network flexibility study of urban centralized heating system: Concept, modeling and evaluation," Energy, Elsevier, vol. 177(C), pages 334-346.
    11. Fuentes, E. & Arce, L. & Salom, J., 2018. "A review of domestic hot water consumption profiles for application in systems and buildings energy performance analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1530-1547.
    12. Arroyo, Javier & Manna, Carlo & Spiessens, Fred & Helsen, Lieve, 2022. "Reinforced model predictive control (RL-MPC) for building energy management," Applied Energy, Elsevier, vol. 309(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jing, Mengke & Zhang, Shujie & Fu, Lisong & Cao, Guoquan & Wang, Rui, 2023. "Reducing heat losses from aging district heating pipes by using cured-in-place pipe liners," Energy, Elsevier, vol. 273(C).
    2. Nweye, Kingsley & Sankaranarayanan, Siva & Nagy, Zoltan, 2023. "MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities," Applied Energy, Elsevier, vol. 346(C).
    3. Schaffer, Markus & Vera-Valdés, J. Eduardo & Marszal-Pomianowska, Anna, 2024. "Exploring smart heat meter data: A co-clustering driven approach to analyse the energy use of single-family houses," Applied Energy, Elsevier, vol. 371(C).
    4. Golmohamadi, Hessam & Larsen, Kim Guldstrand & Jensen, Peter Gjøl & Hasrat, Imran Riaz, 2022. "Integration of flexibility potentials of district heating systems into electricity markets: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 159(C).
    5. Hall, Rebecca & Kenway, Steven & O'Brien, Katherine & Memon, Fayyaz, 2025. "Quantification of residential water-related energy needs cohesion, validation and global representation to unlock efficiency gains," Renewable and Sustainable Energy Reviews, Elsevier, vol. 207(C).
    6. Tahiri, Abdelkarim & Smith, Kevin Michael & Thorsen, Jan Eric & Hviid, Christian Anker & Svendsen, Svend, 2023. "Staged control of domestic hot water storage tanks to support district heating efficiency," Energy, Elsevier, vol. 263(PB).
    7. Nis Bertelsen & Brian Vad Mathiesen, 2020. "EU-28 Residential Heat Supply and Consumption: Historical Development and Status," Energies, MDPI, vol. 13(8), pages 1-21, April.
    8. Toffanin, Riccardo & Curti, Vinicio & Barbato, Maurizio C., 2021. "Impact of Legionella regulation on a 4th generation district heating substation energy use and cost: the case of a Swiss single-family household," Energy, Elsevier, vol. 228(C).
    9. Fester, Jakob & Østergaard, Peter Friis & Bentsen, Fredrik & Nielsen, Brian Kongsgaard, 2023. "A data-driven method for heat loss estimation from district heating service pipes using heat meter- and GIS data," Energy, Elsevier, vol. 277(C).
    10. Stef Jacobs & Margot De Pauw & Senne Van Minnebruggen & Sara Ghane & Thomas Huybrechts & Peter Hellinckx & Ivan Verhaert, 2023. "Grouped Charging of Decentralised Storage to Efficiently Control Collective Heating Systems: Limitations and Opportunities," Energies, MDPI, vol. 16(8), pages 1-28, April.
    11. Lyons, Ben & O’Dwyer, Edward & Shah, Nilay, 2020. "Model reduction for Model Predictive Control of district and communal heating systems within cooperative energy systems," Energy, Elsevier, vol. 197(C).
    12. Li, Haoran & Hou, Juan & Hong, Tianzhen & Ding, Yuemin & Nord, Natasa, 2021. "Energy, economic, and environmental analysis of integration of thermal energy storage into district heating systems using waste heat from data centres," Energy, Elsevier, vol. 219(C).
    13. Michael Mans & Tobias Blacha & Thomas Schreiber & Dirk Müller, 2022. "Development and Application of an Open-Source Framework for Automated Thermal Network Generation and Simulations in Modelica," Energies, MDPI, vol. 15(12), pages 1-25, June.
    14. Meibodi, Saleh S. & Loveridge, Fleur, 2022. "The future role of energy geostructures in fifth generation district heating and cooling networks," Energy, Elsevier, vol. 240(C).
    15. Fischer, David & Madani, Hatef, 2017. "On heat pumps in smart grids: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 70(C), pages 342-357.
    16. Moreno, Diana & Nielsen, Steffen & Sorknæs, Peter & Lund, Henrik & Thellufsen, Jakob Zinck & Mathiesen, Brian Vad, 2024. "Exploring the location and use of baseload district heating supply. What can current heat sources tell us about future opportunities?," Energy, Elsevier, vol. 288(C).
    17. Østergaard, P.A. & Lund, H. & Thellufsen, J.Z. & Sorknæs, P. & Mathiesen, B.V., 2022. "Review and validation of EnergyPLAN," Renewable and Sustainable Energy Reviews, Elsevier, vol. 168(C).
    18. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    19. Wirtz, Marco & Kivilip, Lukas & Remmen, Peter & Müller, Dirk, 2020. "5th Generation District Heating: A novel design approach based on mathematical optimization," Applied Energy, Elsevier, vol. 260(C).
    20. Mitridati, Lesia & Kazempour, Jalal & Pinson, Pierre, 2021. "Design and game-Theoretic analysis of community-Based market mechanisms in heat and electricity systems," Omega, Elsevier, vol. 99(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:384:y:2025:i:c:s0306261925001503. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.