A Simulation Environment for Training a Reinforcement Learning Agent Trading a Battery Storage

My bibliography Save this article

A Simulation Environment for Training a Reinforcement Learning Agent Trading a Battery Storage

Author

Listed:

Harri Aaltonen
(Department of Electrical Engineering and Automation, School of Electrical Engineering, Aalto University, FI-00076 Espoo, Finland)
Seppo Sierla
(Department of Electrical Engineering and Automation, School of Electrical Engineering, Aalto University, FI-00076 Espoo, Finland)
Rakshith Subramanya
(Department of Electrical Engineering and Automation, School of Electrical Engineering, Aalto University, FI-00076 Espoo, Finland)
Valeriy Vyatkin
(Department of Electrical Engineering and Automation, School of Electrical Engineering, Aalto University, FI-00076 Espoo, Finland
Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
International Research Laboratory of Computer Technologies, ITMO University, 197101 St. Petersburg, Russia)

Registered:

Abstract

Battery storages are an essential element of the emerging smart grid. Compared to other distributed intelligent energy resources, batteries have the advantage of being able to rapidly react to events such as renewable generation fluctuations or grid disturbances. There is a lack of research on ways to profitably exploit this ability. Any solution needs to consider rapid electrical phenomena as well as the much slower dynamics of relevant electricity markets. Reinforcement learning is a branch of artificial intelligence that has shown promise in optimizing complex problems involving uncertainty. This article applies reinforcement learning to the problem of trading batteries. The problem involves two timescales, both of which are important for profitability. Firstly, trading the battery capacity must occur on the timescale of the chosen electricity markets. Secondly, the real-time operation of the battery must ensure that no financial penalties are incurred from failing to meet the technical specification. The trading-related decisions must be done under uncertainties, such as unknown future market prices and unpredictable power grid disturbances. In this article, a simulation model of a battery system is proposed as the environment to train a reinforcement learning agent to make such decisions. The system is demonstrated with an application of the battery to Finnish primary frequency reserve markets.

Suggested Citation

Harri Aaltonen & Seppo Sierla & Rakshith Subramanya & Valeriy Vyatkin, 2021. "A Simulation Environment for Training a Reinforcement Learning Agent Trading a Battery Storage," Energies, MDPI, vol. 14(17), pages 1-20, September.

Handle: RePEc:gam:jeners:v:14:y:2021:i:17:p:5587-:d:630250

Download full text from publisher

References listed on IDEAS

Tsianikas, Stamatis & Yousefi, Nooshin & Zhou, Jian & Rodgers, Mark D. & Coit, David, 2021. "A storage expansion planning framework using reinforcement learning and simulation-based optimization," Applied Energy, Elsevier, vol. 290(C).
Sunyong Kim & Hyuk Lim, 2018. "Reinforcement Learning Based Energy Management Algorithm for Smart Energy Buildings," Energies, MDPI, vol. 11(8), pages 1-19, August.
Herre, Lars & Tomasini, Federica & Paridari, Kaveh & Söder, Lennart & Nordström, Lars, 2020. "Simplified model of integrated paper mill for optimal bidding in energy and reserve markets," Applied Energy, Elsevier, vol. 279(C).
Bialek, Janusz, 2020. "What does the GB power outage on 9 August 2019 tell us about the current state of decarbonised power systems?," Energy Policy, Elsevier, vol. 146(C).
Rakshith Subramanya & Matti Yli-Ojanperä & Seppo Sierla & Taneli Hölttä & Jori Valtakari & Valeriy Vyatkin, 2021. "A Virtual Power Plant Solution for Aggregating Photovoltaic Systems and Other Distributed Energy Resources for Northern European Primary Frequency Reserves," Energies, MDPI, vol. 14(5), pages 1-23, February.
Grace Muriithi & Sunetra Chowdhury, 2021. "Optimal Energy Management of a Grid-Tied Solar PV-Battery Microgrid: A Reinforcement Learning Approach," Energies, MDPI, vol. 14(9), pages 1-24, May.
Yu Sui & Shiming Song, 2020. "A Multi-Agent Reinforcement Learning Framework for Lithium-ion Battery Scheduling Problems," Energies, MDPI, vol. 13(8), pages 1-13, April.
Evgeny Nefedov & Seppo Sierla & Valeriy Vyatkin, 2018. "Internet of Energy Approach for Sustainable Use of Electric Vehicles as Energy Storage of Prosumer Buildings," Energies, MDPI, vol. 11(8), pages 1-18, August.
Jin-Gyeom Kim & Bowon Lee, 2020. "Automatic P2P Energy Trading Model Based on Reinforcement Learning Using Long Short-Term Delayed Reward," Energies, MDPI, vol. 13(20), pages 1-27, October.
Malik, Anam & Ravishankar, Jayashri, 2018. "A hybrid control approach for regulating frequency through demand response," Applied Energy, Elsevier, vol. 210(C), pages 1347-1362.
Loukatou, Angeliki & Johnson, Paul & Howell, Sydney & Duck, Peter, 2021. "Optimal valuation of wind energy projects co-located with battery storage," Applied Energy, Elsevier, vol. 283(C).
Bialek, J., 2020. "What does the power outage on 9 August 2019 tell us about GB power system," Cambridge Working Papers in Economics 2018, Faculty of Economics, University of Cambridge.
Sepúlveda-Mora, Sergio B. & Hegedus, Steven, 2021. "Making the case for time-of-use electric rates to boost the value of battery storage in commercial buildings with grid connected PV systems," Energy, Elsevier, vol. 218(C).
Denis Sidorov & Daniil Panasetsky & Nikita Tomin & Dmitriy Karamov & Aleksei Zhukov & Ildar Muftahov & Aliona Dreglea & Fang Liu & Yong Li, 2020. "Toward Zero-Emission Hybrid AC/DC Power Systems with Renewable Energy Sources and Storages: A Case Study from Lake Baikal Region," Energies, MDPI, vol. 13(5), pages 1-18, March.
Killer, Marvin & Farrokhseresht, Mana & Paterakis, Nikolaos G., 2020. "Implementation of large-scale Li-ion battery energy storage systems within the EMEA region," Applied Energy, Elsevier, vol. 260(C).
Brida V. Mbuwir & Frederik Ruelens & Fred Spiessens & Geert Deconinck, 2017. "Battery Energy Management in a Microgrid Using Batch Reinforcement Learning," Energies, MDPI, vol. 10(11), pages 1-19, November.
Pavić, Ivan & Capuder, Tomislav & Kuzle, Igor, 2016. "Low carbon technologies as providers of operational flexibility in future power systems," Applied Energy, Elsevier, vol. 168(C), pages 724-738.
Hyukjoon Lee & Dongjin Ji & Dong-Ho Cho, 2019. "Optimal Design of Wireless Charging Electric Bus System Based on Reinforcement Learning," Energies, MDPI, vol. 12(7), pages 1-20, March.
Ning Wang & Weisheng Xu & Weihui Shao & Zhiyu Xu, 2019. "A Q-Cube Framework of Reinforcement Learning Algorithm for Continuous Double Auction among Microgrids," Energies, MDPI, vol. 12(15), pages 1-26, July.
Christian Giovanelli & Seppo Sierla & Ryutaro Ichise & Valeriy Vyatkin, 2018. "Exploiting Artificial Neural Networks for the Prediction of Ancillary Energy Market Prices," Energies, MDPI, vol. 11(7), pages 1-22, July.
Xu, Bin & Shi, Junzhe & Li, Sixu & Li, Huayi & Wang, Zhe, 2021. "Energy consumption and battery aging minimization using a Q-learning strategy for a battery/ultracapacitor electric vehicle," Energy, Elsevier, vol. 229(C).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Harri Aaltonen & Seppo Sierla & Ville Kyrki & Mahdi Pourakbari-Kasmaei & Valeriy Vyatkin, 2022. "Bidding a Battery on Electricity Markets and Minimizing Battery Aging Costs: A Reinforcement Learning Approach," Energies, MDPI, vol. 15(14), pages 1-19, July.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Harri Aaltonen & Seppo Sierla & Ville Kyrki & Mahdi Pourakbari-Kasmaei & Valeriy Vyatkin, 2022. "Bidding a Battery on Electricity Markets and Minimizing Battery Aging Costs: A Reinforcement Learning Approach," Energies, MDPI, vol. 15(14), pages 1-19, July.
Niko Karhula & Seppo Sierla & Valeriy Vyatkin, 2021. "Validating the Real-Time Performance of Distributed Energy Resources Participating on Primary Frequency Reserves," Energies, MDPI, vol. 14(21), pages 1-19, October.
Li, Kun & Wei, Lishen & Fang, Jiakun & Ai, Xiaomeng & Cui, Shichang & Zhu, Mengshu & Wen, Jinyu, 2024. "Incentive-compatible primary frequency response ancillary service market mechanism for incorporating diverse frequency support resources," Energy, Elsevier, vol. 306(C).
Zhu, Ziqing & Hu, Ze & Chan, Ka Wing & Bu, Siqi & Zhou, Bin & Xia, Shiwei, 2023. "Reinforcement learning in deregulated energy market: A comprehensive review," Applied Energy, Elsevier, vol. 329(C).
Alexander N. Kozlov & Nikita V. Tomin & Denis N. Sidorov & Electo E. S. Lora & Victor G. Kurbatsky, 2020. "Optimal Operation Control of PV-Biomass Gasifier-Diesel-Hybrid Systems Using Reinforcement Learning Techniques," Energies, MDPI, vol. 13(10), pages 1-20, May.
Bio Gassi, Karim & Baysal, Mustafa, 2023. "Improving real-time energy decision-making model with an actor-critic agent in modern microgrids with energy storage devices," Energy, Elsevier, vol. 263(PE).
Armin Razmjoo & Arezoo Ghazanfari & Poul Alberg Østergaard & Mehdi Jahangiri & Andreas Sumper & Sahar Ahmadzadeh & Reza Eslamipoor, 2024. "Moving Toward the Expansion of Energy Storage Systems in Renewable Energy Systems—A Techno-Institutional Investigation with Artificial Intelligence Consideration," Sustainability, MDPI, vol. 16(22), pages 1-25, November.
Grace Muriithi & Sunetra Chowdhury, 2021. "Optimal Energy Management of a Grid-Tied Solar PV-Battery Microgrid: A Reinforcement Learning Approach," Energies, MDPI, vol. 14(9), pages 1-24, May.
Lilia Tightiz & Joon Yoo, 2022. "A Review on a Data-Driven Microgrid Management System Integrating an Active Distribution Network: Challenges, Issues, and New Trends," Energies, MDPI, vol. 15(22), pages 1-24, November.
Badesa, Luis & Matamala, Carlos & Strbac, Goran, 2025. "Who should pay for frequency-containment ancillary services? Making responsible units bear the cost to shape investment in generation and loads," Energy Policy, Elsevier, vol. 196(C).
Ritu Kandari & Neeraj Neeraj & Alexander Micallef, 2022. "Review on Recent Strategies for Integrating Energy Storage Systems in Microgrids," Energies, MDPI, vol. 16(1), pages 1-24, December.
Stelios C. Dimoulias & Eleftherios O. Kontis & Grigoris K. Papagiannis, 2022. "Inertia Estimation of Synchronous Devices: Review of Available Techniques and Comparative Assessment of Conventional Measurement-Based Approaches," Energies, MDPI, vol. 15(20), pages 1-30, October.
Khawaja Haider Ali & Marvin Sigalo & Saptarshi Das & Enrico Anderlini & Asif Ali Tahir & Mohammad Abusara, 2021. "Reinforcement Learning for Energy-Storage Systems in Grid-Connected Microgrids: An Investigation of Online vs. Offline Implementation," Energies, MDPI, vol. 14(18), pages 1-18, September.
Van-Hai Bui & Akhtar Hussain & Hak-Man Kim, 2019. "Q-Learning-Based Operation Strategy for Community Battery Energy Storage System (CBESS) in Microgrid System," Energies, MDPI, vol. 12(9), pages 1-17, May.
Ziqian Zhang & Carina Lehmal & Philipp Hackl & Robert Schuerhuber, 2022. "Transient Stability Analysis and Post-Fault Restart Strategy for Current-Limited Grid-Forming Converter," Energies, MDPI, vol. 15(10), pages 1-26, May.
Xiao, Yunpeng & Zhu, Yuerong & Qu, Ying & Xie, Haipeng & Wang, Xiuli & Wang, Xifan, 2025. "A market for power system resilience provision," Applied Energy, Elsevier, vol. 382(C).
Hu, Chenxi & Zhang, Jun & Yuan, Hongxia & Gao, Tianlu & Jiang, Huaiguang & Yan, Jing & Wenzhong Gao, David & Wang, Fei-Yue, 2022. "Black swan event small-sample transfer learning (BEST-L) and its case study on electrical power prediction in COVID-19," Applied Energy, Elsevier, vol. 309(C).
Bomela, Walter & Zlotnik, Anatoly & Li, Jr-Shin, 2018. "A phase model approach for thermostatically controlled load demand response," Applied Energy, Elsevier, vol. 228(C), pages 667-680.
Khawaja Haider Ali & Mohammad Abusara & Asif Ali Tahir & Saptarshi Das, 2023. "Dual-Layer Q-Learning Strategy for Energy Management of Battery Storage in Grid-Connected Microgrids," Energies, MDPI, vol. 16(3), pages 1-17, January.
Davi-Arderius, Daniel & Jamasb, Tooraj & Rosellon, Juan, 2025. "Network Operation Constraints on the Path to Net Zero," Applied Energy, Elsevier, vol. 382(C).

More about this item

Keywords

; ; ; ; ; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:14:y:2021:i:17:p:5587-:d:630250. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Simulation Environment for Training a Reinforcement Learning Agent Trading a Battery Storage

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data