An Application of Deep Reinforcement Learning to Algorithmic Trading

My bibliography Save this paper

An Application of Deep Reinforcement Learning to Algorithmic Trading

Author

Listed:

Thibaut Th'eate
Damien Ernst

Registered:

Abstract

This scientific research paper presents an innovative approach based on deep reinforcement learning (DRL) to solve the algorithmic trading problem of determining the optimal trading position at any point in time during a trading activity in stock markets. It proposes a novel DRL trading strategy so as to maximise the resulting Sharpe ratio performance indicator on a broad range of stock markets. Denominated the Trading Deep Q-Network algorithm (TDQN), this new trading strategy is inspired from the popular DQN algorithm and significantly adapted to the specific algorithmic trading problem at hand. The training of the resulting reinforcement learning (RL) agent is entirely based on the generation of artificial trajectories from a limited set of stock market historical data. In order to objectively assess the performance of trading strategies, the research paper also proposes a novel, more rigorous performance assessment methodology. Following this new performance assessment approach, promising results are reported for the TDQN strategy.

Suggested Citation

Thibaut Th'eate & Damien Ernst, 2020. "An Application of Deep Reinforcement Learning to Algorithmic Trading," Papers 2004.06627, arXiv.org, revised Oct 2020.

Handle: RePEc:arx:papers:2004.06627

Download full text from publisher

References listed on IDEAS

John P A Ioannidis, 2005. "Why Most Published Research Findings Are False," PLOS Medicine, Public Library of Science, vol. 2(8), pages 1-1, August.
Terrence Hendershott & Charles M. Jones & Albert J. Menkveld, 2011. "Does Algorithmic Trading Improve Liquidity?," Journal of Finance, American Finance Association, vol. 66(1), pages 1-33, February.
- Hendershott, Terrence & Jones, Charles M. & Menkveld, Albert J., 2008. "Does algorithmic trading improve liquidity?," CFS Working Paper Series 2008/41, Center for Financial Studies (CFS).
Wei Bao & Jun Yue & Yulei Rao, 2017. "A deep learning framework for financial time series using stacked autoencoders and long-short term memory," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-24, July.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Thibaut Théate & Sébastien Mathieu & Damien Ernst, 2020. "An Artificial Intelligence Solution for Electricity Procurement in Forward Markets," Energies, MDPI, vol. 13(23), pages 1-17, December.
Kumar, S. Senthil & Srinivasan, C. & Balavignesh, S., 2025. "Enhancing grid integration of renewable energy sources for micro grid stability using forecasting and optimal dispatch strategies," Energy, Elsevier, vol. 322(C).
Antonio Briola & Jeremy Turiel & Riccardo Marcaccioli & Alvaro Cauderan & Tomaso Aste, 2021. "Deep Reinforcement Learning for Active High Frequency Trading," Papers 2101.07107, arXiv.org, revised Aug 2023.
Peng Chen & Dongyun Yi & Chengli Zhao, 2020. "Trading Strategy for Market Situation Estimation Based on Hidden Markov Model," Mathematics, MDPI, vol. 8(7), pages 1-13, July.
Tatiana de Macedo Nogueira Lima, 2022. "Documento de Trabalho 03/2022 - Aprendizado de máquina e antitruste," Documentos de Trabalho 2022030, Conselho Administrativo de Defesa Econômica (Cade), Departamento de Estudos Econômicos.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay & Jamal Atif, 2020. "AAMDRL: Augmented Asset Management with Deep Reinforcement Learning," Papers 2010.08497, arXiv.org.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay, 2020. "Bridging the gap between Markowitz planning and deep reinforcement learning," Papers 2010.09108, arXiv.org.
Thibaut Th'eate & S'ebastien Mathieu & Damien Ernst, 2020. "An Artificial Intelligence Solution for Electricity Procurement in Forward Markets," Papers 2006.05784, arXiv.org, revised Dec 2020.
Amjad Khan, 2023. "A Computational Model of Flexible Maze Navigation Through Hippocampal Replay: Bridging Neuroscience and AI," Frontiers in Computational Spatial Intelligence, 50sea, vol. 1(1), pages 10-17, July.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Zihao Zhang & Stefan Zohren & Stephen Roberts, 2018. "DeepLOB: Deep Convolutional Neural Networks for Limit Order Books," Papers 1808.03668, arXiv.org, revised Jan 2020.
Nils Köbis & Zoe Rahwan & Raluca Rilla & Bramantyo Ibrahim Supriyatno & Clara Bersch & Tamer Ajaj & Jean-François Bonnefon & Iyad Rahwan, 2025. "Delegation to artificial intelligence can increase dishonest behaviour," Nature, Nature, vol. 646(8083), pages 126-134, October.
- Nils Köbis & Zoe Rahwan & Raluca Rilla & Bramantyo Ibrahim Supriyatno & Clara Bersch & Tamer Ajaj & Jean-François Bonnefon & Iyad Rahwan, 2025. "Delegation to Artificial Intelligence can increase dishonest behaviour," Post-Print hal-05277822, HAL.
- Köbis, Nils & Rahwan, Zoe & Rilla, Raluca & Supriyatno, Bramantyo Ibrahim & Bersch, Clara & Ajaj, Tamer & Bonnefon, Jean-François & Rahwan, Iyad, 2025. "Delegation to Artificial Intelligence can increase dishonest behaviour," TSE Working Papers 25-1663, Toulouse School of Economics (TSE).
- Nils Köbis & Zoe Rahwan & Raluca Rilla & Bramantyo Ibrahim Supriyatno & Clara Bersch & Tamer Ajaj & Jean-François Bonnefon & Iyad Rahwan, 2025. "Delegation to Artificial Intelligence can increase dishonest behaviour," Working Papers hal-05273501, HAL.
Evangelos Benos & Satchit Sagade, 2012. "High-frequency trading behaviour and its impact on market quality: evidence from the UK equity market," Bank of England working papers 469, Bank of England.
Bellia, Mario & Christensen, Kim & Kolokolov, Aleksey & Pelizzon, Loriana & Renò, Roberto, 2022. "Do designated market makers provide liquidity during a flash crash?," SAFE Working Paper Series 270, Leibniz Institute for Financial Research SAFE, revised 2022.
Tamer Khraisha & Keren Arthur, 2018. "Can we have a general theory of financial innovation processes? A conceptual review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 4(1), pages 1-27, December.
Alexander Frankel & Maximilian Kasy, 2022. "Which Findings Should Be Published?," American Economic Journal: Microeconomics, American Economic Association, vol. 14(1), pages 1-38, February.
- Kasy, Maximilian & Frankel, Alexander, 2018. "Which findings should be published?," MetaArXiv mbvz3, Center for Open Science.
Bruno Biais & Fany Declerck & Sophie Moinas, 2016. "Who supplies liquidity, how and when?," BIS Working Papers 563, Bank for International Settlements.
- Biais, Bruno & Declerck, Fany & Moinas, Sophie, 2017. "Who supplies liquidity, how and when?," IDEI Working Papers 874, Institut d'Économie Industrielle (IDEI), Toulouse.
- Biais, Bruno & Declerck, Fany & Moinas, Sophie, 2017. "Who supplies liquidity, how and when?," TSE Working Papers 17-818, Toulouse School of Economics (TSE).
Jyotirmoy Sarkar, 2018. "Will Pâ€ Value Triumph over Abuses and Attacks?," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 7(4), pages 66-71, July.
Indriawan, Ivan & Martinez, Valeria & Tse, Yiuman, 2021. "The impact of the change in USDA announcement release procedures on agricultural commodity futures," Journal of Commodity Markets, Elsevier, vol. 23(C).
Andrea Bucci, 2020. "Realized Volatility Forecasting with Neural Networks," Journal of Financial Econometrics, Oxford University Press, vol. 18(3), pages 502-531.
- Andrea Bucci, 0. "Realized Volatility Forecasting with Neural Networks," Journal of Financial Econometrics, Oxford University Press, vol. 18(3), pages 502-531.
- Bucci, Andrea, 2019. "Realized Volatility Forecasting with Neural Networks," MPRA Paper 95443, University Library of Munich, Germany.
Álvaro Cartea & José Penalva, 2012. "Where is the Value in High Frequency Trading?," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 2(03), pages 1-46.
- Álvaro Cartea & José Penalva, 2011. "Where is the value in high frequency trading?," Working Papers 1111, Banco de España.
NIdhi Aggarwal & Venkatesh Panchapagesan & Susan Thomas, 2022. "When is the Order to Trade Ratio fee effective?," Working Papers 8, xKDR.
Kang, Jongho & Kang, Jangkoo & Kwon, Kyung Yoon, 2022. "Market versus limit orders of speculative high-frequency traders and price discovery," Research in International Business and Finance, Elsevier, vol. 63(C).
Ahmed Baig & Nasim Sabah & Drew Winters, 2019. "Have Stock Prices become more Uniformly Distributed?," Economics Bulletin, AccessEcon, vol. 39(2), pages 1242-1250.
Robert J. Kauffman & Yuzhou Hu & Dan Ma, 2015. "Will high-frequency trading practices transform the financial markets in the Asia Pacific Region?," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-27, December.
Uctum, Remzi & Renou-Maissant, Patricia & Prat, Georges & Lecarpentier-Moyal, Sylvie, 2017. "Persistence of announcement effects on the intraday volatility of stock returns: Evidence from individual data," Review of Financial Economics, Elsevier, vol. 35(C), pages 43-56.
- Remzi Uctum & Patricia Renou‐Maissant & Georges Prat & Sylvie Lecarpentier‐Moyal, 2017. "Persistence of announcement effects on the intraday volatility of stock returns: Evidence from individual data," Review of Financial Economics, John Wiley & Sons, vol. 35(1), pages 43-56, November.
- Sylvie Lecarpentier-Moyal & Georges Prat & Patricia Renou-Maissant & Remzi Uctum, 2013. "Persistence of announcement effects on the intraday volatility of stock returns: evidence from individual data," EconomiX Working Papers 2013-36, University of Paris Nanterre, EconomiX.
- Georges Prat & Remzi Uctum & Sylvie Lecarpentier-Moyal & Patricia Renou-Maissant, 2014. "Persistence of announcement effects on the intraday volatility of stock returns: evidence from individual data," Post-Print hal-01638222, HAL.
- Sylvie Lecarpentier-Moyal & Georges Prat & Patricia Renou-Maissant & Remzi Uctum, 2013. "Persistence of announcement effects on the intraday volatility of stock returns: evidence from individual data," Erudite Working Paper 2013-05, Erudite.
- Sylvie Lecarpentier Moyal & Georges Prat & Patricia Renou Maissant & Remzi Uctum, 2013. "Persistence of announcement effects on the intraday volatility of stock returns: evidence from individual data," Working Papers 2013-27, Department of Research, Ipag Business School.
- Sylvie Lecarpentier-Moyal & Georges Prat & Patricia Renou-Maissant & Remzi Uctum, 2014. "Persistence of announcement effects on the intraday volatility of stock returns: evidence from individual data," Post-Print hal-01411783, HAL.
- Remzi Uctum & Patricia Renou-Maissant & Georges Prat & Sylvie Lecarpentier-Moyal, 2017. "Persistence of announcement effects on the intraday volatility of stock returns: Evidence from individual data," Post-Print halshs-02080313, HAL.
Stanley, T. D. & Doucouliagos, Chris, 2019. "Practical Significance, Meta-Analysis and the Credibility of Economics," IZA Discussion Papers 12458, Institute of Labor Economics (IZA).
George Jiang & Ingrid Lo & Giorgio Valente, 2014. "High-Frequency Trading around Macroeconomic News Announcements: Evidence from the U.S. Treasury Market," Staff Working Papers 14-56, Bank of Canada.
Jaydip Sen & Sidra Mehtab & Abhishek Dutta & Saikat Mondal, 2022. "Precise Stock Price Prediction for Optimized Portfolio Design Using an LSTM Model," Papers 2203.01326, arXiv.org.
Aggarwal, Nidhi & Panchapagesan, Venkatesh & Thomas, Susan, 2023. "When is the order-to-trade ratio fee effective?," Journal of Financial Markets, Elsevier, vol. 62(C).
- NIdhi Aggarwal & Venkatesh Panchapagesan & Susan Thomas, 2022. "When is the order-to-trade ratio fee effective?," Working Papers 11, xKDR.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2020-04-20 (Big Data)
NEP-CMP-2020-04-20 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2004.06627. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

An Application of Deep Reinforcement Learning to Algorithmic Trading

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data