Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks

Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks

Author

Listed:

David W. Lu

Abstract

With the breakthrough of computational power and deep neural networks, many areas that we haven't explore with various techniques that was researched rigorously in past is feasible. In this paper, we will walk through possible concepts to achieve robo-like trading or advising. In order to accomplish similar level of performance and generality, like a human trader, our agents learn for themselves to create successful strategies that lead to the human-level long-term rewards. The learning model is implemented in Long Short Term Memory (LSTM) recurrent structures with Reinforcement Learning or Evolution Strategies acting as agents The robustness and feasibility of the system is verified on GBPUSD trading.

Suggested Citation

David W. Lu, 2017. "Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks," Papers 1707.07338, arXiv.org.

Handle: RePEc:arx:papers:1707.07338

Download full text from publisher

References listed on IDEAS

Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Ali Al-Ameer & Khaled Alshehri, 2021. "Conditional Value-at-Risk for Quantitative Trading: A Direct Reinforcement Learning Approach," Papers 2109.14438, arXiv.org.
Hans Buhler & Lukas Gonon & Josef Teichmann & Ben Wood, 2018. "Deep Hedging," Papers 1802.03042, arXiv.org.
Alexandre Carbonneau & Fr'ed'eric Godin, 2020. "Equal Risk Pricing of Derivatives with Deep Hedging," Papers 2002.08492, arXiv.org, revised Jun 2020.
Kinyua, Johnson D. & Mutigwe, Charles & Cushing, Daniel J. & Poggi, Michael, 2021. "An analysis of the impact of President Trump’s tweets on the DJIA and S&P 500 using machine learning and sentiment analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 29(C).
Ahmet Murat Ozbayoglu & Mehmet Ugur Gudelek & Omer Berat Sezer, 2020. "Deep Learning for Financial Applications : A Survey," Papers 2002.05786, arXiv.org.
Alexandre Carbonneau & Frédéric Godin, 2023. "Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures," Risks, MDPI, vol. 11(8), pages 1-27, August.
Luca De Gennaro Aquino & Carole Bernard, 2019. "Bounds on Multi-asset Derivatives via Neural Networks," Papers 1911.05523, arXiv.org, revised Nov 2020.
Svitlana Vyetrenko & David Byrd & Nick Petosa & Mahmoud Mahfouz & Danial Dervovic & Manuela Veloso & Tucker Hybinette Balch, 2019. "Get Real: Realism Metrics for Robust Limit Order Book Market Simulations," Papers 1912.04941, arXiv.org.
Jonathan Sadighian, 2019. "Deep Reinforcement Learning in Cryptocurrency Market Making," Papers 1911.08647, arXiv.org.
Alexandre Carbonneau & Fr'ed'eric Godin, 2021. "Deep equal risk pricing of financial derivatives with non-translation invariant risk measures," Papers 2107.11340, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Eric Sucky, 2006. "Kontraktlogistik—Ein stochastisch dynamischer Planungsansatz zur Logistikdienstleisterauswahl," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 17(2), pages 131-153, June.
Pierre Bernhard & Marc Deschamps, 2017. "Kalman on dynamics and contro, Linear System Theory, Optimal Control, and Filter," Working Papers 2017-10, CRESE.
Jones, Randall E. & Cacho, Oscar J., 2000. "A Dynamic Optimisation Model of Weed Control," 2000 Conference (44th), January 23-25, 2000, Sydney, Australia 123685, Australian Agricultural and Resource Economics Society.
- Cacho, Oscar J. & Jones, Randall E., 2000. "A Dynamic Optimisation Model of Weed Control," Working Papers 12902, University of New England, School of Economics.
Voelkel, Michael A. & Sachs, Anna-Lena & Thonemann, Ulrich W., 2020. "An aggregation-based approximate dynamic programming approach for the periodic review model with random yield," European Journal of Operational Research, Elsevier, vol. 281(2), pages 286-298.
Pam Norton & Ravi Phatarfod, 2008. "Optimal Strategies In One-Day Cricket," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 25(04), pages 495-511.
Mitri Kitti, 2013. "Subgame Perfect Equilibria in Discounted Stochastic Games," Discussion Papers 87, Aboa Centre for Economics.
Rempel, M. & Cai, J., 2021. "A review of approximate dynamic programming applications within military operations research," Operations Research Perspectives, Elsevier, vol. 8(C).
Aghayi, Nazila & Maleki, Bentolhoda, 2016. "Efficiency measurement of DMUs with undesirable outputs under uncertainty based on the directional distance function: Application on bank industry," Energy, Elsevier, vol. 112(C), pages 376-387.
Baldi, Simone & Michailidis, Iakovos & Ravanis, Christos & Kosmatopoulos, Elias B., 2015. "Model-based and model-free “plug-and-play” building energy efficient control," Applied Energy, Elsevier, vol. 154(C), pages 829-841.
Tan, Madeleine Sui-Lay, 2016. "Policy coordination among the ASEAN-5: A global VAR analysis," Journal of Asian Economics, Elsevier, vol. 44(C), pages 20-40.
Deepmala, 2014. "Existence Theorems for Solvability of a Functional Equation Arising in Dynamic Programming," International Journal of Mathematics and Mathematical Sciences, Hindawi, vol. 2014, pages 1-9, April.
D. W. K. Yeung, 2008. "Dynamically Consistent Solution For A Pollution Management Game In Collaborative Abatement With Uncertain Future Payoffs," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 10(04), pages 517-538.
Mahmoudimehr, Javad & Sebghati, Parvin, 2019. "A novel multi-objective Dynamic Programming optimization method: Performance management of a solar thermal power plant as a case study," Energy, Elsevier, vol. 168(C), pages 796-814.
Mahes, Roshan & Mandjes, Michel & Boon, Marko & Taylor, Peter, 2024. "Adaptive scheduling in service systems: A Dynamic programming approach," European Journal of Operational Research, Elsevier, vol. 312(2), pages 605-626.
Korfhage, Thorben & Fischer-Weckemann, Björn, 2024. "Long-run consequences of informal elderly care and implications of public long-term care insurance," Journal of Health Economics, Elsevier, vol. 96(C).
Astaraky, Davood & Patrick, Jonathan, 2015. "A simulation based approximate dynamic programming approach to multi-class, multi-resource surgical scheduling," European Journal of Operational Research, Elsevier, vol. 245(1), pages 309-319.
Crutchfield, Stephen R. & Brazee, Richard J., "undated". "An Integrated Model of Surface and Ground Water Quality," 1990 Annual meeting, August 5-8, Vancouver, Canada 271011, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
Deepti Rani & Maria Moreira, 2010. "Simulation–Optimization Modeling: A Survey and Potential Application in Reservoir Systems Operation," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 24(6), pages 1107-1138, April.
Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
Caglar, Metin, 1974. "Optimization of intraseasonal water allocation," ISU General Staff Papers 197401010800006976, Iowa State University, Department of Economics.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2017-07-30 (Big Data)
NEP-CMP-2017-07-30 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1707.07338. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data