Towards Generalizable Reinforcement Learning for Trade Execution

Author

Chuheng Zhang
Yitong Duan
Xiaoyu Chen
Jianyu Chen
Jian Li
Li Zhao

Abstract

Optimized trade execution aims to sell (or buy) a given amount of assets within a given time at the lowest possible trading cost. Recently, reinforcement learning (RL) has been applied to optimized trade execution to learn smarter policies from market data. However, we find that many existing RL methods exhibit considerable overfitting, which prevents their deployment in practice. In this paper, we provide an extensive study of the overfitting problem in optimized trade execution. First, we model optimized trade execution as offline RL with dynamic context (ORDC), where the context represents market variables that cannot be influenced by the trading policy and are collected in an offline manner. Under this framework, we derive a generalization bound and find that the overfitting issue is caused by the large context space and the limited context samples in the offline setting. Accordingly, we propose to learn compact representations of the context to address the overfitting problem, either by leveraging prior knowledge or in an end-to-end manner. To evaluate our algorithms, we also implement a carefully designed simulator based on historical limit order book (LOB) data to provide a high-fidelity benchmark for different algorithms. Our experiments on this simulator demonstrate that our algorithms can effectively alleviate overfitting and achieve better performance.
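
To make the problem setup in the abstract concrete, the following is a minimal sketch of how the trading cost of a sell order might be measured. It is not the method proposed in the paper: it uses a time-weighted average price (TWAP) schedule, a common non-learning baseline, and measures cost as the shortfall against the arrival price in basis points. The function names, the fill prices, and the choice of cost metric are all illustrative assumptions.

```python
from typing import List, Sequence


def twap_schedule(total_qty: int, n_steps: int) -> List[int]:
    """Split an order evenly over n_steps (hypothetical TWAP baseline)."""
    base, remainder = divmod(total_qty, n_steps)
    # Earlier steps absorb the remainder so quantities sum to total_qty.
    return [base + (1 if i < remainder else 0) for i in range(n_steps)]


def average_fill_price(quantities: Sequence[int], prices: Sequence[float]) -> float:
    """Quantity-weighted average price over all child orders."""
    total = sum(quantities)
    return sum(q * p for q, p in zip(quantities, prices)) / total


def sell_shortfall_bps(arrival_price: float, avg_price: float) -> float:
    """Cost of a sell order relative to the arrival price, in basis points.

    Positive values mean the order sold below the arrival price.
    """
    return (arrival_price - avg_price) / arrival_price * 1e4


# Illustrative numbers only (assumed, not taken from the paper).
schedule = twap_schedule(1000, 4)          # four equal child orders
fills = [100.0, 99.8, 99.9, 99.7]          # assumed fill price per step
avg = average_fill_price(schedule, fills)
cost = sell_shortfall_bps(100.0, avg)      # arrival price assumed to be 100.0
print(schedule, round(avg, 4), round(cost, 2))
```

An RL execution policy replaces the fixed schedule with quantities chosen from market observations; the abstract's concern is that such a policy may reduce this kind of cost on historical training data without doing so on unseen market conditions.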

Suggested Citation

Chuheng Zhang & Yitong Duan & Xiaoyu Chen & Jianyu Chen & Jian Li & Li Zhao, 2023. "Towards Generalizable Reinforcement Learning for Trade Execution," Papers 2307.11685, arXiv.org.

Handle: RePEc:arx:papers:2307.11685

Citations

Cited by:

Anil Sharma & Freeman Chen & Jaesun Noh & Julio DeJesus & Mario Schlener, 2024. "Hedging and Pricing Structured Products Featuring Multiple Underlying Assets," Papers 2411.01121, arXiv.org.
Chuqiao Zong & Chaojie Wang & Molei Qin & Lei Feng & Xinrun Wang & Bo An, 2024. "MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading," Papers 2406.14537, arXiv.org.
Molei Qin & Shuo Sun & Wentao Zhang & Haochong Xia & Xinrun Wang & Bo An, 2023. "EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading," Papers 2309.12891, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Matthias Schnaubelt & Jonas Rende & Christopher Krauss, 2019. "Testing Stylized Facts of Bitcoin Limit Order Books," JRFM, MDPI, vol. 12(1), pages 1-30, February.
Schnaubelt, Matthias, 2020. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," FAU Discussion Papers in Economics 05/2020, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
Peter Gomber & Uwe Schweickert & Erik Theissen, 2015. "Liquidity Dynamics in an Electronic Open Limit Order Book: an Event Study Approach," European Financial Management, European Financial Management Association, vol. 21(1), pages 52-78, January.
- Gomber, Peter & Schweickert, Uwe & Theissen, Erik, 2011. "Liquidity dynamics in an electronic open limit order book: An event study approach," CFR Working Papers 11-14, University of Cologne, Centre for Financial Research (CFR).
Campi, Luciano & Zabaljauregui, Diego, 2020. "Optimal market making under partial information with general intensities," LSE Research Online Documents on Economics 104612, London School of Economics and Political Science, LSE Library.
Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
Large, Jeremy, 2011. "Estimating quadratic variation when quoted prices change by a constant increment," Journal of Econometrics, Elsevier, vol. 160(1), pages 2-11, January.
- Jeremy Large, 2007. "Estimating Quadratic Variation When Quoted Prices Change by a Constant Increment," Economics Series Working Papers 340, University of Oxford, Department of Economics.
Christopher Lorenz & Alexander Schied, 2013. "Drift dependence of optimal trade execution strategies under transient price impact," Finance and Stochastics, Springer, vol. 17(4), pages 743-770, October.
Aurélien Alfonsi & Alexander Schied & Florian Klock, 2013. "Multivariate transient price impact and matrix-valued positive definite functions," Papers 1310.4471, arXiv.org, revised Sep 2015.
Ryan Donnelly & Zi Li, 2022. "Dynamic Inventory Management with Mean-Field Competition," Papers 2210.17208, arXiv.org, revised Apr 2025.
Sofiene El Aoud & Frédéric Abergel, 2015. "A stochastic control approach for options market making," Post-Print hal-01061852, HAL.
Wuyts, Gunther, 2008. "The impact of liquidity shocks through the limit order book," CFS Working Paper Series 2008/53, Center for Financial Studies (CFS).
Hendershott, Terrence & Menkveld, Albert J., 2014. "Price pressures," Journal of Financial Economics, Elsevier, vol. 114(3), pages 405-423.
- Hendershott, Terrence & Menkveld, Albert J., 2010. "Price pressures," CFS Working Paper Series 2010/14, Center for Financial Studies (CFS).
N Baradel & B Bouchard & Ngoc Minh Dang, 2016. "Optimal trading with online parameters revisions," Working Papers hal-01304019, HAL.
O’Sullivan, Conall & Papavassiliou, Vassilios G. & Wafula, Ronald Wekesa & Boubaker, Sabri, 2024. "New insights into liquidity resiliency," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 90(C).
- Conall O'Sullivan & Vassilios G. Papavassiliou & Ronald Wekesa Wafula & Sabri Boubaker, 2024. "New Insights into Liquidity Resiliency," Post-Print hal-04432411, HAL.
Philippe Bergault & David Evangelista & Olivier Guéant & Douglas Vieira, 2018. "Closed-form approximations in multi-asset market making," Papers 1810.04383, arXiv.org, revised Sep 2022.
Luca Lalor & Anatoliy Swishchuk, 2024. "Market Simulation under Adverse Selection," Papers 2409.12721, arXiv.org, revised Mar 2025.
Adam Blazejewski & Richard Coggins, 2004. "A piecewise linear model for trade sign inference," Finance 0412012, University Library of Munich, Germany.
Sensoy, Ahmet, 2017. "Firm size, ownership structure, and systematic liquidity risk: The case of an emerging market," Journal of Financial Stability, Elsevier, vol. 31(C), pages 62-80.
Hai-Chuan Xu & Wei Chen & Xiong Xiong & Wei Zhang & Wei-Xing Zhou & H Eugene Stanley, 2016. "Limit-order book resiliency after effective market orders: Spread, depth and intensity," Papers 1602.00731, arXiv.org, revised Feb 2017.
Thierry Foucault & Ohad Kadan & Eugene Kandel, 2005. "Limit Order Book as a Market for Liquidity," The Review of Financial Studies, Society for Financial Studies, vol. 18(4), pages 1171-1217.
- Foucault, Thierry & Kandel, Eugene & Kadan, Ohad, 2001. "Limit Order Book as a Market for Liquidity," CEPR Discussion Papers 2889, C.E.P.R. Discussion Papers.
- Thierry Foucault & Ohad Kadan & Eugene Kandel, 2011. "Limit Order Book as a Market for Liquidity," Working Papers hal-00597190, HAL.
- Thierry Foucault & Ohad Kadan & Eugene Kandel, 2005. "Limit Order Book as a Market for Liquidity," Post-Print hal-00459785, HAL.
- Thierry Foucault & Ohad Kadan & Eugene Kandel, 2003. "Limit Order Book as a Market for Liquidity," Discussion Paper Series dp321, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
- FOUCAULT, Thierry & KADAN, Ohad & KANDEL, Eugene, 2001. "Limit order book as a market for liquidity," HEC Research Papers Series 728, HEC Paris.
- Thierry Foucault & Ohad Kadan & Eugene Kandel, 2005. "Limit Order Book as a Market for Liquidity," Post-Print halshs-00005043, HAL.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2023-08-28 (Computational Economics)
NEP-MST-2023-08-28 (Market Microstructure)
