IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2307.11685.html

Towards Generalizable Reinforcement Learning for Trade Execution

Author

Listed:
  • Chuheng Zhang
  • Yitong Duan
  • Xiaoyu Chen
  • Jianyu Chen
  • Jian Li
  • Li Zhao

Abstract

Optimized trade execution is to sell (or buy) a given amount of assets in a given time with the lowest possible trading cost. Recently, reinforcement learning (RL) has been applied to optimized trade execution to learn smarter policies from market data. However, we find that many existing RL methods exhibit considerable overfitting which prevents them from real deployment. In this paper, we provide an extensive study on the overfitting problem in optimized trade execution. First, we model the optimized trade execution as offline RL with dynamic context (ORDC), where the context represents market variables that cannot be influenced by the trading policy and are collected in an offline manner. Under this framework, we derive the generalization bound and find that the overfitting issue is caused by large context space and limited context samples in the offline setting. Accordingly, we propose to learn compact representations for context to address the overfitting problem, either by leveraging prior knowledge or in an end-to-end manner. To evaluate our algorithms, we also implement a carefully designed simulator based on historical limit order book (LOB) data to provide a high-fidelity benchmark for different algorithms. Our experiments on the high-fidelity simulator demonstrate that our algorithms can effectively alleviate overfitting and achieve better performance.

Suggested Citation

  • Chuheng Zhang & Yitong Duan & Xiaoyu Chen & Jianyu Chen & Jian Li & Li Zhao, 2023. "Towards Generalizable Reinforcement Learning for Trade Execution," Papers 2307.11685, arXiv.org.
  • Handle: RePEc:arx:papers:2307.11685
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2307.11685
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Olivier Gu'eant & Charles-Albert Lehalle & Joaquin Fernandez Tapia, 2011. "Optimal Portfolio Liquidation with Limit Orders," Papers 1106.3279, arXiv.org, revised Jul 2012.
    2. Brian Bulthuis & Julio Concha & Tim Leung & Brian Ward, 2017. "Optimal execution of limit and market orders with trade director, speed limiter, and fill uncertainty," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 4(02n03), pages 1-29, June.
    3. Hans Degryse & Frank Jong & Maarten Ravenswaaij & Gunther Wuyts, 2005. "Aggressive Orders and the Resiliency of a Limit Order Market," Review of Finance, Springer, vol. 9(2), pages 201-242, June.
    4. Peter Gomber & Uwe Schweickert & Erik Theissen, 2015. "Liquidity Dynamics in an Electronic Open Limit Order Book: an Event Study Approach," European Financial Management, European Financial Management Association, vol. 21(1), pages 52-78, January.
    5. James Richard Cummings & Alex Frino, 2010. "Further analysis of the speed of response to large trades in interest rate futures," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 30(8), pages 705-724, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Anil Sharma & Freeman Chen & Jaesun Noh & Julio DeJesus & Mario Schlener, 2024. "Hedging and Pricing Structured Products Featuring Multiple Underlying Assets," Papers 2411.01121, arXiv.org.
    2. Chuqiao Zong & Chaojie Wang & Molei Qin & Lei Feng & Xinrun Wang & Bo An, 2024. "MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading," Papers 2406.14537, arXiv.org.
    3. Molei Qin & Shuo Sun & Wentao Zhang & Haochong Xia & Xinrun Wang & Bo An, 2023. "EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading," Papers 2309.12891, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthias Schnaubelt & Jonas Rende & Christopher Krauss, 2019. "Testing Stylized Facts of Bitcoin Limit Order Books," JRFM, MDPI, vol. 12(1), pages 1-30, February.
    2. Schnaubelt, Matthias, 2020. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," FAU Discussion Papers in Economics 05/2020, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    3. O’Sullivan, Conall & Papavassiliou, Vassilios G. & Wafula, Ronald Wekesa & Boubaker, Sabri, 2024. "New insights into liquidity resiliency," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 90(C).
    4. Hai-Chuan Xu & Wei Chen & Xiong Xiong & Wei Zhang & Wei-Xing Zhou & H Eugene Stanley, 2016. "Limit-order book resiliency after effective market orders: Spread, depth and intensity," Papers 1602.00731, arXiv.org, revised Feb 2017.
    5. Schnaubelt, Matthias, 2022. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," European Journal of Operational Research, Elsevier, vol. 296(3), pages 993-1006.
    6. Benjamin Clapham & Martin Haferkorn & Kai Zimmermann, 2020. "Does Speed Matter? The Role Of High‐Frequency Trading For Order Book Resiliency," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 43(4), pages 933-964, December.
    7. Peter Gomber & Uwe Schweickert & Erik Theissen, 2015. "Liquidity Dynamics in an Electronic Open Limit Order Book: an Event Study Approach," European Financial Management, European Financial Management Association, vol. 21(1), pages 52-78, January.
    8. Daniel Havran & Kata Varadi, 2015. "Price Impact and the Recovery of the Limit Order Book: Why Should We Care About Informed Liquidity Providers?," CERS-IE WORKING PAPERS 1540, Institute of Economics, Centre for Economic and Regional Studies.
    9. Siikanen, Milla & Kanniainen, Juho & Valli, Jaakko, 2017. "Limit order books and liquidity around scheduled and non-scheduled announcements: Empirical evidence from NASDAQ Nordic," Finance Research Letters, Elsevier, vol. 21(C), pages 264-271.
    10. Martin Angerer & Marius Gramlich & Michael Hanke, 2025. "Order Book Liquidity on Crypto Exchanges," JRFM, MDPI, vol. 18(3), pages 1-29, February.
    11. Ilyas Ahnach & Said Tounsi, 2025. "La Technologie Blockchain Et La Resilience Du Marche Financier : Etude D'Impact Et De Relation, Cas De La Bourse De Casablanca," Post-Print hal-05135043, HAL.
    12. Campi, Luciano & Zabaljauregui, Diego, 2020. "Optimal market making under partial information with general intensities," LSE Research Online Documents on Economics 104612, London School of Economics and Political Science, LSE Library.
    13. Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
    14. Large, Jeremy, 2011. "Estimating quadratic variation when quoted prices change by a constant increment," Journal of Econometrics, Elsevier, vol. 160(1), pages 2-11, January.
    15. Christopher Lorenz & Alexander Schied, 2013. "Drift dependence of optimal trade execution strategies under transient price impact," Finance and Stochastics, Springer, vol. 17(4), pages 743-770, October.
    16. Olaf Korn & Paolo Krischak & Erik Theissen, 2019. "Illiquidity transmission from spot to futures markets," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 39(10), pages 1228-1249, October.
    17. Jonathan Fullwood & Daniele Massacci, 2018. "Liquidity resilience in the UK gilt futures market: evidence from the order book," Bank of England working papers 744, Bank of England.
    18. Aur'elien Alfonsi & Alexander Schied & Florian Klock, 2013. "Multivariate transient price impact and matrix-valued positive definite functions," Papers 1310.4471, arXiv.org, revised Sep 2015.
    19. Ryan Donnelly & Zi Li, 2022. "Dynamic Inventory Management with Mean-Field Competition," Papers 2210.17208, arXiv.org, revised Apr 2025.
    20. Sofiene El Aoud & Frédéric Abergel, 2015. "A stochastic control approach for options market making," Post-Print hal-01061852, HAL.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2307.11685. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.