When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making

When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making

Author

Listed:

Ali Raza Jafree
Konark Jain
Nick Firoozye

Abstract

We investigate the mechanisms by which medium-frequency trading agents are adversely selected by opportunistic high-frequency traders. We use reinforcement learning (RL) within a Hawkes Limit Order Book (LOB) model in order to replicate the behaviours of high-frequency market makers. In contrast to the classical models with exogenous price impact assumptions, the Hawkes model accounts for endogenous price impact and other key properties of the market (Jain et al. 2024a). Given the real-world impracticalities of the market maker updating strategies for every event in the LOB, we formulate the high-frequency market making agent via an impulse control reinforcement learning framework (Jain et al. 2025). The RL used in the simulation utilises Proximal Policy Optimisation (PPO) and self-imitation learning. To replicate the adverse selection phenomenon, we test the RL agent trading against a medium frequency trader (MFT) executing a meta-order and demonstrate that, with training against the MFT meta-order execution agent, the RL market making agent learns to capitalise on the price drift induced by the meta-order. Recent empirical studies have shown that medium-frequency traders are increasingly subject to adverse selection by high-frequency trading agents. As high-frequency trading continues to proliferate across financial markets, the slippage costs incurred by medium-frequency traders are likely to increase over time. However, we do not observe that increased profits for the market making RL agent necessarily cause significantly increased slippages for the MFT agent.

Suggested Citation

Ali Raza Jafree & Konark Jain & Nick Firoozye, 2025. "When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making," Papers 2510.27334, arXiv.org.

Handle: RePEc:arx:papers:2510.27334

Download full text from publisher

References listed on IDEAS

Jean-Edouard Colliard & Thierry Foucault & Stefano Lovo, 2022. "Algorithmic Pricing and Liquidity in Securities Markets," Working Papers hal-03890671, HAL.
- Colliard, Jean-Edouard & Foucault, Thierry & Lovo, Stefano, 2022. "Algorithmic Pricing and Liquidity in Securities Markets," HEC Research Papers Series 1459, HEC Paris.
- Colliard, Jean-Edouard & Foucault, Thierry & Lovo, Stefano, 2022. "Algorithmic Pricing and Liquidity in Securities Markets," CEPR Discussion Papers 17606, C.E.P.R. Discussion Papers.
Fengbin Zhu & Junfeng Li & Liangming Pan & Wenjie Wang & Fuli Feng & Chao Wang & Huanbo Luan & Tat-Seng Chua, 2025. "Towards Temporal-Aware Multi-Modal Retrieval Augmented Generation in Finance," Papers 2503.05185, arXiv.org, revised Aug 2025.
Manuel Naviglio & Giacomo Bormetti & Francesco Campigli & German Rodikov & Fabrizio Lillo, 2025. "Why is the estimation of metaorder impact with public market data so challenging?," Papers 2501.17096, arXiv.org, revised Dec 2025.
Chen Yao & Mao Ye, 2018. "Why Trading Speed Matters: A Tale of Queue Rationing under Price Controls," The Review of Financial Studies, Society for Financial Studies, vol. 31(6), pages 2157-2183.
Jonathan Brogaard & Terrence Hendershott & Ryan Riordan, 2019. "Price Discovery without Trading: Evidence from Limit Orders," Journal of Finance, American Finance Association, vol. 74(4), pages 1621-1658, August.
Jain, Konark & Firoozye, Nick & Kochems, Jonathan & Treleaven, Philip, 2024. "Limit Order Book dynamics and order size modelling using Compound Hawkes Process," Finance Research Letters, Elsevier, vol. 69(PA).
- Konark Jain & Nick Firoozye & Jonathan Kochems & Philip Treleaven, 2023. "Limit Order Book Dynamics and Order Size Modelling Using Compound Hawkes Process," Papers 2312.08927, arXiv.org, revised Aug 2024.
Guillaume Maitrier & Gr'egoire Loeper & Jean-Philippe Bouchaud, 2025. "Generating realistic metaorders from public data," Papers 2503.18199, arXiv.org, revised Apr 2025.
Konark Jain & Nick Firoozye & Jonathan Kochems & Philip Treleaven, 2024. "Limit Order Book Simulations: A Review," Papers 2402.17359, arXiv.org, revised Mar 2024.
Markus Baldauf & Joshua Mollner, 2020. "High‐Frequency Trading and Market Performance," Journal of Finance, American Finance Association, vol. 75(3), pages 1495-1526, June.
Remi Genet & Hugo Inzirillo, 2025. "LEMs: A Primer On Large Execution Models," Papers 2509.25211, arXiv.org.
Konark Jain & Nick Firoozye & Jonathan Kochems & Philip Treleaven, 2025. "An Impulse Control Approach to Market Making in a Hawkes LOB Market," Papers 2510.26438, arXiv.org, revised Oct 2025.
J. Donier & J. Bonart & I. Mastromatteo & J.-P. Bouchaud, 2015. "A fully consistent, minimal model for non-linear market impact," Quantitative Finance, Taylor & Francis Journals, vol. 15(7), pages 1109-1121, July.
Jonathan Donier & Julius Bonart & Iacopo Mastromatteo & Jean-Philippe Bouchaud, 2014. "A fully consistent, minimal model for non-linear market impact," Papers 1412.0141, arXiv.org, revised Mar 2015.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Nicholas Hirschey, 2021. "Do High-Frequency Traders Anticipate Buying and Selling Pressure?," Management Science, INFORMS, vol. 67(6), pages 3321-3345, June.
Ye, Mao & Zheng, Miles Y. & Zhu, Wei, 2023. "The effect of tick size on managerial learning from stock prices," Journal of Accounting and Economics, Elsevier, vol. 75(1).
Adele Ravagnani & Fabrizio Lillo, 2025. "Modeling metaorder impact with a Non-Markovian Zero Intelligence model," Papers 2503.05254, arXiv.org, revised Mar 2025.
Michael Goldstein & Amy Kwan & Richard Philip, 2023. "High-Frequency Trading Strategies," Management Science, INFORMS, vol. 69(8), pages 4413-4434, August.
Sudhanshu Pani, 2020. "A Theory of 'Auction as a Search' in speculative markets," Papers 2006.00775, arXiv.org.
Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
Kang, Jongho & Kang, Jangkoo & Kwon, Kyung Yoon, 2022. "Market versus limit orders of speculative high-frequency traders and price discovery," Research in International Business and Finance, Elsevier, vol. 63(C).
Fr'ed'eric Bucci & Michael Benzaquen & Fabrizio Lillo & Jean-Philippe Bouchaud, 2019. "Slow decay of impact in equity markets: insights from the ANcerno database," Papers 1901.05332, arXiv.org, revised Jan 2019.
Fr'ed'eric Bucci & Iacopo Mastromatteo & Michael Benzaquen & Jean-Philippe Bouchaud, 2019. "Impact is not just volatility," Papers 1905.04569, arXiv.org.
Comerton-Forde, Carole & Grégoire, Vincent & Zhong, Zhuo, 2019. "Inverted fee structures, tick size, and market quality," Journal of Financial Economics, Elsevier, vol. 134(1), pages 141-164.
Francesco Cordoni & Fabrizio Lillo, 2022. "Transient impact from the Nash equilibrium of a permanent market impact game," Papers 2205.00494, arXiv.org, revised Mar 2023.
Marjolein E. Verhulst & Philippe Debie & Stephan Hageboeck & Joost M. E. Pennings & Cornelis Gardebroek & Axel Naumann & Paul van Leeuwen & Andres A. Trujillo‐Barrera & Lorenzo Moneta, 2021. "When two worlds collide: Using particle physics tools to visualize the limit order book," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 41(11), pages 1715-1734, November.
- Marjolein E. Verhulst & Philippe Debie & Stephan Hageboeck & Joost M. E. Pennings & Cornelis Gardebroek & Axel Naumann & Paul van Leeuwen & Andres A. Trujillo-Barrera & Lorenzo Moneta, 2021. "When Two Worlds Collide: Using Particle Physics Tools to Visualize the Limit Order Book," Papers 2109.04812, arXiv.org.
Przemys{l}aw Rola, 2025. "Boltzmann Price: Toward Understanding the Fair Price in High-Frequency Markets," Papers 2507.09734, arXiv.org.
Louis Saddier & Matteo Marsili, 2023. "A Bayesian theory of market impact," Papers 2303.08867, arXiv.org, revised May 2024.
Frédéric Bucci & Michael Benzaquen & Fabrizio Lillo & Jean-Philippe Bouchaud, 2019. "Slow Decay of Impact in Equity Markets: Insights from the ANcerno Database," Post-Print hal-02323357, HAL.
Mohammed Salek & Damien Challet & Ioane Muni Toke, 2024. "Equity auction dynamics: latent liquidity models with activity acceleration," Quantitative Finance, Taylor & Francis Journals, vol. 24(10), pages 1381-1398, October.
- Mohammed Salek & Damien Challet & Ioane Muni Toke, 2024. "Equity auction dynamics: latent liquidity models with activity acceleration," Papers 2401.06724, arXiv.org, revised Jul 2024.
- Mohammed Salek & Damien Challet & Ioane Muni Toke, 2024. "Equity auction dynamics: latent liquidity models with activity acceleration," Post-Print hal-04391810, HAL.
Mohammed Salek & Damien Challet & Ioane Muni Toke, 2023. "Price impact in equity auctions: zero, then linear," Papers 2301.05677, arXiv.org, revised Sep 2023.
- Mohammed Salek & Damien Challet & Ioane Muni Toke, 2024. "Price impact in equity auctions: zero, then linear," Post-Print hal-03938660, HAL.
Salma Elomari-Kessab & Guillaume Maitrier & Julius Bonart & Jean-Philippe Bouchaud, 2024. ""Microstructure Modes" -- Disentangling the Joint Dynamics of Prices & Order Flow," Papers 2405.10654, arXiv.org.
Yuki Sato & Kiyoshi Kanazawa, 2023. "Exact solution to a generalised Lillo-Mike-Farmer model with heterogeneous order-splitting strategies," Papers 2306.13378, arXiv.org, revised Nov 2023.
Francesco Cordoni & Fabrizio Lillo, 2024. "Transient Impact from the Nash Equilibrium of a Permanent Market Impact Game," Dynamic Games and Applications, Springer, vol. 14(2), pages 333-361, May.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-AIN-2025-11-10 (Artificial Intelligence)
NEP-CMP-2025-11-10 (Computational Economics)
NEP-MST-2025-11-10 (Market Microstructure)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.27334. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data