IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v60y2022i3d10.1007_s10614-021-10169-8.html
   My bibliography  Save this article

Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy

Author

Listed:
  • Baoqiang Zhan

    (Harbin Institute of Technology)

  • Shu Zhang

    (Guangdong University of Technology)

  • Helen S. Du

    (Guangdong University of Technology)

  • Xiaoguang Yang

    (Chinese Academy of Sciences)

Abstract

Arbitrage opportunity exploration is important to ensure the profitability of statistical arbitrage. Prior studies that concentrate on cointegration model and other predictive models suffer from various problems in both prediction and transaction. To prevent these problems, we propose a novel strategy based on machine learning to explore arbitrage opportunities and further predict whether they will make a profit or not. The experiment is conducted in the context of Chinese financial markets with high-frequency data of CSI 300 exchange traded fund (ETF) and CSI 300 index futures (IF) from 2012 to 2020. We find that machine learning strategy can explore more arbitrage opportunities with lower risks, which outperforms cointegration strategy in different aspects. Besides, we compare different algorithms and find that LSTM achieve better performance in predicting the positive arbitrage samples and obtaining higher ROI and Sharpe ratio. The profitability of machine learning strategy validate the mean reversion and price discovery function of asset price between spot market and futures market, which further substantiate the market efficiency. Our empirical results provide practical significance to the development of quantitative finance.

Suggested Citation

  • Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
  • Handle: RePEc:kap:compec:v:60:y:2022:i:3:d:10.1007_s10614-021-10169-8
    DOI: 10.1007/s10614-021-10169-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-021-10169-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-021-10169-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    2. Chen, Yu-Lun & Gau, Yin-Feng, 2010. "News announcements and price discovery in foreign exchange spot and futures markets," Journal of Banking & Finance, Elsevier, vol. 34(7), pages 1628-1636, July.
    3. Chaudhuri, Kausik & Wu, Yangru, 2003. "Random walk versus breaking trend in stock prices: Evidence from emerging markets," Journal of Banking & Finance, Elsevier, vol. 27(4), pages 575-592, April.
    4. Nicolae Gârleanu & Lasse Heje Pedersen, 2013. "Dynamic Trading with Predictable Returns and Transaction Costs," Journal of Finance, American Finance Association, vol. 68(6), pages 2309-2340, December.
    5. Fama, Eugene F & French, Kenneth R, 1988. "Permanent and Temporary Components of Stock Prices," Journal of Political Economy, University of Chicago Press, vol. 96(2), pages 246-273, April.
    6. Nicolas Huck, 2015. "Pairs trading: does volatility timing matter?," Applied Economics, Taylor & Francis Journals, vol. 47(57), pages 6239-6256, December.
    7. Broussard, John Paul & Vaihekoski, Mika, 2012. "Profitability of pairs trading strategy in an illiquid market with multiple share classes," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 22(5), pages 1188-1201.
    8. Nicolas Huck, 2015. "Pairs trading: does volatility timing matter?," Post-Print hal-01370246, HAL.
    9. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    10. Kao, Chihwa, 1999. "Spurious regression and residual-based tests for cointegration in panel data," Journal of Econometrics, Elsevier, vol. 90(1), pages 1-44, May.
    11. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    12. Christopher Krauss & Anh Do & Nicolas Huck, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," Post-Print hal-01768895, HAL.
    13. Jordà, Òscar & Taylor, Alan M., 2012. "The carry trade and fundamentals: Nothing to fear but FEER itself," Journal of International Economics, Elsevier, vol. 88(1), pages 74-90.
    14. Alain P. Chaboud & Benjamin Chiquoine & Erik Hjalmarsson & Clara Vega, 2014. "Rise of the Machines: Algorithmic Trading in the Foreign Exchange Market," Journal of Finance, American Finance Association, vol. 69(5), pages 2045-2084, October.
    15. Nicolas Huck, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," Post-Print hal-02143971, HAL.
    16. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    17. Neely, Christopher J. & Weller, Paul A., 2013. "Lessons from the evolution of foreign exchange trading strategies," Journal of Banking & Finance, Elsevier, vol. 37(10), pages 3783-3798.
    18. David G. McMillan & Alan E. H. Speight, 2006. "Nonlinear dynamics and competing behavioral interpretations: Evidence from intra‐day FTSE‐100 index and futures data," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 26(4), pages 343-368, April.
    19. Dong-Hyun Ahn & Jacob Boudoukh & Matthew Richardson & Robert F. Whitelaw, 2002. "Partial Adjustment or Stale Prices? Implications from Stock Index and Futures Return Autocorrelations," The Review of Financial Studies, Society for Financial Studies, vol. 15(2), pages 655-689, March.
    20. Basak, Suleyman & Croitoru, Benjamin, 2006. "On the role of arbitrageurs in rational markets," Journal of Financial Economics, Elsevier, vol. 81(1), pages 143-173, July.
    21. Roman Kozhan & Wing Wah Tham, 2012. "Execution Risk in High-Frequency Arbitrage," Management Science, INFORMS, vol. 58(11), pages 2131-2149, November.
    22. Abreu, Dilip & Brunnermeier, Markus K., 2002. "Synchronization risk and delayed arbitrage," Journal of Financial Economics, Elsevier, vol. 66(2-3), pages 341-360.
    23. Carlos Eduardo de Moura & Adrian Pizzinga & Jorge Zubelli, 2016. "A pairs trading strategy based on linear state space models and the Kalman filter," Quantitative Finance, Taylor & Francis Journals, vol. 16(10), pages 1559-1573, October.
    24. Schultz, Paul & Shive, Sophie, 2010. "Mispricing of dual-class shares: Profit opportunities, arbitrage, and trading," Journal of Financial Economics, Elsevier, vol. 98(3), pages 524-549, December.
    25. Mukarram Attari & Antonio S. Mello & Martin E. Ruckes, 2005. "Arbitraging Arbitrageurs," Journal of Finance, American Finance Association, vol. 60(5), pages 2471-2511, October.
    26. Sugato Chakravarty & Huseyin Gulen & Stewart Mayhew, 2004. "Informed Trading in Stock and Option Markets," Journal of Finance, American Finance Association, vol. 59(3), pages 1235-1258, June.
    27. Baker, Malcolm & Savasoglu, Serkan, 2002. "Limited arbitrage in mergers and acquisitions," Journal of Financial Economics, Elsevier, vol. 64(1), pages 91-115, April.
    28. Ronald Balvers & Yangru Wu & Erik Gilliland, 2000. "Mean Reversion across National Stock Markets and Parametric Contrarian Investment Strategies," Journal of Finance, American Finance Association, vol. 55(2), pages 745-772, April.
    29. Jonathan Brogaard & Terrence Hendershott & Ryan Riordan, 2014. "High-Frequency Trading and Price Discovery," The Review of Financial Studies, Society for Financial Studies, vol. 27(8), pages 2267-2306.
    30. Hogan, Steve & Jarrow, Robert & Teo, Melvyn & Warachka, Mitch, 2004. "Testing market efficiency using statistical arbitrage with applications to momentum and value strategies," Journal of Financial Economics, Elsevier, vol. 73(3), pages 525-565, September.
    31. Huck, Nicolas, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," European Journal of Operational Research, Elsevier, vol. 278(1), pages 330-342.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew Clegg & Christopher Krauss, 2018. "Pairs trading with partial cointegration," Quantitative Finance, Taylor & Francis Journals, vol. 18(1), pages 121-138, January.
    2. Emmanouil Mavrakis & Christos Alexakis, 2018. "Statistical Arbitrage Strategies under Different Market Conditions: The Case of the Greek Banking Sector," Journal of Emerging Market Finance, Institute for Financial Management and Research, vol. 17(2), pages 159-185, August.
    3. Alexander Jakob Dautel & Wolfgang Karl Härdle & Stefan Lessmann & Hsin-Vonn Seow, 2020. "Forex exchange rate forecasting using deep recurrent neural networks," Digital Finance, Springer, vol. 2(1), pages 69-96, September.
    4. Fabian Waldow & Matthias Schnaubelt & Christopher Krauss & Thomas Günter Fischer, 2021. "Machine Learning in Futures Markets," JRFM, MDPI, vol. 14(3), pages 1-14, March.
    5. Clegg, Matthew & Krauss, Christopher, 2016. "Pairs trading with partial cointegration," FAU Discussion Papers in Economics 05/2016, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    6. Erdinc Akyildirim & Ahmet Goncu & Alper Hekimoglu & Duc Khuong Nguyen & Ahmet Sensoy, 2023. "Statistical arbitrage: factor investing approach," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(4), pages 1295-1331, December.
    7. Schnaubelt, Matthias & Fischer, Thomas G. & Krauss, Christopher, 2020. "Separating the signal from the noise – Financial machine learning for Twitter," Journal of Economic Dynamics and Control, Elsevier, vol. 114(C).
    8. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    9. Kasper Johansson & Thomas Schmelzer & Stephen Boyd, 2024. "Finding Moving-Band Statistical Arbitrages via Convex-Concave Optimization," Papers 2402.08108, arXiv.org.
    10. Pedro M. Mirete-Ferrer & Alberto Garcia-Garcia & Juan Samuel Baixauli-Soler & Maria A. Prats, 2022. "A Review on Machine Learning for Asset Management," Risks, MDPI, vol. 10(4), pages 1-46, April.
    11. Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Post-Print hal-04144665, HAL.
    12. Zhou, Hao & Kalev, Petko S., 2019. "Algorithmic and high frequency trading in Asia-Pacific, now and the future," Pacific-Basin Finance Journal, Elsevier, vol. 53(C), pages 186-207.
    13. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    14. Krauss, Christopher, 2015. "Statistical arbitrage pairs trading strategies: Review and outlook," FAU Discussion Papers in Economics 09/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    15. Thierry Foucault & Roman Kozhan & Wing Wah Tham, 2017. "Toxic Arbitrage," The Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1053-1094.
    16. Frino, Alex & Mollica, Vito & Webb, Robert I. & Zhang, Shunquan, 2017. "The impact of latency sensitive trading on high frequency arbitrage opportunities," Pacific-Basin Finance Journal, Elsevier, vol. 45(C), pages 91-102.
    17. Rama Cont & Mihai Cucuringu & Chao Zhang, 2021. "Cross-Impact of Order Flow Imbalance in Equity Markets," Papers 2112.13213, arXiv.org, revised Jun 2023.
    18. Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Annals of Operations Research, Springer, vol. 288(1), pages 181-221, May.
    19. Committee, Nobel Prize, 2013. "Understanding Asset Prices," Nobel Prize in Economics documents 2013-1, Nobel Prize Committee.
    20. Kolesnikova, A. & Yang, Y. & Lessmann, S. & Ma, T. & Sung, M.-C. & Johnson, J.E.V., 2019. "Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting," IRTG 1792 Discussion Papers 2019-023, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:60:y:2022:i:3:d:10.1007_s10614-021-10169-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.