IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2103.09987.html
   My bibliography  Save this paper

Statistical Arbitrage Risk Premium by Machine Learning

Author

Listed:
  • Raymond C. W. Leung
  • Yu-Man Tam

Abstract

How to hedge factor risks without knowing the identities of the factors? We first prove a general theoretical result: even if the exact set of factors cannot be identified, any risky asset can use some portfolio of similar peer assets to hedge against its own factor exposures. A long position of a risky asset and a short position of a "replicate portfolio" of its peers represent that asset's factor residual risk. We coin the expected return of an asset's factor residual risk as its Statistical Arbitrage Risk Premium (SARP). The challenge in empirically estimating SARP is finding the peers for each asset and constructing the replicate portfolios. We use the elastic-net, a machine learning method, to project each stock's past returns onto that of every other stock. The resulting high-dimensional but sparse projection vector serves as investment weights in constructing the stocks' replicate portfolios. We say a stock has high (low) Statistical Arbitrage Risk (SAR) if it has low (high) R-squared with its peers. The key finding is that "unique" stocks have both a higher SARP and higher excess returns than "ubiquitous" stocks: in the cross-section, high SAR stocks have a monthly SARP (monthly excess returns) that is 1.101% (0.710%) greater than low SAR stocks. The average SAR across all stocks is countercyclical. Our results are robust to controlling for various known priced factors and characteristics.

Suggested Citation

  • Raymond C. W. Leung & Yu-Man Tam, 2021. "Statistical Arbitrage Risk Premium by Machine Learning," Papers 2103.09987, arXiv.org.
  • Handle: RePEc:arx:papers:2103.09987
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.09987
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jeffrey Wurgler & Ekaterina Zhuravskaya, 2002. "Does Arbitrage Flatten Demand Curves for Stocks?," The Journal of Business, University of Chicago Press, vol. 75(4), pages 583-608, October.
    2. Fama, Eugene F & French, Kenneth R, 1992. "The Cross-Section of Expected Stock Returns," Journal of Finance, American Finance Association, vol. 47(2), pages 427-465, June.
    3. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    4. Hansen, Lars Peter & Hodrick, Robert J, 1980. "Forward Exchange Rates as Optimal Predictors of Future Spot Rates: An Econometric Analysis," Journal of Political Economy, University of Chicago Press, vol. 88(5), pages 829-853, October.
    5. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    6. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    7. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    8. Lamont, Owen A., 2001. "Economic tracking portfolios," Journal of Econometrics, Elsevier, vol. 105(1), pages 161-184, November.
    9. Amihud, Yakov, 2002. "Illiquidity and stock returns: cross-section and time-series effects," Journal of Financial Markets, Elsevier, vol. 5(1), pages 31-56, January.
    10. Bruce N. Lehmann, 1990. "Fads, Martingales, and Market Efficiency," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 105(1), pages 1-28.
    11. Evan Gatev & William N. Goetzmann & K. Geert Rouwenhorst, 2006. "Pairs Trading: Performance of a Relative-Value Arbitrage Rule," The Review of Financial Studies, Society for Financial Studies, vol. 19(3), pages 797-827.
    12. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    13. Hedegaard, Esben & Hodrick, Robert J., 2016. "Estimating the risk-return trade-off with overlapping data inference," Journal of Banking & Finance, Elsevier, vol. 67(C), pages 135-145.
    14. William F. Sharpe, 1964. "Capital Asset Prices: A Theory Of Market Equilibrium Under Conditions Of Risk," Journal of Finance, American Finance Association, vol. 19(3), pages 425-442, September.
    15. Huck, Nicolas, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," European Journal of Operational Research, Elsevier, vol. 278(1), pages 330-342.
    16. Newey, Whitney & West, Kenneth, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    17. Nicolas Huck, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," Post-Print hal-02143971, HAL.
    18. Jegadeesh, Narasimhan, 1990. "Evidence of Predictable Behavior of Security Returns," Journal of Finance, American Finance Association, vol. 45(3), pages 881-898, July.
    19. Christopher Krauss, 2017. "Statistical Arbitrage Pairs Trading Strategies: Review And Outlook," Journal of Economic Surveys, Wiley Blackwell, vol. 31(2), pages 513-545, April.
    20. Fama, Eugene F & MacBeth, James D, 1973. "Risk, Return, and Equilibrium: Empirical Tests," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 607-636, May-June.
    21. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    22. Marco Avellaneda & Jeong-Hyun Lee, 2010. "Statistical arbitrage in the US equities market," Quantitative Finance, Taylor & Francis Journals, vol. 10(7), pages 761-782.
    23. Lianjie Shu & Fangquan Shi & Guoliang Tian, 2020. "High-dimensional index tracking based on the adaptive elastic net," Quantitative Finance, Taylor & Francis Journals, vol. 20(9), pages 1513-1530, September.
    24. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mentor Lecaj & Granit Curri, 2021. "Advantages Of International Commercial Arbitration In Resolving The Commercial Contests," Perspectives of Law and Public Administration, Societatea de Stiinte Juridice si Administrative (Society of Juridical and Administrative Sciences), vol. 10(2), pages 96-101, June.
    2. Zhang, Tiantian & Nakagawa, Kei & Matsumoto, Ken'ichi, 2023. "Evaluating solar photovoltaic power efficiency based on economic dimensions for 26 countries using a three-stage data envelopment analysis," Applied Energy, Elsevier, vol. 335(C).
    3. Bennett, Davara L. & Webb, Calum J.R. & Mason, Kate E. & Schlüter, Daniela K. & Fahy, Katie & Alexiou, Alexandros & Wickham, Sophie & Barr, Ben & Taylor-Robinson, David, 2021. "Funding for preventative Children’s Services and rates of children becoming looked after: A natural experiment using longitudinal area-level data in England," Children and Youth Services Review, Elsevier, vol. 131(C).
    4. Sacker, Amanda & Lacey, Rebecca E. & Maughan, Barbara & Murray, Emily T., 2022. "Out-of-home care in childhood and socio-economic functioning in adulthood: ONS Longitudinal study 1971–2011," Children and Youth Services Review, Elsevier, vol. 132(C).
    5. Caratozzolo, Vincenzo & Misuri, Alessio & Cozzani, Valerio, 2022. "A generalized equipment vulnerability model for the quantitative risk assessment of horizontal vessels involved in Natech scenarios triggered by floods," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    6. Guillaume Coqueret, 2022. "Characteristics-driven returns in equilibrium," Papers 2203.07865, arXiv.org.
    7. Sojib Ahmed, M. & Rezwan Khan, M. & Haque, Anisul & Ryyan Khan, M., 2022. "Agrivoltaics analysis in a techno-economic framework: Understanding why agrivoltaics on rice will always be profitable," Applied Energy, Elsevier, vol. 323(C).
    8. Claudia ANTAL-VAIDA, 2021. "Basic Hyperparameters Tuning Methods for Classification Algorithms," Informatica Economica, Academy of Economic Studies - Bucharest, Romania, vol. 25(2), pages 64-74.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cakici, Nusret & Zaremba, Adam, 2021. "Liquidity and the cross-section of international stock returns," Journal of Banking & Finance, Elsevier, vol. 127(C).
    2. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    3. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    4. Amit Goyal, 2012. "Empirical cross-sectional asset pricing: a survey," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 26(1), pages 3-38, March.
    5. Pedro M. Mirete-Ferrer & Alberto Garcia-Garcia & Juan Samuel Baixauli-Soler & Maria A. Prats, 2022. "A Review on Machine Learning for Asset Management," Risks, MDPI, vol. 10(4), pages 1-46, April.
    6. Ma, Tian & Leong, Wen Jun & Jiang, Fuwei, 2023. "A latent factor model for the Chinese stock market," International Review of Financial Analysis, Elsevier, vol. 87(C).
    7. Hanauer, Matthias X. & Kalsbach, Tobias, 2023. "Machine learning and the cross-section of emerging market stock returns," Emerging Markets Review, Elsevier, vol. 55(C).
    8. Cakici, Nusret & Zaremba, Adam, 2022. "Salience theory and the cross-section of stock returns: International and further evidence," Journal of Financial Economics, Elsevier, vol. 146(2), pages 689-725.
    9. Bradrania, Reza & Veron, Jose Francisco, 2023. "The beta anomaly in the Australian stock market and the lottery demand," Pacific-Basin Finance Journal, Elsevier, vol. 77(C).
    10. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    11. Lu, Zhongjin & Malliaris, Steven & Qin, Zhongling, 2023. "Heterogeneous liquidity providers and night-minus-day return predictability," Journal of Financial Economics, Elsevier, vol. 148(3), pages 175-200.
    12. Committee, Nobel Prize, 2013. "Understanding Asset Prices," Nobel Prize in Economics documents 2013-1, Nobel Prize Committee.
    13. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    14. Xiang, Yun & He, Jiaxuan, 2022. "Pairs trading and asset pricing," Pacific-Basin Finance Journal, Elsevier, vol. 72(C).
    15. Cakici, Nusret & Zaremba, Adam, 2023. "Recency bias and the cross-section of international stock returns," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 84(C).
    16. Kristoffer Pons Bertelsen, 2022. "The Prior Adaptive Group Lasso and the Factor Zoo," CREATES Research Papers 2022-05, Department of Economics and Business Economics, Aarhus University.
    17. Xu, Zhongxiang & Chevapatrakul, Thanaset & Li, Xiafei, 2019. "Return asymmetry and the cross section of stock returns," Journal of International Money and Finance, Elsevier, vol. 97(C), pages 93-110.
    18. Chen, Ding & Guo, Biao & Zhou, Guofu, 2023. "Firm fundamentals and the cross-section of implied volatility shapes," Journal of Financial Markets, Elsevier, vol. 63(C).
    19. Hollstein, Fabian & Prokopczuk, Marcel & Wese Simen, Chardin, 2020. "Beta uncertainty," Journal of Banking & Finance, Elsevier, vol. 116(C).
    20. Han, Chulwoo & He, Zhaodong & Toh, Alenson Jun Wei, 2023. "Pairs trading via unsupervised learning," European Journal of Operational Research, Elsevier, vol. 307(2), pages 929-947.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2103.09987. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.