IDEAS home Printed from https://ideas.repec.org/a/eee/pacfin/v78y2023ics0927538x23000148.html
   My bibliography  Save this article

Predicting stock splits using ensemble machine learning and SMOTE oversampling

Author

Listed:
  • Li, Ang
  • Liu, Mark
  • Sheather, Simon

Abstract

This study predicts stock splits using two ensemble machine learning techniques: gradient boosting machines (GBMs) and random forests (RFs). The goal is to form implementable portfolios based on positive predictions to generate abnormal returns. Since splits are rare events, we use SMOTE oversampling to synthesize new observations of splits in the sample to improve predictions. When predicting stock splits in the next quarter, GBM and RF achieve area under the receiver operating characteristic curve (AUC) scores of around 0.86 and 0.87, respectively. GBM and RF predictions generate monthly five-factor alphas (Fama and French, 2015) of 0.26% and 0.95% among stocks in the smallest size quintile. Three important features for predicting stock splits in both ensemble ML methods are current price levels, the ratio of current price to the price at last split, and stock returns in the past twelve months. When predicting stock splits in the next year, GBMs generate monthly five-factor alphas of 0.38% among small stocks.

Suggested Citation

  • Li, Ang & Liu, Mark & Sheather, Simon, 2023. "Predicting stock splits using ensemble machine learning and SMOTE oversampling," Pacific-Basin Finance Journal, Elsevier, vol. 78(C).
  • Handle: RePEc:eee:pacfin:v:78:y:2023:i:c:s0927538x23000148
    DOI: 10.1016/j.pacfin.2023.101948
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0927538X23000148
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.pacfin.2023.101948?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Nicolas P. B. Bollen & Veronika K. Pool, 2012. "Suspicious Patterns in Hedge Fund Returns and the Risk of Fraud," Review of Financial Studies, Society for Financial Studies, vol. 25(9), pages 2673-2702.
    2. Isil Erel & Léa H Stern & Chenhao Tan & Michael S Weisbach, 2021. "Selecting Directors Using Machine Learning," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3226-3264, National Bureau of Economic Research, Inc.
    3. Stefano Giglio & Yuan Liao & Dacheng Xiu, 2021. "Thousands of Alpha Tests," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3456, National Bureau of Economic Research, Inc.
    4. David Easley & Marcos López de Prado & Maureen O’Hara & Zhibai Zhang & Wei Jiang, 2021. "Microstructure in the Machine Age [The risk of machine learning]," Review of Financial Studies, Society for Financial Studies, vol. 34(7), pages 3316-3363.
    5. Ikenberry, David L. & Rankine, Graeme & Stice, Earl K., 1996. "What Do Stock Splits Really Signal?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 31(3), pages 357-375, September.
    6. Matthew T. Billett & Hui Xue, 2007. "The Takeover Deterrent Effect of Open Market Share Repurchases," Journal of Finance, American Finance Association, vol. 62(4), pages 1827-1850, August.
    7. Paul Schultz, 2000. "Stock Splits, Tick Size, and Sponsorship," Journal of Finance, American Finance Association, vol. 55(1), pages 429-450, February.
    8. Itay Goldstein & Chester S Spatt & Mao Ye, 2021. "Big Data in Finance," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3213-3225, National Bureau of Economic Research, Inc.
    9. Grinblatt, Mark S. & Masulis, Ronald W. & Titman, Sheridan, 1984. "The valuation effects of stock splits and stock dividends," Journal of Financial Economics, Elsevier, vol. 13(4), pages 461-490, December.
    10. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    11. Amihud, Yakov, 2002. "Illiquidity and stock returns: cross-section and time-series effects," Journal of Financial Markets, Elsevier, vol. 5(1), pages 31-56, January.
    12. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    13. Brennan, Michael J. & Copeland, Thomas E., 1988. "Stock splits, stock prices, and transaction costs," Journal of Financial Economics, Elsevier, vol. 22(1), pages 83-101, October.
    14. Kai Li & Feng Mai & Rui Shen & Xinyan Yan, 2021. "Measuring Corporate Culture Using Machine Learning," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3265-3315, National Bureau of Economic Research, Inc.
    15. Amber Anand & Mehrdad Samadi & Jonathan Sokobin & Kumar Venkataraman, 2021. "Institutional Order Handling and Broker-Affiliated Trading Venues [Performance of institutional trading desks: An analysis of persistence in trading costs]," Review of Financial Studies, Society for Financial Studies, vol. 34(7), pages 3364-3402.
    16. Malcolm Baker & Robin Greenwood & Jeffrey Wurgler, 2009. "Catering through Nominal Share Prices," Journal of Finance, American Finance Association, vol. 64(6), pages 2559-2590, December.
    17. Chitru S. Fernando & Srinivasan Krishnamurthy & Paul A. Spindt, 1999. "Is Share Price Related to Marketability? Evidence from Mutual Fund Share Splits," Financial Management, Financial Management Association, vol. 28(3), Fall.
    18. Lakonishok, Josef & Lev, Baruch, 1987. "Stock Splits and Stock Dividends: Why, Who, and When," Journal of Finance, American Finance Association, vol. 42(4), pages 913-932, September.
    19. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    20. Fama, Eugene F. & French, Kenneth R., 1997. "Industry costs of equity," Journal of Financial Economics, Elsevier, vol. 43(2), pages 153-193, February.
    21. Chemmanur, Thomas J. & Hu, Gang & Huang, Jiekun, 2015. "Institutional Investors and the Information Production Theory of Stock Splits," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 50(3), pages 413-445, June.
    22. Dimmock, Stephen G. & Gerken, William C., 2012. "Predicting fraud by investment managers," Journal of Financial Economics, Elsevier, vol. 105(1), pages 153-173.
    23. K. J. Martijn Cremers & Vinay B. Nair & Kose John, 2009. "Takeovers and the Cross-Section of Returns," The Review of Financial Studies, Society for Financial Studies, vol. 22(4), pages 1409-1445, April.
    24. Fama, Eugene F, et al, 1969. "The Adjustment of Stock Prices to New Information," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 10(1), pages 1-21, February.
    25. David Easley & Marcos López de Prado & Maureen O’Hara & Zhibai Zhang, 2021. "Microstructure in the Machine Age," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3316-3363, National Bureau of Economic Research, Inc.
    26. Amber Anand & Mehrdad Samadi & Jonathan Sokobin & Kumar Venkataraman, 2021. "Institutional Order Handling and Broker-Affiliated Trading Venues," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3364-3402, National Bureau of Economic Research, Inc.
    27. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    28. Zheng Tracy Ke & Bryan T. Kelly & Dacheng Xiu, 2019. "Predicting Returns With Text Data," NBER Working Papers 26186, National Bureau of Economic Research, Inc.
    29. Michael T. Maloney & J. Harold Mulherin, 1992. "The Effects of Splitting on the Ex: A Microstructure Reconciliation," Financial Management, Financial Management Association, vol. 21(4), Winter.
    30. Kai Li & Feng Mai & Rui Shen & Xinyan Yan, 2021. "Measuring Corporate Culture Using Machine Learning [Machine learning methods that economists should know about]," Review of Financial Studies, Society for Financial Studies, vol. 34(7), pages 3265-3315.
    31. Lamoureux, Christopher G & Poon, Percy, 1987. "The Market Reaction to Stock Splits," Journal of Finance, American Finance Association, vol. 42(5), pages 1347-1370, December.
    32. Itay Goldstein & Chester S. Spatt & Mao Ye, 2021. "Big Data in Finance," NBER Working Papers 28615, National Bureau of Economic Research, Inc.
    33. McNichols, Maureen & Dravid, Ajay, 1990. "Stock Dividends, Stock Splits, and Signaling," Journal of Finance, American Finance Association, vol. 45(3), pages 857-879, July.
    34. Itay Goldstein & Chester S Spatt & Mao Ye, 2021. "Big Data in Finance [Institutional order handling and broker-affiliated trading venues]," Review of Financial Studies, Society for Financial Studies, vol. 34(7), pages 3213-3225.
    35. Emmanuel O. Ogundimu, 2019. "Prediction of default probability by using statistical models for rare events," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1143-1162, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lin, Ji-Chai & Singh, Ajai K. & Yu, Wen, 2009. "Stock splits, trading continuity, and the cost of equity capital," Journal of Financial Economics, Elsevier, vol. 93(3), pages 474-489, September.
    2. Li, Fengfei & Lin, Ji-Chai & Lin, Tse-Chun & Shang, Longfei, 2023. "Behavioral bias, distorted stock prices, and stock splits," Journal of Banking & Finance, Elsevier, vol. 154(C).
    3. Li, Fengyu & Liu, Mark H. & Shi, Yongdong (Eric), 2017. "Institutional ownership around stock splits," Pacific-Basin Finance Journal, Elsevier, vol. 46(PA), pages 14-40.
    4. Birru, Justin & Wang, Baolian, 2016. "Nominal price illusion," Journal of Financial Economics, Elsevier, vol. 119(3), pages 578-598.
    5. Maria Chiara Iannino & Sergey Zhuk, 2020. "Signaling through Timing of Stock Splits," Discussion Paper Series, School of Economics and Finance 202009, School of Economics and Finance, University of St Andrews, revised 18 Jun 2021.
    6. Guo, Fang & Zhou, Kaiguo & Cai, Jinghan, 2008. "Stock splits, liquidity, and information asymmetry--An empirical study on Tokyo Stock Exchange," Journal of the Japanese and International Economies, Elsevier, vol. 22(3), pages 417-438, September.
    7. Kristina Minnick & Kartik Raman, 2014. "Why are Stock Splits Declining?," Financial Management, Financial Management Association International, vol. 43(1), pages 29-60, March.
    8. Ranjan D'Mello & Oranee Tawatnuntachai & Devrim Yaman, 2003. "Why Do Firms Issue Equity after Splitting Stocks?," The Financial Review, Eastern Finance Association, vol. 38(3), pages 323-350, August.
    9. Adam Zaremba & Jacob Koby Shemer, 2018. "Price-Based Investment Strategies," Springer Books, Springer, number 978-3-319-91530-2, November.
    10. Maretno A. Harjoto & Dongshin Kim & Indrarini Laksmana & Richard C. Walton, 2019. "Corporate social responsibility and stock split," Review of Quantitative Finance and Accounting, Springer, vol. 53(2), pages 575-600, August.
    11. Nihat Gumus & Ayse Caglayan Gumus, 2021. "Do stock splits matter for returns, volatility, and liquidity? New Evidence from Borsa Istanbul," International Journal of Research in Business and Social Science (2147-4478), Center for the Strategic Studies in Business and Finance, vol. 10(4), pages 467-478, June.
    12. Gow-Cheng Huang & Kartono Liano & Ming-Shiun Pan, 2011. "REIT Stock Splits and Liquidity Changes," The Journal of Real Estate Finance and Economics, Springer, vol. 43(4), pages 527-547, November.
    13. Leledakis, George N. & Papaioannou, George J. & Travlos, Nickolaos G. & Tsangarakis, Nickolaos V., 2009. "Stock splits in a neutral transaction cost environment: Evidence from the Athens Stock Exchange," Journal of Multinational Financial Management, Elsevier, vol. 19(1), pages 12-25, February.
    14. Sunil Mohanty & Doocheol Moon, 2007. "Disentangling the signalling and liquidity effects of stock splits," Applied Financial Economics, Taylor & Francis Journals, vol. 17(12), pages 979-987.
    15. Erik Devos & William B. Elliott & Richard S. Warr, 2018. "The Propensity to Split and CEO Compensation," Financial Management, Financial Management Association International, vol. 47(1), pages 105-129, March.
    16. Gow-Cheng Huang & Kartono Liano & Ming-Shiun Pan, 2006. "Do stock splits signal future profitability?," Review of Quantitative Finance and Accounting, Springer, vol. 26(4), pages 347-367, June.
    17. Perez, M. Fabricio & Shkilko, Andriy & Sokolov, Konstantin, 2015. "Factor models for binary financial data," Journal of Banking & Finance, Elsevier, vol. 61(S2), pages 177-188.
    18. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    19. Al-Yahyaee, Khamis Hamed, 2014. "Shareholder wealth effects of stock dividends in a unique environment," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 28(C), pages 66-81.
    20. Gow-Cheng Huang & Kartono Liano & Ming-Shiun Pan, 2015. "The effects of stock splits on stock liquidity," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 39(1), pages 119-135, January.

    More about this item

    Keywords

    Stock splits; Ensemble machine learning; Gradient boosting machines; Random forests; SHAP feature importance; Hyperparameter tuning; SMOTE oversampling;
    All these keywords.

    JEL classification:

    • G11 - Financial Economics - - General Financial Markets - - - Portfolio Choice; Investment Decisions
    • G14 - Financial Economics - - General Financial Markets - - - Information and Market Efficiency; Event Studies; Insider Trading
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:pacfin:v:78:y:2023:i:c:s0927538x23000148. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/pacfin .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.