IDEAS home Printed from https://ideas.repec.org/a/eee/pacfin/v94y2025ics0927538x25002720.html

Can machines learn Chinese mutual funds?

Author

Listed:
  • Wu, Haoran
  • Gao, Zhiwei
  • Nie, Boyang
  • Zhao, Binru

Abstract

This study applies machine learning techniques to predict mutual fund performance and construct high-performing portfolios in the Chinese mutual fund market. Using Random Forest, XGBoost, and LightGBM models, we frame fund selection as a classification problem, estimating each fund's probability of outperforming the market benchmark. Filtered fund characteristics serve as predictors, allowing us to build investable portfolios ranked by predicted outperformance likelihood. Empirical results reveal that machine learning-based portfolios consistently outperform the benchmark, with LightGBM delivering the strongest performance across key metrics. Furthermore, a flexible machine learning combination framework is introduced to improve predictive robustness and portfolio stability. These findings highlight the practical value of machine learning for fund selection and strategy design in complex financial environments.

Suggested Citation

  • Wu, Haoran & Gao, Zhiwei & Nie, Boyang & Zhao, Binru, 2025. "Can machines learn Chinese mutual funds?," Pacific-Basin Finance Journal, Elsevier, vol. 94(C).
  • Handle: RePEc:eee:pacfin:v:94:y:2025:i:c:s0927538x25002720
    DOI: 10.1016/j.pacfin.2025.102935
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0927538X25002720
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.pacfin.2025.102935?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Oleg Chuprinin & Denis Sosyura, 2018. "Family Descent as a Signal of Managerial Quality: Evidence from Mutual Funds," The Review of Financial Studies, Society for Financial Studies, vol. 31(10), pages 3756-3820.
    2. Alex Chinco & Adam D. Clark‐Joseph & Mao Ye, 2019. "Sparse Signals in the Cross‐Section of Returns," Journal of Finance, American Finance Association, vol. 74(1), pages 449-492, February.
    3. Grønborg, Niels S. & Lunde, Asger & Timmermann, Allan & Wermers, Russ, 2021. "Picking funds with confidence," Journal of Financial Economics, Elsevier, vol. 139(1), pages 1-28.
    4. Christopher S Jones & Haitao Mo, 2021. "Out-of-Sample Performance of Mutual Fund Predictors [Has U.S. corporate bond market liquidity deteriorated?]," The Review of Financial Studies, Society for Financial Studies, vol. 34(1), pages 149-193.
    5. Kang, Yanfei & Cao, Wei & Petropoulos, Fotios & Li, Feng, 2022. "Forecast with forecasts: Diversity matters," European Journal of Operational Research, Elsevier, vol. 301(1), pages 180-190.
    6. Laurent Barras & Patrick Gagliardini & Olivier Scaillet, 2022. "Skill, Scale, and Value Creation in the Mutual Fund Industry," Journal of Finance, American Finance Association, vol. 77(1), pages 601-638, February.
    7. Daniele Bianchi & Matthias Büchner & Andrea Tamoni, 2021. "Bond Risk Premiums with Machine Learning [Quadratic term structure models: Theory and evidence]," The Review of Financial Studies, Society for Financial Studies, vol. 34(2), pages 1046-1089.
    8. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    9. Leippold, Markus & Wang, Qian & Zhou, Wenyu, 2022. "Machine learning in the Chinese stock market," Journal of Financial Economics, Elsevier, vol. 145(2), pages 64-82.
    10. Yi, Li & Liu, Zilan & He, Lei & Qin, Zilong & Gan, Shunli, 2018. "Do Chinese mutual funds time the market?," Pacific-Basin Finance Journal, Elsevier, vol. 47(C), pages 1-19.
    11. Elton, Edwin J. & Gruber, Martin J. & Green, T. Clifton, 2007. "The Impact of Mutual Fund Family Membership on Investor Risk," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 42(2), pages 257-277, June.
    12. DeMiguel, Victor & Gil-Bazo, Javier & Nogales, Francisco J. & Santos, André A.P., 2023. "Machine learning and fund characteristics help to select mutual funds with positive alpha," Journal of Financial Economics, Elsevier, vol. 150(3).
    13. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    14. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    15. Ľuboš Pástor & Robert F. Stambaugh & Lucian A. Taylor, 2017. "Do Funds Make More When They Trade More?," Journal of Finance, American Finance Association, vol. 72(4), pages 1483-1528, August.
    16. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    17. Zhou, Dong-hai & Liu, Xiao-xing, 2024. "Does systemic risk in the fund markets predict future economic downturns?," International Review of Financial Analysis, Elsevier, vol. 92(C).
    18. Laurent Barras & Olivier Scaillet & Russ Wermers, 2010. "False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas," Journal of Finance, American Finance Association, vol. 65(1), pages 179-216, February.
    19. Liu, Jianan & Stambaugh, Robert F. & Yuan, Yu, 2019. "Size and value in China," Journal of Financial Economics, Elsevier, vol. 134(1), pages 48-69.
    20. Robert Kosowski & Allan Timmermann & Russ Wermers & Hal White, 2006. "Can Mutual Fund “Stars” Really Pick Stocks? New Evidence from a Bootstrap Analysis," Journal of Finance, American Finance Association, vol. 61(6), pages 2551-2595, December.
    21. Ghoul, Sadok El & Karoui, Aymen, 2022. "Fund performance and social responsibility: New evidence using social active share and social tracking error," Journal of Banking & Finance, Elsevier, vol. 143(C).
    22. Zhang, Xili & Zheng, Yiran & Lien, Donald & Yu, Xiaojian, 2024. "Can mutual fund investors benefit from volatility managing? Evidence from China," Pacific-Basin Finance Journal, Elsevier, vol. 83(C).
    23. K. J. Martijn Cremers & Antti Petajisto, 2009. "How Active Is Your Fund Manager? A New Measure That Predicts Performance," The Review of Financial Studies, Society for Financial Studies, vol. 22(9), pages 3329-3365, September.
    24. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    25. Evans, Richard & Gómez, Juan-Pedro & Ma, Linlin & Tang, Yuehua, 2024. "Peer Versus Pure Benchmarks in the Compensation of Mutual Fund Managers," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 59(7), pages 3101-3138, November.
    26. Whitney Newey & Kenneth West, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    27. Christopher S Jones & Haitao Mo & Lauren Cohen, 2021. "Out-of-Sample Performance of Mutual Fund Predictors," Review of Economic Studies, Oxford University Press, vol. 34(1), pages 149-193.
    28. Pástor, Ľuboš & Stambaugh, Robert F. & Taylor, Lucian A., 2015. "Scale and skill in active management," Journal of Financial Economics, Elsevier, vol. 116(1), pages 23-45.
    29. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    30. Bai, John (Jianqiu) & Ma, Linlin & Mullally, Kevin A. & Solomon, David H., 2019. "What a difference a (birth) month makes: The relative age effect and fund manager performance," Journal of Financial Economics, Elsevier, vol. 132(1), pages 200-221.
    31. Zhu, Min, 2018. "Informative fund size, managerial skill, and investor rationality," Journal of Financial Economics, Elsevier, vol. 130(1), pages 114-134.
    32. David E. Rapach & Jack K. Strauss & Guofu Zhou, 2010. "Out-of-Sample Equity Premium Prediction: Combination Forecasts and Links to the Real Economy," The Review of Financial Studies, Society for Financial Studies, vol. 23(2), pages 821-862, February.
    33. Yu, Xiaojian & Liu, Jianlin & Lien, Donald, 2023. "A new measure of fund window dressing and its application to Chinese mutual fund market," The Quarterly Review of Economics and Finance, Elsevier, vol. 89(C), pages 63-72.
    34. Wenbo Wu & Jiaqi Chen & Zhibin (Ben) Yang & Michael L. Tindall, 2021. "A Cross-Sectional Machine Learning Approach for Hedge Fund Return Prediction and Selection," Management Science, INFORMS, vol. 67(7), pages 4577-4601, July.
    35. Gordon J. Alexander & Gjergji Cici & Scott Gibson, 2007. "Does Motivation Matter When Assessing Trade Performance? An Analysis of Mutual Funds," The Review of Financial Studies, Society for Financial Studies, vol. 20(1), pages 125-150, January.
    36. Martijn Cremers & Antti Petajisto, 2006. "How Active is Your Fund Manager? A New Measure That Predicts Performance," Yale School of Management Working Papers amz2370, Yale School of Management, revised 01 May 2009.
    37. Bryan Kelly & Semyon Malamud & Kangying Zhou, 2024. "The Virtue of Complexity in Return Prediction," Journal of Finance, American Finance Association, vol. 79(1), pages 459-503, February.
    38. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    39. Li, Zhiyong & Rao, Xiao, 2023. "Exploring the zoo of predictors for mutual fund performance in China," Pacific-Basin Finance Journal, Elsevier, vol. 77(C).
    40. Marcin Kacperczyk & Clemens Sialm & Lu Zheng, 2005. "On the Industry Concentration of Actively Managed Equity Mutual Funds," Journal of Finance, American Finance Association, vol. 60(4), pages 1983-2011, August.
    41. Christopher Krauss & Anh Do & Nicolas Huck, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," Post-Print hal-01768895, HAL.
    42. Bai, John Jianqiu & Tang, Yuehua & Wan, Chi & Yüksel, H. Zafer, 2022. "Fund manager skill in an era of globalization: Offshore concentration and fund performance," Journal of Financial Economics, Elsevier, vol. 145(2), pages 18-40.
    43. Jiang, Jinglin & Liao, Li & Wang, Zhengwei & Xiang, Hongyu, 2020. "Financial literacy and retail investors' financial welfare: Evidence from mutual fund investment outcomes in China," Pacific-Basin Finance Journal, Elsevier, vol. 59(C).
    44. Gompers, Paul A. & Gornall, Will & Kaplan, Steven N. & Strebulaev, Ilya A., 2020. "How do venture capitalists make decisions?," Journal of Financial Economics, Elsevier, vol. 135(1), pages 169-190.
    45. Sensoy, Berk A., 2009. "Performance evaluation and self-designated benchmark indexes in the mutual fund industry," Journal of Financial Economics, Elsevier, vol. 92(1), pages 25-39, April.
    46. Wang, Xiaoqian & Hyndman, Rob J. & Li, Feng & Kang, Yanfei, 2023. "Forecast combinations: An over 50-year review," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1518-1547.
    47. Martijn Cremers & Antti Petajisto, 2006. "How Active is Your Fund Manager? A New Measure That Predicts Performance," Yale School of Management Working Papers amz2370, Yale School of Management, revised 01 May 2009.
    48. Büchner, Matthias & Kelly, Bryan, 2022. "A factor model for option returns," Journal of Financial Economics, Elsevier, vol. 143(3), pages 1140-1161.
    49. Paul Irvine & Jeong Ho (John) Kim & Jue Ren, 2024. "The Beta Anomaly and Mutual Fund Performance," Management Science, INFORMS, vol. 70(1), pages 143-163, January.
    50. Kaniel, Ron & Lin, Zihan & Pelger, Markus & Van Nieuwerburgh, Stijn, 2023. "Machine-learning the skill of mutual fund managers," Journal of Financial Economics, Elsevier, vol. 150(1), pages 94-138.
    51. Eugene F. Fama & Kenneth R. French, 2010. "Luck versus Skill in the Cross‐Section of Mutual Fund Returns," Journal of Finance, American Finance Association, vol. 65(5), pages 1915-1947, October.
    52. Ron Kaniel & Pingle Wang, 2025. "Unmasking Mutual Fund Derivative Use," The Review of Financial Studies, Society for Financial Studies, vol. 38(4), pages 1120-1166.
    53. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    54. Timothy Riley & Qing Yan, 2022. "Maximum Drawdown as Predictor of Mutual Fund Performance and Flows," Financial Analysts Journal, Taylor & Francis Journals, vol. 78(4), pages 59-76, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Victor DeMiguel & Javier Gil-Bazo & Francisco J. Nogales & André A. P. Santos, 2021. "Can machine learning help to select portfolios of mutual funds?," Economics Working Papers 1772, Department of Economics and Business, Universitat Pompeu Fabra.
    2. repec:bge:wpaper:1245 is not listed on IDEAS
    3. Li, Zhiyong & Rao, Xiao, 2023. "Exploring the zoo of predictors for mutual fund performance in China," Pacific-Basin Finance Journal, Elsevier, vol. 77(C).
    4. DeMiguel, Victor & Gil-Bazo, Javier & Nogales, Francisco J. & Santos, André A.P., 2023. "Machine learning and fund characteristics help to select mutual funds with positive alpha," Journal of Financial Economics, Elsevier, vol. 150(3).
    5. Cuthbertson, Keith & Nitzsche, Dirk & O'Sullivan, Niall, 2022. "Mutual fund performance persistence: Factor models and portfolio size," International Review of Financial Analysis, Elsevier, vol. 81(C).
    6. Kaniel, Ron & Lin, Zihan & Pelger, Markus & Van Nieuwerburgh, Stijn, 2023. "Machine-learning the skill of mutual fund managers," Journal of Financial Economics, Elsevier, vol. 150(1), pages 94-138.
    7. Li, Bin & Rossi, Alberto G. & Yan, Xuemin (Sterling) & Zheng, Lingling, 2025. "Machine learning from a “Universe” of signals: The role of feature engineering," Journal of Financial Economics, Elsevier, vol. 172(C).
    8. Livingston, Miles & Yao, Ping & Zhou, Lei, 2019. "The volatility of mutual fund performance," Journal of Economics and Business, Elsevier, vol. 104(C), pages 1-1.
    9. Changeun Kim & Younwoo Jeong & Bong-Gyu Jang, 2025. "Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model," Papers 2512.16251, arXiv.org, revised Apr 2026.
    10. Malakhov, Alexey & Riley, Timothy B. & Yan, Qing, 2024. "Do hedge funds bet against beta?," International Review of Economics & Finance, Elsevier, vol. 93(PA), pages 1507-1525.
    11. Keith Cuthbertson & Dirk Nitzsche & Niall O’Sullivan, 2023. "UK mutual funds: performance persistence and portfolio size," Journal of Asset Management, Palgrave Macmillan, vol. 24(4), pages 284-298, July.
    12. Bai, John Jianqiu & Tang, Yuehua & Wan, Chi & Yüksel, H. Zafer, 2022. "Fund manager skill in an era of globalization: Offshore concentration and fund performance," Journal of Financial Economics, Elsevier, vol. 145(2), pages 18-40.
    13. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    14. Russ Wermers & Tong Yao & Jane Zhao, 2012. "Forecasting Stock Returns Through an Efficient Aggregation of Mutual Fund Holdings," The Review of Financial Studies, Society for Financial Studies, vol. 25(12), pages 3490-3529.
    15. Hanauer, Matthias X. & Kalsbach, Tobias, 2023. "Machine learning and the cross-section of emerging market stock returns," Emerging Markets Review, Elsevier, vol. 55(C).
    16. Cakici, Nusret & Shahzad, Syed Jawad Hussain & Będowska-Sójka, Barbara & Zaremba, Adam, 2024. "Machine learning and the cross-section of cryptocurrency returns," International Review of Financial Analysis, Elsevier, vol. 94(C).
    17. Christiansen, Charlotte & Grønborg, Niels S. & Nielsen, Ole L., 2020. "Mutual fund selection for realistically short samples," Journal of Empirical Finance, Elsevier, vol. 55(C), pages 218-240.
    18. Saketh Aleti & Tim Bollerslev & Mathias Siggaard, 2025. "Intraday Market Return Predictability Culled from the Factor Zoo," Management Science, INFORMS, vol. 71(9), pages 7731-7751, September.
    19. Cuthbertson, Keith & Nitzsche, Dirk & O'Sullivan, Niall, 2016. "A review of behavioural and management effects in mutual fund performance," International Review of Financial Analysis, Elsevier, vol. 44(C), pages 162-176.
    20. Cong Wang, 2024. "Stock return prediction with multiple measures using neural network models," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-34, December.
    21. Valentin Dimitrov & Prem C. Jain, 2026. "Yes, Virginia, there are superstar money managers," Review of Quantitative Finance and Accounting, Springer, vol. 66(2), pages 867-886, February.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • G11 - Financial Economics - - General Financial Markets - - - Portfolio Choice; Investment Decisions
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • G23 - Financial Economics - - Financial Institutions and Services - - - Non-bank Financial Institutions; Financial Instruments; Institutional Investors

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:pacfin:v:94:y:2025:i:c:s0927538x25002720. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/pacfin .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.