IDEAS home Printed from https://ideas.repec.org/a/eee/pacfin/v94y2025ics0927538x25002835.html

Multi-factor portfolio optimization: A combined random Forest–AdaBoost model with cost-sensitive learning11This paper was supported by the National Natural Science Foundation of China (Nos. 71,871,071, 72071051); the Natural Science Foundation of Guangdong Province of China (Nos. 2025A1515010937, 2023A1515011354)

Author

Listed:
  • Yao, Haixiang
  • Wan, Chunzhuo

Abstract

This paper proposes a machine learning-driven multi-factor investment strategy, denoted as DE-CS-RFA, which integrates the Random Forest-AdaBoost (RFA) ensemble learning model, Cost-Sensitive (CS) learning, and the Differential Evolution algorithm (DE). The model utilizes 110 heterogeneous predictive features as input characteristics, eliminating redundant features via Kendall correlation analysis to enhance computational efficiency while comprehensively capturing market information. Subsequently, the Rank-Sum Ratio comprehensive evaluation method is employed to construct the initial investment universe and to develop an investment strategy based on the model's predicted data. Empirical results demonstrate that RFA outperforms other mainstream machine learning models on multiple evaluation metrics. Moreover, the simulation trading results indicate that the DE-CS-RFA model can effectively capture the market complexity and individual investor differences, enhancing the applicability and effectiveness of the investment strategy. Interpretability analysis further reveals the key factors influencing the stock price trends in the A-share market. Finally, robustness tests confirm that the DE-CS-RFA model can adapt to diverse financial market characteristics, holding potential to promote the widespread application of multi-factor investment strategies in the A-share market.

Suggested Citation

  • Yao, Haixiang & Wan, Chunzhuo, 2025. "Multi-factor portfolio optimization: A combined random Forest–AdaBoost model with cost-sensitive learning11This paper was supported by the National Natural Science Foundation of China (Nos. 71,871,071, 72071051); the Natural Science Foundation of Gua," Pacific-Basin Finance Journal, Elsevier, vol. 94(C).
  • Handle: RePEc:eee:pacfin:v:94:y:2025:i:c:s0927538x25002835
    DOI: 10.1016/j.pacfin.2025.102946
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0927538X25002835
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.pacfin.2025.102946?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Kwangwon Ahn & Linxiao Cong & Hanwool Jang & Daniel Sungyeon Kim, 2024. "Business cycle and herding behavior in stock returns: theory and evidence," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-14, December.
    2. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    3. Yin, Anwen, 2020. "Equity premium prediction and optimal portfolio decision with Bagging," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    5. De Bock, Koen W. & Coussement, Kristof & Lessmann, Stefan, 2020. "Cost-sensitive business failure prediction when misclassification costs are uncertain: A heterogeneous ensemble selection approach," European Journal of Operational Research, Elsevier, vol. 285(2), pages 612-630.
    6. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    7. Richard G. Sloan & Annika Yu Wang, 2025. "Predictable EPS growth and the performance of value investing," Review of Accounting Studies, Springer, vol. 30(1), pages 33-78, March.
    8. Koen W. de Bock & Kristof Coussement & Stefan Lessmann, 2020. "Cost-sensitive business failure prediction when misclassification costs are uncertain: A heterogeneous ensemble selection approach," Post-Print hal-02863245, HAL.
    9. Hurst, Gareth & Docherty, Paul, 2015. "Trend salience, investor behaviours and momentum profitability," Pacific-Basin Finance Journal, Elsevier, vol. 35(PB), pages 471-484.
    10. Yao, Haixiang & Xia, Shenghao & Liu, Hao, 2022. "Six-factor asset pricing and portfolio investment via deep learning: Evidence from Chinese stock market," Pacific-Basin Finance Journal, Elsevier, vol. 76(C).
    11. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    12. Fanyi Wang & Ruobing Zhang & Faraz Ahmed & Syed Mir Muhammed Shah, 2022. "Impact of investment behaviour on financial markets during COVID-19: a case of UK," Economic Research-Ekonomska Istraživanja, Taylor & Francis Journals, vol. 35(1), pages 2273-2291, December.
    13. Abbas, Ghulam & Bashir, Usman & Wang, Shouyang & Zebende, Gilney Figueira & Ishfaq, Muhammad, 2019. "The return and volatility nexus among stock market and macroeconomic fundamentals for China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 526(C).
    14. Dichtl, Hubert & Drobetz, Wolfgang & Otto, Tizian, 2023. "Forecasting Stock Market Crashes via Machine Learning," Journal of Financial Stability, Elsevier, vol. 65(C).
    15. Grudniewicz, Jan & Ślepaczuk, Robert, 2023. "Application of machine learning in algorithmic investment strategies on global stock markets," Research in International Business and Finance, Elsevier, vol. 66(C).
    16. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    17. Sami Ben Jabeur & Cheima Gharib & Salma Mefteh-Wali & Wissal Ben Arfi, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Post-Print hal-05238300, HAL.
    18. Baltussen, Guido & Da, Zhi & Lammers, Sten & Martens, Martin, 2021. "Hedging demand and market intraday momentum," Journal of Financial Economics, Elsevier, vol. 142(1), pages 377-403.
    19. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    20. Hanauer, Matthias X. & Jansen, Maarten & Swinkels, Laurens & Zhou, Weili, 2024. "Factor models for Chinese A-shares," International Review of Financial Analysis, Elsevier, vol. 91(C).
    21. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    22. Song, Yu & Akagi, Fumio, 2016. "Application of artificial neural network for the prediction of stock market returns: The case of the Japanese stock marketAuthor-Name: Qiu, Mingyue," Chaos, Solitons & Fractals, Elsevier, vol. 85(C), pages 1-7.
    23. Jiang, Rui & Wen, Conghua & Zhang, Ruonan & Cui, Yu, 2022. "Investor's herding behavior in Asian equity markets during COVID-19 period," Pacific-Basin Finance Journal, Elsevier, vol. 73(C).
    24. Sheng, Yankai & Qu, Yuanyu & Ma, Ding, 2024. "Stock price crash prediction based on multimodal data machine learning models," Finance Research Letters, Elsevier, vol. 62(PA).
    25. Wang, Zijun, 2021. "The high volume return premium and economic fundamentals," Journal of Financial Economics, Elsevier, vol. 140(1), pages 325-345.
    26. Davide Pettenuzzo & Riccardo Sabbatucci & Allan Timmermann, 2020. "Cash Flow News and Stock Price Dynamics," Journal of Finance, American Finance Association, vol. 75(4), pages 2221-2270, August.
    27. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    28. Zhi Yang & Zhao Fei & Jing Wang, 2024. "Research on the Correlation between the Exchange Rate of Offshore RMB and the Stock Index Futures," Mathematics, MDPI, vol. 12(5), pages 1-15, February.
    29. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    2. Rizwan Ullah & Muhammad Naveed Jan & Muhammad Tahir, 2025. "Unveiling the optimal factor model in Pakistan: a machine learning approach using support vector regression and extreme gradient boosting algorithms," Future Business Journal, Springer, vol. 11(1), pages 1-20, December.
    3. Victor DeMiguel & Javier Gil-Bazo & Francisco J. Nogales & André A. P. Santos, 2021. "Can machine learning help to select portfolios of mutual funds?," Economics Working Papers 1772, Department of Economics and Business, Universitat Pompeu Fabra.
    4. repec:bge:wpaper:1245 is not listed on IDEAS
    5. Yao, Haixiang & Xia, Shenghao & Liu, Hao, 2022. "Six-factor asset pricing and portfolio investment via deep learning: Evidence from Chinese stock market," Pacific-Basin Finance Journal, Elsevier, vol. 76(C).
    6. Li, Yihan, 2024. "Trading on trends: How the ordering of historical volume predicts Chinese stock returns?," International Review of Financial Analysis, Elsevier, vol. 95(PC).
    7. Cong Wang, 2024. "Stock return prediction with multiple measures using neural network models," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-34, December.
    8. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
    9. Obaid, Khaled & Pukthuanthong, Kuntara, 2022. "A picture is worth a thousand words: Measuring investor sentiment by combining machine learning and photos from news," Journal of Financial Economics, Elsevier, vol. 144(1), pages 273-297.
    10. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    11. Yan, Jingda & Yu, Jialin, 2023. "Cross-stock momentum and factor momentum," Journal of Financial Economics, Elsevier, vol. 150(2).
    12. Paul Handro & Bogdan Dima, 2024. "Analyzing Financial Markets Efficiency: Insights from a Bibliometric and Content Review," Journal of Financial Studies, Institute of Financial Studies, vol. 16(9), pages 119-175, May.
    13. DeMiguel, Victor & Gil-Bazo, Javier & Nogales, Francisco J. & Santos, André A.P., 2023. "Machine learning and fund characteristics help to select mutual funds with positive alpha," Journal of Financial Economics, Elsevier, vol. 150(3).
    14. Wu, Hongxu & Wang, Qiao & Li, Jianping & Deng, Zhibin, 2025. "Enhancing stock return prediction in the Chinese market: A GAN-based approach," Research in International Business and Finance, Elsevier, vol. 75(C).
    15. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble Learning for Sector Rotation: A Gradient-Free Framework," Papers 2304.09947, arXiv.org, revised Nov 2025.
    16. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    17. Allen Yikuan Huang & Zheqi Fan, 2026. "Beyond Prompting: An Autonomous Framework for Systematic Factor Investing via Agentic AI," Papers 2603.14288, arXiv.org, revised Apr 2026.
    18. Lioui, Abraham & Tarelli, Andrea, 2022. "Chasing the ESG factor," Journal of Banking & Finance, Elsevier, vol. 139(C).
    19. Min, Byoung-Kyu & Roh, Tai-Yong, 2025. "Can machine learning uncover abnormal returns in uncharted financial territories?," Pacific-Basin Finance Journal, Elsevier, vol. 94(C).
    20. Alona Bilokha & Mingying Cheng & Mengchuan Fu & Iftekhar Hasan, 2025. "Understanding CSR champions: a machine learning approach," Annals of Operations Research, Springer, vol. 347(1), pages 761-774, April.
    21. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:pacfin:v:94:y:2025:i:c:s0927538x25002835. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/pacfin .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.