Printed from https://ideas.repec.org/p/arx/papers/2508.02253.html

Interpretable Factors of Firm Characteristics

Authors

  • Yuxiao Jiao
  • Guofu Zhou
  • Wu Zhu
  • Yingzi Zhu

Abstract

We develop a new framework for constructing factors from firm characteristics that balances statistical efficiency and economic interpretability. Instead of treating all characteristics equally, our method groups related characteristics and derives one factor per group. The grouping combines economic intuition with data-driven clustering. Applied to the instrumented principal component analysis (IPCA) model of Kelly et al. (2019), our approach yields economically meaningful factors that match or exceed standard IPCA in pricing performance. Using the 94 characteristics from Gu et al. (2020), we show that our parsimonious, transparent factors outperform benchmarks in out-of-sample tests, demonstrating the value of embedding economic structure into statistical modeling.
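
As a rough illustration of the "one factor per group" idea, the sketch below builds a long-short, characteristic-managed portfolio return for each group of characteristics in a single period. It is not the paper's IPCA-based estimator: the function name `group_factor_returns`, the equal-weighted composite signal, the dollar-neutral scaling, and the example groups and characteristic names (`bm`, `ep`, `mom12`) are all assumptions made for illustration, and the grouping is taken as given rather than produced by clustering.

```python
from __future__ import annotations

from statistics import mean


def group_factor_returns(
    chars: dict[str, dict[str, float]],
    returns: dict[str, float],
    groups: dict[str, list[str]],
) -> dict[str, float]:
    """One-period factor return per characteristic group (illustrative only).

    chars   : firm -> {characteristic name -> standardized value}
    returns : firm -> next-period excess return
    groups  : group name -> list of characteristic names (hypothetical grouping)
    """
    firms = sorted(returns)
    factors: dict[str, float] = {}
    for name, members in groups.items():
        # Composite signal: equal-weighted average of the group's characteristics.
        signal = {f: mean(chars[f][c] for c in members) for f in firms}
        # Demean across firms so the portfolio is zero-investment.
        mu = mean(signal.values())
        weights = {f: signal[f] - mu for f in firms}
        # Scale so the long leg sums to +1 and the short leg to -1.
        gross = sum(abs(w) for w in weights.values())
        if gross == 0:
            # No cross-sectional dispersion in the signal: no position.
            factors[name] = 0.0
            continue
        factors[name] = sum(2 * weights[f] / gross * returns[f] for f in firms)
    return factors


# Hypothetical two-firm example with two groups.
example_chars = {
    "A": {"bm": 0.5, "ep": 0.3, "mom12": -0.5},
    "B": {"bm": -0.5, "ep": -0.3, "mom12": 0.5},
}
example_returns = {"A": 0.02, "B": -0.01}
example_groups = {"value": ["bm", "ep"], "momentum": ["mom12"]}

print(group_factor_returns(example_chars, example_returns, example_groups))
```

Each factor maps back to a named group of characteristics, which is the sense in which grouped factors stay interpretable. In a fuller treatment the group assignment would come from combining economic priors with a clustering step, and the factor weights would be estimated rather than fixed.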

Suggested Citation

  • Yuxiao Jiao & Guofu Zhou & Wu Zhu & Yingzi Zhu, 2025. "Interpretable Factors of Firm Characteristics," Papers 2508.02253, arXiv.org.
  • Handle: RePEc:arx:papers:2508.02253

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2508.02253
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Luyang Chen & Markus Pelger & Jason Zhu, 2024. "Deep Learning in Asset Pricing," Management Science, INFORMS, vol. 70(2), pages 714-750, February.
    2. Bryan T. Kelly & Seth Pruitt & Yinan Su, 2019. "Characteristics are covariances: A unified model of risk and return," Journal of Financial Economics, Elsevier, vol. 134(3), pages 501-524.
    3. Lin William Cong & Tengyuan Liang & Xiao Zhang & Wu Zhu, 2024. "Textual Factors: A Scalable, Interpretable, and Data-driven Approach to Analyzing Unstructured Information," NBER Working Papers 33168, National Bureau of Economic Research, Inc.
    4. R. David McLean & Jeffrey Pontiff, 2016. "Does Academic Research Destroy Stock Return Predictability?," Journal of Finance, American Finance Association, vol. 71(1), pages 5-32, February.
    5. Serhiy Kozak & Stefan Nagel & Shrihari Santosh, 2020. "Shrinking the cross-section," Journal of Financial Economics, Elsevier, vol. 135(2), pages 271-292.
    6. Kewei Hou & Chen Xue & Lu Zhang, 2015. "Digesting Anomalies: An Investment Approach," The Review of Financial Studies, Society for Financial Studies, vol. 28(3), pages 650-705.
    7. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    8. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    9. Eugene F. Fama & Kenneth R. French, 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    10. Kent Daniel & Lira Mota & Simon Rottke & Tano Santos, 2020. "The Cross-Section of Risk and Returns," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 1927-1979.
    11. Siddhartha Chib & Lingxiao Zhao & Guofu Zhou, 2024. "Winners from Winners: A Tale of Risk Factors," Management Science, INFORMS, vol. 70(1), pages 396-414, January.
    12. Eugene F. Fama & Kenneth R. French, 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    13. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    14. John H. Cochrane, 2011. "Presidential Address: Discount Rates," Journal of Finance, American Finance Association, vol. 66(4), pages 1047-1108, August.
    15. Whitney Newey & Kenneth West, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    16. Siddhartha Chib & Xiaming Zeng & Lingxiao Zhao, 2020. "On Comparing Asset Pricing Models," Journal of Finance, American Finance Association, vol. 75(1), pages 551-577, February.
    17. Yufeng Han & Ai He & David E Rapach & Guofu Zhou, 2024. "Cross-sectional expected returns: new Fama–MacBeth regressions in the era of machine learning," Review of Finance, European Finance Association, vol. 28(6), pages 1807-1831.
    18. Victor DeMiguel & Alberto Martín-Utrera & Francisco J Nogales & Raman Uppal, 2020. "A Transaction-Cost Perspective on the Multitude of Firm Characteristics," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2180-2222.
    19. Tarun Chordia & Amit Goyal & Alessio Saretto, 2020. "Anomalies and False Rejections," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2134-2179.
    20. Berardino Palazzo, 2012. "Cash holdings, risk, and expected returns," Journal of Financial Economics, Elsevier, vol. 104(1), pages 162-185.
    21. Gustavo Grullon & Evgeny Lyandres & Alexei Zhdanov, 2012. "Real Options, Volatility, and Stock Returns," Journal of Finance, American Finance Association, vol. 67(4), pages 1499-1537, August.
    22. Andrew J. Patton & Brian M. Weller, 2020. "What you see is not what you get: The costs of trading market anomalies," Journal of Financial Economics, Elsevier, vol. 137(2), pages 515-549.
    23. Matthias Büchner & Bryan Kelly, 2022. "A factor model for option returns," Journal of Financial Economics, Elsevier, vol. 143(3), pages 1140-1161.
    24. Gregory Connor & Robert A. Korajczyk, 1986. "Performance measurement with the arbitrage pricing theory: A new framework for analysis," Journal of Financial Economics, Elsevier, vol. 15(3), pages 373-394, March.
    25. Doron Avramov & Si Cheng & Lior Metzker & Stefan Voigt, 2023. "Integrating Factor Models," Journal of Finance, American Finance Association, vol. 78(3), pages 1593-1646, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    2. Fieberg, Christian & Liedtke, Gerrit & Zaremba, Adam & Cakici, Nusret, 2025. "A factor model for the cross-section of country equity risk premia," Journal of Banking & Finance, Elsevier, vol. 171(C).
    3. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    4. Matteo Bagnara, 2024. "Asset Pricing and Machine Learning: A critical review," Journal of Economic Surveys, Wiley Blackwell, vol. 38(1), pages 27-56, February.
    5. G Andrew Karolyi & Stijn Van Nieuwerburgh, 2020. "New Methods for the Cross-Section of Returns," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 1879-1890.
    6. Guilherme V. Moura & André P. Santos & Hudson S. Torrent, 2025. "Variable selection for minimum-variance portfolios," Papers 2508.14986, arXiv.org.
    7. Langlois, Hugues, 2023. "What matters in a characteristic?," Journal of Financial Economics, Elsevier, vol. 149(1), pages 52-72.
    8. Baba-Yara, Fahiz & Boons, Martijn & Tamoni, Andrea, 2024. "Persistent and transitory components of firm characteristics: Implications for asset pricing," Journal of Financial Economics, Elsevier, vol. 154(C).
    9. Doron Avramov & Si Cheng & Lior Metzker & Stefan Voigt, 2023. "Integrating Factor Models," Journal of Finance, American Finance Association, vol. 78(3), pages 1593-1646, June.
    10. Feng, Guanhao & He, Jingyu, 2022. "Factor investing: A Bayesian hierarchical approach," Journal of Econometrics, Elsevier, vol. 230(1), pages 183-200.
    11. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.
    12. Bryzgalova, Svetlana & Huang, Jiantao & Julliard, Christian, 2023. "Bayesian solutions for the factor zoo: we just ran two quadrillion models," LSE Research Online Documents on Economics 126151, London School of Economics and Political Science, LSE Library.
    13. Wolfgang Drobetz & Tizian Otto, 2021. "Empirical asset pricing via machine learning: evidence from the European stock market," Journal of Asset Management, Palgrave Macmillan, vol. 22(7), pages 507-538, December.
    14. Smith, Simon C., 2022. "Time-variation, multiple testing, and the factor zoo," International Review of Financial Analysis, Elsevier, vol. 84(C).
    15. Chen, Ding & Guo, Biao & Zhou, Guofu, 2023. "Firm fundamentals and the cross-section of implied volatility shapes," Journal of Financial Markets, Elsevier, vol. 63(C).
    16. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    17. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    18. Andrew Y. Chen & Tom Zimmermann, 2022. "Open Source Cross-Sectional Asset Pricing," Critical Finance Review, now publishers, vol. 11(2), pages 207-264, May.
    19. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    20. De Nard, Gianluca & Zhao, Zhao, 2022. "A large-dimensional test for cross-sectional anomalies: Efficient sorting revisited," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 654-676.
