IDEAS home Printed from https://ideas.repec.org/p/tas/wpaper/29456384.html
   My bibliography  Save this paper

A Systematic Literature Review of Asset Pricing: Insights from AI and Big Data

Author

Abstract

This paper systematically reviews the role of big data and artificial intelligence (AI) in asset pricing, analysing 130 journal articles published between 1994 and 2024. We categorise the literature into three themes: AI in asset pricing, big data in asset pricing, and their integration. Publications have grown exponentially since 2019 suggesting structural changes in asset pricing research which we highlight using thematic analysis. The bibliometric analysis shows key trends in AI models for predictive analytics and factor analysis. We identify research gaps and call for adaptive regulatory frameworks to support the ethical use of AI and big data in financial modelling.

Suggested Citation

  • Barson, Zynobia & Ahadzie, Richard Mawulawoe & Daugaard, Dan & Vespignani, Joaquin, 2025. "A Systematic Literature Review of Asset Pricing: Insights from AI and Big Data," Working Papers 2025-03, University of Tasmania, Tasmanian School of Business and Economics.
  • Handle: RePEc:tas:wpaper:29456384
    as

    Download full text from publisher

    File URL: https://figshare.utas.edu.au/ndownloader/files/55918370/3
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    2. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    3. Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
    4. Joaquin Vespignani & Russell Smyth, 2024. "Artificial intelligence investments reduce risks to critical mineral supply," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    5. Kong, Xin-Bing & Liu, Zhi & Zhou, Wang, 2019. "A rank test for the number of factors with high-frequency data," Journal of Econometrics, Elsevier, vol. 211(2), pages 439-460.
    6. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    7. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    8. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    9. Liao Zhu & Sumanta Basu & Robert A. Jarrow & Martin T. Wells, 2020. "High-Dimensional Estimation, Basis Assets, and the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 10(04), pages 1-52, December.
    10. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    11. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    12. Jianqing Fan & Alex Furger & Dacheng Xiu, 2016. "Incorporating Global Industrial Classification Standard Into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator With High-Frequency Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 489-503, October.
    13. Fama, Eugene F & French, Kenneth R, 1996. "Multifactor Explanations of Asset Pricing Anomalies," Journal of Finance, American Finance Association, vol. 51(1), pages 55-84, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    2. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    3. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2020. "Estimation of large dimensional conditional factor models in finance," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 219-282, Elsevier.
    4. Dai, Chaoxing & Lu, Kun & Xiu, Dacheng, 2019. "Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data," Journal of Econometrics, Elsevier, vol. 208(1), pages 43-79.
    5. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    6. Matteo Bagnara, 2024. "Asset Pricing and Machine Learning: A critical review," Journal of Economic Surveys, Wiley Blackwell, vol. 38(1), pages 27-56, February.
    7. Li, Y-N. & Chen, J. & Linton, O., 2021. "Estimation of Common Factors for Microstructure Noise and Efficient Price in a High-frequency Dual Factor Model," Cambridge Working Papers in Economics 2150, Faculty of Economics, University of Cambridge.
    8. Tae-Hwy Lee & Ekaterina Seregina, 2024. "Optimal Portfolio Using Factor Graphical Lasso," Journal of Financial Econometrics, Oxford University Press, vol. 22(3), pages 670-695.
    9. Paul Handro & Bogdan Dima, 2024. "Analyzing Financial Markets Efficiency: Insights from a Bibliometric and Content Review," Journal of Financial Studies, Institute of Financial Studies, vol. 16(9), pages 119-175, May.
    10. Cakici, Nusret & Zaremba, Adam, 2021. "Liquidity and the cross-section of international stock returns," Journal of Banking & Finance, Elsevier, vol. 127(C).
    11. Lin William Cong & Guanhao Feng & Jingyu He & Xin He, 2022. "Growing the Efficient Frontier on Panel Trees," NBER Working Papers 30805, National Bureau of Economic Research, Inc.
    12. Cong, Lin William & Feng, Guanhao & He, Jingyu & He, Xin, 2025. "Growing the efficient frontier on panel trees," Journal of Financial Economics, Elsevier, vol. 167(C).
    13. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    14. Kwangmin Jung & Donggyu Kim & Seunghyeon Yu, 2022. "Next generation models for portfolio risk management: An approach using financial big data," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 89(3), pages 765-787, September.
    15. Zura Kakushadze, 2015. "Heterotic Risk Models," Papers 1508.04883, arXiv.org, revised Jan 2016.
    16. Zura Kakushadze & Willie Yu, 2016. "Statistical Risk Models," Papers 1602.08070, arXiv.org, revised Jan 2017.
    17. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2021. "Bridging factor and sparse models," Papers 2102.11341, arXiv.org, revised Sep 2022.
    18. Simon Hediger & Jeffrey Näf & Marc S. Paolella & Paweł Polak, 2023. "Heterogeneous tail generalized common factor modeling," Digital Finance, Springer, vol. 5(2), pages 389-420, June.
    19. Francisco Peñaranda & Enrique Sentana, 2024. "Portfolio management with big data," Working Papers wp2024_2411, CEMFI.
    20. Zura Kakushadze & Willie Yu, 2016. "Multifactor Risk Models and Heterotic CAPM," Papers 1602.04902, arXiv.org, revised Mar 2016.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • G12 - Financial Economics - - General Financial Markets - - - Asset Pricing; Trading Volume; Bond Interest Rates
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tas:wpaper:29456384. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oscar Pavlov (email available below). General contact details of provider: https://edirc.repec.org/data/dutasau.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.