IDEAS home Printed from https://ideas.repec.org/a/eee/pacfin/v90y2025ics0927538x25000022.html
   My bibliography  Save this article

Predicting financial fraud in Chinese listed companies: An enterprise portrait and machine learning approach

Author

Listed:
  • Zhang, Zejun
  • Wang, Zhao
  • Cai, Lixin

Abstract

Financial fraud of listed companies is a frequent problem in the capital market. Due to factors such as information asymmetry and inadequate regulation, financial fraud severely restricts stakeholders' capital allocation behavior and hinders the sustainable development of the capital market. However, existing research lacks systematic and quantitative insights into the characteristics of firms involved in financial fraud, making it difficult to achieve quantitative identification of most such firms. This limitation arises from a predominant focus on the causal relationships between various financial indicators and financial fraud. In this paper, we integrate machine learning and enterprise portrait methods, using listed companies in the Chinese capital market as research subjects to predict corporate financial fraud. Firstly, a comprehensive system of indicators is established, covering seven dimensions: basic corporate information, profitability, solvency, operating efficiency, capital structure, corporate governance, and emotional attitude. Subsequently, the feature visualization portrait is created using Gaussian mixture model (GMM) clustering and label classification, while the predictive role of multidimensional enterprise portrait features in assessing the risk of corporate financial fraud is examined. The results indicate that unstructured indicators, such as Management Discussion and Analysis (MD&A), can significantly enhance predictive capability for corporate financial fraud. The SHapley Additive exPlanations (SHAP) method is introduced to reveal the influencing factors and characteristics of financial fraud. The empirical findings show that firms involved in financial fraud typically exhibit characteristics such as shorter listing times, weaker solvency and operating efficiency, higher capital structure, and poor corporate governance ability. Moreover, the XGBoost model demonstrates superior predictive performance among various models. The findings of this study provide a new perspective for in-depth exploration of the impact mechanisms of financial fraud and related regulatory warnings. These findings contribute to enhancing the effectiveness of governance and the capital allocation function within the capital market.

Suggested Citation

  • Zhang, Zejun & Wang, Zhao & Cai, Lixin, 2025. "Predicting financial fraud in Chinese listed companies: An enterprise portrait and machine learning approach," Pacific-Basin Finance Journal, Elsevier, vol. 90(C).
  • Handle: RePEc:eee:pacfin:v:90:y:2025:i:c:s0927538x25000022
    DOI: 10.1016/j.pacfin.2025.102665
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0927538X25000022
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.pacfin.2025.102665?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dan Amiram & Zahn Bozanic & James D. Cox & Quentin Dupont & Jonathan M. Karpoff & Richard Sloan, 2018. "Financial reporting fraud and other forms of misconduct: a multidisciplinary review of the literature," Review of Accounting Studies, Springer, vol. 23(2), pages 732-783, June.
    2. Lopez-Gracia, Jose & Aybar-Arias, Cristina, 2000. "An Empirical Approach to the Financial Behaviour of Small and Medium Sized Companies," Small Business Economics, Springer, vol. 14(1), pages 55-63, February.
    3. Li, Jing & Li, Nan & Xia, Tongshui & Guo, Jinjin, 2023. "Textual analysis and detection of financial fraud: Evidence from Chinese manufacturing firms," Economic Modelling, Elsevier, vol. 126(C).
    4. Zhang, Yi & Liu, Tianxiang & Li, Weiping, 2024. "Corporate fraud detection based on linguistic readability vector: Application to financial companies in China," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    5. Joseph P. Gaspar & Redona Methasani & Maurice E. Schweitzer, 2022. "Emotional Intelligence and Deception: A Theoretical Model and Propositions," Journal of Business Ethics, Springer, vol. 177(3), pages 567-584, May.
    6. Jonathan M. Karpoff & D. Scott Lee & Gerald S. Martin, 2014. "The Consequences to Managers for Financial Misrepresentation," Springer Books, in: Roberto Pietra & Stuart McLeay & Joshua Ronen (ed.), Accounting and Regulation, edition 127, chapter 0, pages 339-375, Springer.
    7. Yi Jiang & Stewart Jones, 2018. "Corporate distress prediction in China: a machine learning approach," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 58(4), pages 1063-1109, December.
    8. Karpoff, Jonathan M., 2021. "The future of financial fraud," Journal of Corporate Finance, Elsevier, vol. 66(C).
    9. Chen, Yuh-Jen & Liou, Wan-Ching & Chen, Yuh-Min & Wu, Jyun-Han, 2019. "Fraud detection for financial statements of business groups," International Journal of Accounting Information Systems, Elsevier, vol. 32(C), pages 1-23.
    10. Md Jahidur Rahman & Hongtao Zhu, 2023. "Predicting accounting fraud using imbalanced ensemble learning classifiers – evidence from China," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(3), pages 3455-3486, September.
    11. Patricia M. Dechow & Weili Ge & Chad R. Larson & Richard G. Sloan, 2011. "Predicting Material Accounting Misstatements," Contemporary Accounting Research, John Wiley & Sons, vol. 28(1), pages 17-82, March.
    12. Xinchun Liu & Miaochao Chen, 2021. "Empirical Analysis of Financial Statement Fraud of Listed Companies Based on Logistic Regression and Random Forest Algorithm," Journal of Mathematics, Hindawi, vol. 2021, pages 1-9, December.
    13. Qingbo Yuan & Yunyan Zhang & Steven Cahan, 2016. "The real effects of corporate fraud: evidence from class action lawsuits," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 56(3), pages 879-911, September.
    14. Davidson, Robert H., 2022. "Who did it matters: Executive equity compensation and financial reporting fraud," Journal of Accounting and Economics, Elsevier, vol. 73(2).
    15. Quentin Dupont & Jonathan M. Karpoff, 2020. "The Trust Triangle: Laws, Reputation, and Culture in Empirical Finance Research," Journal of Business Ethics, Springer, vol. 163(2), pages 217-238, May.
    16. Belinna Bai & Jerome Yen & Xiaoguang Yang, 2008. "False Financial Statements: Characteristics Of China'S Listed Companies And Cart Detecting Approach," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 7(02), pages 339-359.
    17. Shih, Kuang Hsun & Cheng, Ching Chan & Wang, Yi Hsien, 2011. "Financial Information Fraud Risk Warning for Manufacturing Industry - Using Logistic Regression and Neural Network," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(1), pages 54-71, March.
    18. Sun, Xiaojun & Lei, Yalin, 2021. "Research on financial early warning of mining listed companies based on BP neural network model," Resources Policy, Elsevier, vol. 73(C).
    19. Xi Chen & Yang Ha (Tony) Cho & Yiwei Dou & Baruch Lev, 2022. "Predicting Future Earnings Changes Using Machine Learning and Detailed Financial Data," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 467-515, May.
    20. Messod D. Beneish, 1999. "The Detection of Earnings Manipulation," Financial Analysts Journal, Taylor & Francis Journals, vol. 55(5), pages 24-36, September.
    21. Qian, Yilei & Wang, Feng & Zhang, Muyang & Zhong, Ninghua, 2024. "Political uncertainty, bank loans, and corporate behavior: New investigation with machine learning," Pacific-Basin Finance Journal, Elsevier, vol. 87(C).
    22. Zhitao, Wang & Xiang, Ma, 2023. "Financial mismatch on corporate debt default risk: Evidence from China," Pacific-Basin Finance Journal, Elsevier, vol. 80(C).
    23. Bo Huang & Xiao Yao & Yinqing Luo & Jing Li, 2023. "Improving financial distress prediction using textual sentiment of annual reports," Annals of Operations Research, Springer, vol. 330(1), pages 457-484, November.
    24. Guay, Wayne & Samuels, Delphine & Taylor, Daniel, 2016. "Guiding through the Fog: Financial statement complexity and voluntary disclosure," Journal of Accounting and Economics, Elsevier, vol. 62(2), pages 234-269.
    25. Klein, Benjamin & Leffler, Keith B, 1981. "The Role of Market Forces in Assuring Contractual Performance," Journal of Political Economy, University of Chicago Press, vol. 89(4), pages 615-641, August.
    26. Muhammed Lawal Subair & Ramat Titlayo Salman & Ayodeji Fatai Abolarin & Abdulrasheed Taiwo Abdullahi & Akeem Sisofa Othman, 2020. "Board Characteristics And The Likelihood Of Financial Statement Fraud," Copernican Journal of Finance & Accounting, Uniwersytet Mikolaja Kopernika, vol. 9(1), pages 57-76.
    27. Xin‐Ping Song & Zhi‐Hua Hu & Jian‐Guo Du & Zhao‐Han Sheng, 2014. "Application of Machine Learning Methods to Risk Assessment of Financial Statement Fraud: Evidence from China," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 33(8), pages 611-626, December.
    28. Craja, Patricia & Kim, Alisa & Lessmann, Stefan, 2020. "Deep Learning application for fraud detection in financial statements," IRTG 1792 Discussion Papers 2020-007, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    29. Zhou, Xinghua & Reesor, R. Mark, 2015. "Misrepresentation and capital structure: Quantifying the impact on corporate debt value," Journal of Corporate Finance, Elsevier, vol. 34(C), pages 293-310.
    30. Surjeet Dalal & Bijeta Seth & Magdalena Radulescu & Carmen Secara & Claudia Tolea, 2022. "Predicting Fraud in Financial Payment Services through Optimized Hyper-Parameter-Tuned XGBoost Model," Mathematics, MDPI, vol. 10(24), pages 1-17, December.
    31. Yonghai Wang & Qingyun Ye & Jenny Jing Wang & Yunqian Wang, 2023. "Earnings manipulation and similarity of annual report disclosure: Evidence from China," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(S1), pages 1137-1156, April.
    32. Lee, In & Shin, Yong Jae, 2020. "Machine learning for enterprises: Applications, algorithm selection, and challenges," Business Horizons, Elsevier, vol. 63(2), pages 157-170.
    33. Li, Jingyu & Guo, Ce & Lv, Sijia & Xie, Qiwei & Zheng, Xiaolong, 2024. "Financial fraud detection for Chinese listed firms: Does managers' abnormal tone matter?," Emerging Markets Review, Elsevier, vol. 62(C).
    34. Choi, Daewoung & Gam, Yong Kyu & Shin, Hojong, 2020. "Corporate fraud under pyramidal ownership structure: Evidence from a regulatory reform," Emerging Markets Review, Elsevier, vol. 45(C).
    35. Wang, Hua & Wang, Wei & Alhaleh, Shadi Emad Areef, 2021. "Mixed ownership and financial investment: Evidence from Chinese state-owned enterprises," Economic Analysis and Policy, Elsevier, vol. 70(C), pages 159-171.
    36. de Souza, João Antônio Salvador & Rissatti, Jean Carlo & Rover, Suliani & Borba, José Alonso, 2019. "The linguistic complexities of narrative accounting disclosure on financial statements: An analysis based on readability characteristics," Research in International Business and Finance, Elsevier, vol. 48(C), pages 59-74.
    37. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    38. David J. Scheaf & Matthew S. Wood, 2022. "Entrepreneurial Fraud: A Multidisciplinary Review and Synthesized Framework," Entrepreneurship Theory and Practice, , vol. 46(3), pages 607-642, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Quentin Dupont & Jonathan M. Karpoff, 2020. "The Trust Triangle: Laws, Reputation, and Culture in Empirical Finance Research," Journal of Business Ethics, Springer, vol. 163(2), pages 217-238, May.
    2. Yunchuan Sun & Xiaoping Zeng & Ying Xu & Hong Yue & Xipu Yu, 2024. "An intelligent detecting model for financial frauds in Chinese A‐share market," Economics and Politics, Wiley Blackwell, vol. 36(2), pages 1110-1136, July.
    3. Dan Amiram & Zahn Bozanic & James D. Cox & Quentin Dupont & Jonathan M. Karpoff & Richard Sloan, 2018. "Financial reporting fraud and other forms of misconduct: a multidisciplinary review of the literature," Review of Accounting Studies, Springer, vol. 23(2), pages 732-783, June.
    4. Li, Guowen & Wang, Shuai & Feng, Yuyao, 2024. "Making differences work: Financial fraud detection based on multi-subject perceptions," Emerging Markets Review, Elsevier, vol. 60(C).
    5. Li, Jingyu & Guo, Ce & Lv, Sijia & Xie, Qiwei & Zheng, Xiaolong, 2024. "Financial fraud detection for Chinese listed firms: Does managers' abnormal tone matter?," Emerging Markets Review, Elsevier, vol. 62(C).
    6. Zaman, Rashid, 2024. "When corporate culture matters: The case of stakeholder violations," The British Accounting Review, Elsevier, vol. 56(1).
    7. Karpoff, Jonathan M., 2021. "The future of financial fraud," Journal of Corporate Finance, Elsevier, vol. 66(C).
    8. Humphery-Jenner, Mark & Liu, Yun & Nanda, Vikram & Silveri, Sabatino & Sun, Minxing, 2024. "Of fogs and bogs: Does litigation risk make financial reports less readable?," Journal of Banking & Finance, Elsevier, vol. 163(C).
    9. Cebi, Selcuk & Karakurt, Necip Fazıl & Kurtulus, Erkan & Tokgoz, Bunyamin, 2024. "Development of a decision support system for client acceptance in independent audit process," International Journal of Accounting Information Systems, Elsevier, vol. 53(C).
    10. Zhu, Hongtao & Rahman, Md Jahidur, 2025. "Reprint of: Ex-ante expected changes in ESG and future stock returns based on machine learning," The British Accounting Review, Elsevier, vol. 57(1).
    11. Shiqi Wang & Zhibo Zhang & Libing Fang & Cam-Tu Nguyen & Wenzhong Li, 2025. "Corporate Fraud Detection in Rich-yet-Noisy Financial Graph," Papers 2502.19305, arXiv.org, revised May 2025.
    12. Belen Blanco & Sandip Dhole & Ferdinand A. Gul, 2023. "Financial statement comparability and accounting fraud," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 50(7-8), pages 1166-1205, July.
    13. Nam Tran & Don O'Sullivan, 2020. "The relationship between corporate social responsibility, financial misstatements and SEC enforcement actions," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(S1), pages 1111-1147, April.
    14. Li, Jing & Li, Nan & Xia, Tongshui & Guo, Jinjin, 2023. "Textual analysis and detection of financial fraud: Evidence from Chinese manufacturing firms," Economic Modelling, Elsevier, vol. 126(C).
    15. Rahman, Md Jahidur & Zhu, Hongtao, 2024. "Detecting accounting fraud in family firms: Evidence from machine learning approaches," Advances in accounting, Elsevier, vol. 64(C).
    16. Marie Herly & Nikolaj Niebuhr Lambertsen, 2023. "Restatement costs and reporting bias," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 50(1-2), pages 91-117, January.
    17. Richardson, Grant & Obaydin, Ivan & Liu, Chelsea, 2022. "The effect of accounting fraud on future stock price crash risk," Economic Modelling, Elsevier, vol. 117(C).
    18. Bashir Ahmad & Maria Ciupac-Ulici & Daniela-Georgeta Beju, 2021. "Economic and Non-Economic Variables Affecting Fraud in European Countries," Risks, MDPI, vol. 9(6), pages 1-17, June.
    19. Bidisha Chakrabarty & Pamela C. Moulton & Leonid Pugachev & Xu (Frank) Wang, 2025. "Catch me if you can: In search of accuracy, scope, and ease of fraud prediction," Review of Accounting Studies, Springer, vol. 30(2), pages 1268-1308, June.
    20. Borochin, Paul & Wang, Xiaoqiong & Wei, Siqi, 2024. "Can long-term institutional owners improve market efficiency in parsing complex legal disputes?," International Review of Economics & Finance, Elsevier, vol. 96(PC).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:pacfin:v:90:y:2025:i:c:s0927538x25000022. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/pacfin .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.