IDEAS home Printed from https://ideas.repec.org/a/wsi/jicepx/v16y2025i02ns1793993325500085.html
   My bibliography  Save this article

Leveraging Tree-based Machine Learning for Predicting Earnings Management

Author

Listed:
  • Tam Phan Huy

    (University of Economics and Law, Ho Chi Minh City, Vietnam†Vietnam National University, Ho Chi Minh City, Vietnam)

  • Tuyet Pham Hong

    (University of Economics and Law, Ho Chi Minh City, Vietnam†Vietnam National University, Ho Chi Minh City, Vietnam)

  • An Bui Nguyen Quoc

    (��Department of Information Technology, Nha Trang University, Nha Trang, Khanh Hoa, Vietnam)

Abstract

Earnings management poses a critical challenge in corporate finance, as firms often manipulate financial statements to achieve specific targets, potentially misleading stakeholders. Traditional detection techniques, such as discretionary accrual models, face limitations in identifying the complex, nonlinear patterns of earnings manipulation. This research utilizes tree-based machine learning models — Decision Trees, Random Forests, and Gradient Boosting Machines (GBM) — to forecast earnings management in firms listed on Vietnamese stock exchanges, including the Ho Chi Minh City Stock Exchange (HOSE), Hanoi Stock Exchange (HNX), and Unlisted Public Company Market (UPCoM). The study analyzes data from 1,652 firms, covering the period from 2008 to 2023, resulting in 17,215 observations. Following rigorous data preprocessing to remove errors and outliers, the performance of the models was assessed based on their ability to predict earnings management, as indicated by discretionary accruals. The findings demonstrate that GBM surpasses the other models in key performance metrics, such as accuracy, precision, recall, and F1 score, establishing it as the most effective tool for detecting earnings manipulation. Furthermore, the study identifies Operating Cash Flow (OCF) as the most significant predictor, with Return on Assets (ROA) and firm size also playing vital roles. These results contribute to the expanding body of research on machine learning in financial analysis and provide valuable insights for financial analysts, auditors, corporate governance professionals, and regulators in improving the detection and prevention of earnings management.

Suggested Citation

  • Tam Phan Huy & Tuyet Pham Hong & An Bui Nguyen Quoc, 2025. "Leveraging Tree-based Machine Learning for Predicting Earnings Management," Journal of International Commerce, Economics and Policy (JICEP), World Scientific Publishing Co. Pte. Ltd., vol. 16(02), pages 1-20, June.
  • Handle: RePEc:wsi:jicepx:v:16:y:2025:i:02:n:s1793993325500085
    DOI: 10.1142/S1793993325500085
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S1793993325500085
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S1793993325500085?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • M41 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Accounting - - - Accounting
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:jicepx:v:16:y:2025:i:02:n:s1793993325500085. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/jicep/jicep.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.