Author
Listed:
- Risk Staff
- Jiaming Liu
- Bo Yuan
Abstract
In recent years, the analysis of the management discussion and analysis (MD&A) text of listed companies has gradually gained attention in financial distress prediction. By integrating text analysis and machine learning techniques, this paper reveals the financial information hidden in MD&A text and accurately captures the emotional tendency of the text through a sentiment analysis lexicon, providing a more comprehensive and detailed method for predicting a company’s financial condition. The study explores how integrating financial, semantic and sentiment features affects the ability to predict the financial distress of listed companies. To this end, we propose an innovative three-phase fusion model. First, semantic features are extracted from the MD&A sections of the annual reports of listed companies using deep learning techniques, and sentiment features are derived from the MD&A text using a sentiment dictionary. Then, initial prediction models are constructed separately on the financial, semantic and sentiment features. Finally, a stacking ensemble strategy integrates these models into a heterogeneous stacking model to improve prediction accuracy. The results indicate that financial features play a critical role in the prediction models and have a decisive impact on prediction accuracy. Introducing semantic and sentiment features significantly enhances the model’s predictive performance. Further, by comparing different algorithms in the model (naive Bayes, random forest, extreme gradient boosting, logistic regression and ridge regression), we find that adopting a heterogeneous stacking model not only enhances overall prediction accuracy but also improves the model’s generalizability.
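The abstract does not say how the dictionary-based sentiment features are computed, so the following is only a minimal sketch of one common approach: counting lexicon hits in the text and turning them into ratios. The word lists, the function name `sentiment_features`, the three feature names and the English regex tokenizer are all assumptions made for illustration; they are not taken from the paper, which may use a different lexicon, language and feature set.

```python
import re
from collections import Counter

# Hypothetical toy lexicon (assumption): a real study would use a
# published finance-oriented sentiment dictionary instead.
POSITIVE = {"growth", "improve", "improved", "profit", "strong", "gain"}
NEGATIVE = {"loss", "decline", "default", "risk", "weak", "litigation"}


def sentiment_features(text: str) -> dict:
    """Return simple lexicon-based sentiment features for one MD&A text."""
    # Naive tokenizer (assumption): lowercase alphabetic runs only.
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(tokens)

    pos = sum(counts[word] for word in POSITIVE)
    neg = sum(counts[word] for word in NEGATIVE)
    total = len(tokens)
    hits = pos + neg

    return {
        # Share of all tokens that are positive / negative lexicon words.
        "pos_ratio": pos / total if total else 0.0,
        "neg_ratio": neg / total if total else 0.0,
        # Net tone in [-1, 1]; 0.0 when no lexicon word occurs.
        "tone": (pos - neg) / hits if hits else 0.0,
    }


if __name__ == "__main__":
    sample = "Revenue growth was strong but litigation risk led to a loss"
    print(sentiment_features(sample))
```

Features of this kind could then feed a sentiment-only base model alongside the financial and semantic base models, whose predictions a stacking meta-learner would combine.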
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:rsk:journ4:7961700. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thomas Paine (email available below). General contact details of provider: https://www.risk.net/journal-of-risk .