IDEAS home Printed from https://ideas.repec.org/a/taf/applec/v53y2021i51p5948-5963.html

Predicting French SME failures: new evidence from machine learning techniques

Author

Listed:
  • Christophe Schalck
  • Meryem Yankol-Schalck

Abstract

The aim of this study is to provide new insights into French small and medium-sized enterprises (SME) failure prediction using a unique database of French SMEs over the 2012–2018 period including both financial and nonfinancial variables. We also include text variables related to the type of activity. We compare the predictive performance of three estimation methods: a dynamic Probit model, logistic Lasso regression, and XGBoost algorithm. The results show that the XGBoost algorithm has the highest performance in predicting business failure from a broad dataset. We use SHAP values to interpret the results and identify the main factors of failure. Our analysis shows that both financial and nonfinancial variables are failure factors. Our results confirm the role of financial variables in predicting business failure, while self-employment is the factor that most strongly increases the probability of failure. The size of the SME is also a business failure factor. Our results show that a number of nonfinancial variables, such as localization and economic conditions, are drivers of SME failure. The results also show that certain activities are associated with a prediction of lower failure probability while some activities are associated with a prediction of higher failure.

Suggested Citation

  • Christophe Schalck & Meryem Yankol-Schalck, 2021. "Predicting French SME failures: new evidence from machine learning techniques," Applied Economics, Taylor & Francis Journals, vol. 53(51), pages 5948-5963, November.
  • Handle: RePEc:taf:applec:v:53:y:2021:i:51:p:5948-5963
    DOI: 10.1080/00036846.2021.1934389
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/00036846.2021.1934389
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/00036846.2021.1934389?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or

    for a different version of it.

    Other versions of this item:

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dejian Yu & Bo Xiang & Zhuoya Pan, 2024. "Combining text analytics and network path extraction to trace CSR in the social sciences: Intellectual structures and diffusion trajectories," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 31(5), pages 4532-4554, September.
    2. Hoang Hiep Nguyen & Jean-Laurent Viviani & Sami Ben Jabeur, 2023. "Bankruptcy prediction using machine learning and Shapley additive explanations," Post-Print hal-04223161, HAL.
    3. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    4. Hoang Hiep Nguyen & Jean-Laurent Viviani & Sami Ben Jabeur, 2025. "Bankruptcy prediction using machine learning and Shapley additive explanations," Review of Quantitative Finance and Accounting, Springer, vol. 65(1), pages 107-148, July.
    5. Dina Ait Lahcen, 2023. "Synthetic Reading Of The Different Approaches And Models For Assessing The Risk Of Business Failure [Lecture Synthétique Des Diverses Approches Et Modèles D'Évaluation Du Risque De La Défaillance Des Entreprises]," Post-Print hal-04009420, HAL.
    6. Enrico Schötz, 2025. "Resilience vs. survival: same song, new melody?," Future Business Journal, Springer, vol. 11(1), pages 1-29, December.

    More about this item

    JEL classification:

    • G33 - Financial Economics - - Corporate Finance and Governance - - - Bankruptcy; Liquidation
    • C41 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Duration Analysis; Optimal Timing Strategies
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:applec:v:53:y:2021:i:51:p:5948-5963. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/RAEC20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.