IDEAS home Printed from https://ideas.repec.org/a/aiy/jnjaer/v24y2025i2p584-621.html
   My bibliography  Save this article

Clustering of Russian Manufacturing Companies by Indicators of Their Financial Condition Using Machine Learning Technologies

Author

Listed:
  • Lev A. Bulanov
  • Alexei V. Kalina
  • Vadim V. Krivorotov

Abstract

Clustering of research objects and combining them into similar groups based on a set of characteristics is an important stage in solving many tasks of socio-economic development, especially tasks related to assessing the state of the socio-economic system, as well as modeling and forecasting indicators of its future development. The purpose of this study is a relative assessment of the financial condition of large Russian manufacturing companies based on data from accounting reporting forms using clusterization methods classified as machine learning without a teacher. The results of such an assessment are subsequently supposed to be used to build a model for assessing the financial condition of companies based on one of the machine learning algorithms with a teacher. The paper offers key indicators of the financial condition of companies, on the basis of which it is proposed to perform their clusterization. They were identified as a result of the analysis of modern methods and approaches to research and assessment of competitiveness and competitive position of companies. When conducting clusterization based on the proposed set of indicators, financial reporting data from 2,249 Russian manufacturing companies based on the results of 2023 were used. Companies with a turnover of more than 2 billion rubles and a staff of more than 251 people were considered as large companies.. K-Means++, hierarchical clustering, and DBSCAN were used as clustering algorithms. In order to obtain the best result, special data preprocessing and selection of the necessary hyperparameters for clustering algorithms were carried out. The quality of the final clustering was assessed using the Davies-Bouldin (DBI) and the Calinski–Harabasz (CHI) scores. The results showed that the production companies under consideration can be combined into a relatively small number of clusters (usually no more than 3) in terms of financial condition, which opens up wide opportunities for building models of the financial condition of companies. Based on the results of using 3 clustering methods, K-Means++ turned out to be the best algorithm by a small margin, the formed centroids of which can be called the average assessment of companies with poor, normal and good financial condition. The quality of the final clustering can be assessed as good.

Suggested Citation

  • Lev A. Bulanov & Alexei V. Kalina & Vadim V. Krivorotov, 2025. "Clustering of Russian Manufacturing Companies by Indicators of Their Financial Condition Using Machine Learning Technologies," Journal of Applied Economic Research, Graduate School of Economics and Management, Ural Federal University, vol. 24(2), pages 584-621.
  • Handle: RePEc:aiy:jnjaer:v:24:y:2025:i:2:p:584-621
    DOI: https://doi.org/10.15826/vestnik.2025.24.2.020
    as

    Download full text from publisher

    File URL: https://journalaer.ru//fileadmin/user_upload/site_15934/2025/08_Bulanov_Kalina_Krivorotov.pdf
    Download Restriction: no

    File URL: https://libkey.io/https://doi.org/10.15826/vestnik.2025.24.2.020?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    financial analysis; machine learning; financial condition indicators; large company; clusterization of companies; K-Means++; hierarchical clusterization; DBSCAN;
    All these keywords.

    JEL classification:

    • D22 - Microeconomics - - Production and Organizations - - - Firm Behavior: Empirical Analysis
    • G30 - Financial Economics - - Corporate Finance and Governance - - - General
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aiy:jnjaer:v:24:y:2025:i:2:p:584-621. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Natalia Starodubets (email available below). General contact details of provider: https://edirc.repec.org/data/seurfru.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.