Author
Listed:
- Cai Yuanqing
- Zhenming Gao
- Zhang Jian
- Roohallah Alizadehsani
- Paweł Pławiak
Abstract
The financial sector has experienced swift growth over recent years, leading to the escalating prominence of credit risk among publicly traded companies. Consequently, forecasting credit risk for these firms has emerged as a critical task for banks, regulatory bodies, and investors. Traditional models include the z-score, the logit (logistic regression model), the kernel-based virtual machine (KVM), and neural network approaches. Nevertheless, the outcomes from these methods have often fallen short of expectations. Three major challenges in previous works are feature selection, imbalanced classification, and hyperparameter optimization. This paper presents a method for credit risk prediction for listed companies that uses an off-policy proximal policy optimization (PPO) algorithm for feature selection and imbalanced classification. The off-policy PPO, a reinforcement learning (RL) approach, enhances sample efficiency by more effectively utilizing past experiences during policy updates. This approach improves feature selection and the management of imbalanced classification by optimizing data use, thereby enhancing model training outcomes. Moreover, we use the Bayesian optimization hyperband (BOHB) approach to refine the hyperparameters of the method. BOHB merges Bayesian optimization and Hyperband, significantly speeding up the optimization process. We assess our model using the China Stock Market and Accounting Research (CSMAR), MorningStar, KMV default, Give Me Some Credit (GMSC), and the University of California, Irvine Credit Card Default (UCICCD) datasets. Our experimental findings demonstrate the excellence of the model over existing state-of-the-art models, achieving F-measures of 90.763%, 86.358%, 87.047%, 90.576%, and 89.485% on these datasets. These findings validate the efficiency of the method in economic settings, signifying a major progression in systems for predicting credit risk and enhancing investigative approaches.
Suggested Citation
Cai Yuanqing & Zhenming Gao & Zhang Jian & Roohallah Alizadehsani & Paweł Pławiak, 2025.
"Credit risk prediction model for listed companies based on improved reinforcement learning and Bayesian optimization hyperband,"
PLOS ONE, Public Library of Science, vol. 20(10), pages 1-38, October.
Handle:
RePEc:plo:pone00:0332150
DOI: 10.1371/journal.pone.0332150
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0332150. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.