IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i10p1776-d821954.html

DEA and Machine Learning for Performance Prediction

Author

Listed:
  • Zhishuo Zhang

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

  • Yao Xiao

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

  • Huayong Niu

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

Abstract

Data envelopment analysis (DEA), a non-parametric method for evaluating the relative efficiency of decision units, has been widely applied to assess the performance of banks, enterprises, governments, research institutions, hospitals, and other organizations. However, its efficient frontier is constructed from the input-output data of the existing decision units, which makes it difficult to use the method to predict the future performance of other decision units. In this paper, the slacks-based measure (SBM) model in DEA is used to measure the relative efficiency of decision units, and eleven machine learning models are then trained to learn the absolute efficient frontier so that it can be applied to the performance prediction of new decision units. To further improve the prediction performance of the models, this paper proposes constructing the training set by DEA classification, addressing both training-set sample selection and input feature indicators. Regression prediction of test-set performance is carried out with training sets built from different classification combinations, and the prediction effects of proportional (relative) indicators and absolute-number indicators as machine-learning input features are compared. The robustness of the efficient frontier under the integrated model is verified. An integrated model of DEA and machine learning with better prediction performance is proposed, taking the prediction of China’s regional carbon-dioxide emission (carbon emission) performance as an example. The novelty of this work is twofold. First, the integrated model achieves performance prediction by constructing an efficient frontier, and the empirical results show that this is a feasible technique. Second, two schemes for improving the prediction effectiveness of integrated models are discussed, in terms of training-set partitioning and feature selection, and their effectiveness is demonstrated using carbon-emission performance prediction as an example. This study has some application value and complements the existing literature.
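
The two-stage idea in the abstract (score existing units with DEA, then fit a model that can score new units against that fixed frontier) can be illustrated with a deliberately small sketch. It is not the authors' implementation. It assumes a single input and a single output under constant returns to scale, in which case the SBM score reduces to a unit's output/input ratio divided by the best ratio in the sample. A one-parameter least-squares fit stands in for the eleven machine learning models, and the data and the names `sbm_scores_1in_1out`, `fit_through_origin`, and `predict` are hypothetical.

```python
def sbm_scores_1in_1out(inputs, outputs):
    """SBM efficiency for one input and one output under constant returns
    to scale. In this special case the score is the unit's productivity
    (output / input) relative to the best productivity in the sample."""
    ratios = [y / x for x, y in zip(inputs, outputs)]
    best = max(ratios)
    return [r / best for r in ratios]


def fit_through_origin(features, targets):
    """Least-squares slope for the model target = slope * feature."""
    num = sum(f * t for f, t in zip(features, targets))
    den = sum(f * f for f in features)
    return num / den


# Hypothetical training units (illustrative numbers, not from the paper).
train_x = [2.0, 4.0, 5.0, 8.0]   # input, e.g. energy use
train_y = [1.0, 4.0, 3.0, 6.0]   # output, e.g. economic output

# Stage 1: DEA scores for the existing units.
scores = sbm_scores_1in_1out(train_x, train_y)

# Stage 2: learn the score from a proportional (relative) feature.
ratio_feature = [y / x for x, y in zip(train_x, train_y)]
slope = fit_through_origin(ratio_feature, scores)


def predict(x, y):
    """Score a new unit against the frontier learned from the training set,
    without re-solving DEA."""
    return slope * (y / x)


new_score = predict(10.0, 7.0)
print(scores, slope, new_score)
```

In this toy setting the score is exactly linear in the ratio feature, so the fitted model reproduces the frontier; that is a property of the simplified setup, not a result reported in the paper. A new unit whose ratio exceeds the best training ratio would receive a predicted score above 1, because the frontier is held fixed rather than re-estimated.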

Suggested Citation

  • Zhishuo Zhang & Yao Xiao & Huayong Niu, 2022. "DEA and Machine Learning for Performance Prediction," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1776-:d:821954

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/10/1776/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/10/1776/
    Download Restriction: no

    References listed on IDEAS

    1. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    2. Omur Tosun, 2012. "Using data envelopment analysis-neural network model to evaluate hospital efficiency," International Journal of Productivity and Quality Management, Inderscience Enterprises Ltd, vol. 9(2), pages 245-257.
    3. Staub, Roberta B. & da Silva e Souza, Geraldo & Tabak, Benjamin M., 2010. "Evolution of bank efficiency in Brazil: A DEA approach," European Journal of Operational Research, Elsevier, vol. 202(1), pages 204-213, April.
    4. Yuan Xu & Yong Shin Park & Ju Dong Park & Wonjoo Cho, 2021. "Evaluating the environmental efficiency of the U.S. airline industry using a directional distance function DEA approach," Journal of Management Analytics, Taylor & Francis Journals, vol. 8(1), pages 1-18, January.
    5. Bauer, Paul W., 1990. "Recent developments in the econometric estimation of frontiers," Journal of Econometrics, Elsevier, vol. 46(1-2), pages 39-56.
    6. Samoilenko, Sergey & Osei-Bryson, Kweku-Muata, 2010. "Determining sources of relative inefficiency in heterogeneous samples: Methodology using Cluster Analysis, DEA and Neural Networks," European Journal of Operational Research, Elsevier, vol. 206(2), pages 479-487, October.
    7. Holod, Dmytro & Lewis, Herbert F., 2011. "Resolving the deposit dilemma: A new DEA bank efficiency model," Journal of Banking & Finance, Elsevier, vol. 35(11), pages 2801-2810, November.
    8. Kwon, He-Boong, 2017. "Exploring the predictive potential of artificial neural networks in conjunction with DEA in railroad performance modeling," International Journal of Production Economics, Elsevier, vol. 183(PA), pages 159-170.
    9. Zhang, Caiqing & Chen, Panyu, 2022. "Applying the three-stage SBM-DEA model to evaluate energy efficiency and impact factors in RCEP countries," Energy, Elsevier, vol. 241(C).
    10. Daniel Santin & Francisco Delgado & Aurelia Valino, 2004. "The measurement of technical efficiency: a neural network approach," Applied Economics, Taylor & Francis Journals, vol. 36(6), pages 627-635.
    11. Robert Stefko & Beata Gavurova & Kristina Kocisova, 2018. "Healthcare efficiency assessment using DEA analysis in the Slovak Republic," Health Economics Review, Springer, vol. 8(1), pages 1-12, December.
    12. Mohamed Dia & Shashi K. Shahi & Luckny Zéphyr, 2021. "An Assessment of the Efficiency of Canadian Power Generation Companies with Bootstrap DEA," JRFM, MDPI, vol. 14(10), pages 1-27, October.
    13. A. Charnes & W. W. Cooper, 1962. "Programming with linear fractional functionals," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 9(3‐4), pages 181-186, September.
    14. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    15. Fakarudin Kamarudin & Fadzlan Sufian & Annuar Md. Nassir & Nazratul Aina Mohamad Anwar & Hafezali Iqbal Hussain, 2019. "Bank Efficiency in Malaysia a DEA Approach," Journal of Central Banking Theory and Practice, Central bank of Montenegro, vol. 8(1), pages 133-162.
    16. Tone, Kaoru, 2001. "A slacks-based measure of efficiency in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 130(3), pages 498-509, May.
    17. Wanke, Peter & Abul Kalam Azad, Md & Emrouznejad, Ali & Antunes, Jorge, 2019. "A dynamic network DEA model for accounting and financial indicators: A case of efficiency in MENA banking," International Review of Economics & Finance, Elsevier, vol. 61(C), pages 52-68.

    Citations



    Cited by:

    1. Huayong Niu & Zhishuo Zhang & Manting Luo, 2022. "Evaluation and Prediction of Low-Carbon Economic Efficiency in China, Japan and South Korea: Based on DEA and Machine Learning," IJERPH, MDPI, vol. 19(19), pages 1-28, October.
    2. Reza Sanei & Farhad Hosseinzadeh lotfi & Mohammad Fallah & Farzad Movahedi Sobhani, 2022. "An Estimation of an Acceptable Efficiency Frontier Having an Optimum Resource Management Approach, with a Combination of the DEA-ANN-GA Technique (A Case Study of Branches of an Insurance Company)," Mathematics, MDPI, vol. 10(23), pages 1-21, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mansour Zarrin & Jan Schoenfelder & Jens O. Brunner, 2022. "Homogeneity and Best Practice Analyses in Hospital Performance Management: An Analytical Framework," Health Care Management Science, Springer, vol. 25(3), pages 406-425, September.
    2. Yung‐ho Chiu & Tai‐Yu Lin & Tzu‐Han Chang & Yi‐Nuo Lin & Shih‐Yung Chiu, 2021. "Prevaluating efficiency gains from potential mergers and acquisitions in the financial industry with the Resample Past–Present–Future data envelopment analysis approach," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 42(2), pages 369-384, March.
    3. Phung, Manh-Trung & Cheng, Cheng-Ping & Guo, Chuanyin & Kao, Chen-Yu, 2020. "Mixed Network DEA with Shared Resources: A Case of Measuring Performance for Banking Industry," Operations Research Perspectives, Elsevier, vol. 7(C).
    4. Kao, Chiang & Liu, Shiang-Tai, 2020. "A slacks-based measure model for calculating cross efficiency in data envelopment analysis," Omega, Elsevier, vol. 95(C).
    5. Huang, Shwu-Huei & Yu, Ming-Miin & Huang, Ya-Ling, 2022. "Evaluation of the efficiency of the local tax administration in Taiwan: Application of a dynamic network data envelopment analysis," Socio-Economic Planning Sciences, Elsevier, vol. 83(C).
    6. Alperovych, Yan & Amess, Kevin & Wright, Mike, 2013. "Private equity firm experience and buyout vendor source: What is their impact on efficiency?," European Journal of Operational Research, Elsevier, vol. 228(3), pages 601-611.
    7. Koronakos, Gregory & Sotiros, Dimitris & Despotis, Dimitris K. & Kritikos, Manolis N., 2022. "Fair efficiency decomposition in network DEA: A compromise programming approach," Socio-Economic Planning Sciences, Elsevier, vol. 79(C).
    8. Shivi Agarwal, 2016. "DEA-neural networks approach to assess the performance of public transport sector of India," OPSEARCH, Springer;Operational Research Society of India, vol. 53(2), pages 248-258, June.
    9. Kao, Chiang, 2022. "A maximum slacks-based measure of efficiency for closed series production systems," Omega, Elsevier, vol. 106(C).
    10. Kao, Chiang, 2022. "Closest targets in the slacks-based measure of efficiency for production units with multi-period data," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1042-1054.
    11. Adriel Martins de Freitas Branco & Alexandre Pereira Salgado Junior & Patrícia Benites Cava & Eduardo Falsarella Junior & Marco Antônio Alves de Souza Junior, 2017. "Efficiency of the Brazilian Banking System in 2014: A DEA-SBM Analysis," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 7(5), pages 1-2.
    12. Lin, Ruiyue & Liu, Qian, 2021. "Multiplier dynamic data envelopment analysis based on directional distance function: An application to mutual funds," European Journal of Operational Research, Elsevier, vol. 293(3), pages 1043-1057.
    13. Chiang Kao & Shiang-Tai Liu, 2022. "Stochastic efficiencies of network production systems with correlated stochastic data: the case of Taiwanese commercial banks," Annals of Operations Research, Springer, vol. 315(2), pages 1151-1174, August.
    14. Coert Erasmus, 2014. "An Empirical Study of Bank Efficiency in South Africa Using the Standard and Alternative Approaches to Data Envelopment Analysis (DEA)," Journal of Economics and Behavioral Studies, AMH International, vol. 6(4), pages 310-317.
    15. Xiang Ji & Jiasen Sun & Qunwei Wang & Qianqian Yuan, 2019. "Revealing Energy Over-Consumption and Pollutant Over-Emission Behind GDP: A New Multi-criteria Sustainable Measure," Computational Economics, Springer;Society for Computational Economics, vol. 54(4), pages 1391-1421, December.
    16. Fenfen Li & Bo Dai & Qifan Wu, 2021. "Dynamic Green Growth Assessment of China’s Industrial System with an Improved SBM Model and Global Malmquist Index," Mathematics, MDPI, vol. 9(20), pages 1-26, October.
    17. Gerami, Javad & Mozaffari, Mohammad Reza & Wanke, Peter F. & Correa, Henrique L., 2022. "Improving information reliability of non-radial value efficiency analysis: An additive slacks based measure approach," European Journal of Operational Research, Elsevier, vol. 298(3), pages 967-978.
    18. Dan Li & Yanfeng Li & Yeming Gong & Jiawei Yang, 2021. "Estimation of bank performance from multiple perspectives: an alternative solution to the deposit dilemma," Journal of Productivity Analysis, Springer, vol. 56(2), pages 151-170, December.
    19. Wen-Min Lu & Qian Long Kweh & Kai-Chu Yang, 2022. "Multiplicative efficiency aggregation to evaluate Taiwanese local auditing institutions performance," Annals of Operations Research, Springer, vol. 315(2), pages 1243-1262, August.
    20. Ya Chen & Yongjun Li & Liang Liang & Huaqing Wu, 2019. "An extension on super slacks-based measure DEA approach," Annals of Operations Research, Springer, vol. 278(1), pages 101-121, July.
