IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i10p1776-d821954.html
   My bibliography  Save this article

DEA and Machine Learning for Performance Prediction

Author

Listed:
  • Zhishuo Zhang

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

  • Yao Xiao

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

  • Huayong Niu

    (International Business School, Beijing Foreign Studies University, Beijing 100089, China)

Abstract

Data envelopment analysis (DEA) has been widely applied to evaluate the performance of banks, enterprises, governments, research institutions, hospitals, and other fields as a non-parametric estimation method for evaluating the relative effectiveness of research objects. However, the composition of its effective frontier surface is based on the input-output data of existing decision units, which makes it challenging to apply the method to predict the future performance level of other decision units. In this paper, the Slack Based Measure (SBM) model in DEA method is used to measure the relative efficiency values of decision units, and then, eleven machine learning models are used to train the absolute efficient frontier to be applied to the performance prediction of new decisions units. To further improve the prediction effect of the models, this paper proposes a training set under the DEA classification method, starting from the training-set sample selection and input feature indicators. In this paper, regression prediction of test set performance based on the training set under different classification combinations is performed, and the prediction effects of proportional relative indicators and absolute number indicators as machine-learning input features are explored. The robustness of the effective frontier surface under the integrated model is verified. An integrated models of DEA and machine learning with better prediction effects is proposed, taking China’s regional carbon-dioxide emission (carbon emission) performance prediction as an example. The novelty of this work is mainly as follows: firstly, the integrated model can achieve performance prediction by constructing an effective frontier surface, and the empirical results show that this is a feasible methodological technique. Secondly, two schemes to improve the prediction effectiveness of integrated models are discussed in terms of training set partitioning and feature selection, and the effectiveness of the schemes is demonstrated by using carbon-emission performance prediction as an example. This study has some application value and is a complement to the existing literature.

Suggested Citation

  • Zhishuo Zhang & Yao Xiao & Huayong Niu, 2022. "DEA and Machine Learning for Performance Prediction," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1776-:d:821954
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/10/1776/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/10/1776/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    2. Omur Tosun, 2012. "Using data envelopment analysis-neural network model to evaluate hospital efficiency," International Journal of Productivity and Quality Management, Inderscience Enterprises Ltd, vol. 9(2), pages 245-257.
    3. Staub, Roberta B. & da Silva e Souza, Geraldo & Tabak, Benjamin M., 2010. "Evolution of bank efficiency in Brazil: A DEA approach," European Journal of Operational Research, Elsevier, vol. 202(1), pages 204-213, April.
    4. Yuan Xu & Yong Shin Park & Ju Dong Park & Wonjoo Cho, 2021. "Evaluating the environmental efficiency of the U.S. airline industry using a directional distance function DEA approach," Journal of Management Analytics, Taylor & Francis Journals, vol. 8(1), pages 1-18, January.
    5. Bauer, Paul W., 1990. "Recent developments in the econometric estimation of frontiers," Journal of Econometrics, Elsevier, vol. 46(1-2), pages 39-56.
    6. Samoilenko, Sergey & Osei-Bryson, Kweku-Muata, 2010. "Determining sources of relative inefficiency in heterogeneous samples: Methodology using Cluster Analysis, DEA and Neural Networks," European Journal of Operational Research, Elsevier, vol. 206(2), pages 479-487, October.
    7. Holod, Dmytro & Lewis, Herbert F., 2011. "Resolving the deposit dilemma: A new DEA bank efficiency model," Journal of Banking & Finance, Elsevier, vol. 35(11), pages 2801-2810, November.
    8. Kwon, He-Boong, 2017. "Exploring the predictive potential of artificial neural networks in conjunction with DEA in railroad performance modeling," International Journal of Production Economics, Elsevier, vol. 183(PA), pages 159-170.
    9. Zhang, Caiqing & Chen, Panyu, 2022. "Applying the three-stage SBM-DEA model to evaluate energy efficiency and impact factors in RCEP countries," Energy, Elsevier, vol. 241(C).
    10. Daniel Santin & Francisco Delgado & Aurelia Valino, 2004. "The measurement of technical efficiency: a neural network approach," Applied Economics, Taylor & Francis Journals, vol. 36(6), pages 627-635.
    11. Robert Stefko & Beata Gavurova & Kristina Kocisova, 2018. "Healthcare efficiency assessment using DEA analysis in the Slovak Republic," Health Economics Review, Springer, vol. 8(1), pages 1-12, December.
    12. Mohamed Dia & Shashi K. Shahi & Luckny Zéphyr, 2021. "An Assessment of the Efficiency of Canadian Power Generation Companies with Bootstrap DEA," JRFM, MDPI, vol. 14(10), pages 1-27, October.
    13. A. Charnes & W. W. Cooper, 1962. "Programming with linear fractional functionals," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 9(3‐4), pages 181-186, September.
    14. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    15. Fakarudin Kamarudin & Fadzlan Sufian & Annuar Md. Nassir & Nazratul Aina Mohamad Anwar & Hafezali Iqbal Hussain, 2019. "Bank Efficiency in Malaysia a DEA Approach," Journal of Central Banking Theory and Practice, Central bank of Montenegro, vol. 8(1), pages 133-162.
    16. Tone, Kaoru, 2001. "A slacks-based measure of efficiency in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 130(3), pages 498-509, May.
    17. Wanke, Peter & Abul Kalam Azad, Md & Emrouznejad, Ali & Antunes, Jorge, 2019. "A dynamic network DEA model for accounting and financial indicators: A case of efficiency in MENA banking," International Review of Economics & Finance, Elsevier, vol. 61(C), pages 52-68.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Huayong Niu & Zhishuo Zhang & Manting Luo, 2022. "Evaluation and Prediction of Low-Carbon Economic Efficiency in China, Japan and South Korea: Based on DEA and Machine Learning," IJERPH, MDPI, vol. 19(19), pages 1-28, October.
    2. Reza Sanei & Farhad Hosseinzadeh lotfi & Mohammad Fallah & Farzad Movahedi Sobhani, 2022. "An Estimation of an Acceptable Efficiency Frontier Having an Optimum Resource Management Approach, with a Combination of the DEA-ANN-GA Technique (A Case Study of Branches of an Insurance Company)," Mathematics, MDPI, vol. 10(23), pages 1-21, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mansour Zarrin & Jan Schoenfelder & Jens O. Brunner, 2022. "Homogeneity and Best Practice Analyses in Hospital Performance Management: An Analytical Framework," Health Care Management Science, Springer, vol. 25(3), pages 406-425, September.
    2. Phung, Manh-Trung & Cheng, Cheng-Ping & Guo, Chuanyin & Kao, Chen-Yu, 2020. "Mixed Network DEA with Shared Resources: A Case of Measuring Performance for Banking Industry," Operations Research Perspectives, Elsevier, vol. 7(C).
    3. Kao, Chiang & Liu, Shiang-Tai, 2020. "A slacks-based measure model for calculating cross efficiency in data envelopment analysis," Omega, Elsevier, vol. 95(C).
    4. Koronakos, Gregory & Sotiros, Dimitris & Despotis, Dimitris K. & Kritikos, Manolis N., 2022. "Fair efficiency decomposition in network DEA: A compromise programming approach," Socio-Economic Planning Sciences, Elsevier, vol. 79(C).
    5. Kao, Chiang, 2022. "A maximum slacks-based measure of efficiency for closed series production systems," Omega, Elsevier, vol. 106(C).
    6. Kao, Chiang, 2022. "Closest targets in the slacks-based measure of efficiency for production units with multi-period data," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1042-1054.
    7. Lin, Ruiyue & Liu, Qian, 2021. "Multiplier dynamic data envelopment analysis based on directional distance function: An application to mutual funds," European Journal of Operational Research, Elsevier, vol. 293(3), pages 1043-1057.
    8. Chiang Kao & Shiang-Tai Liu, 2022. "Stochastic efficiencies of network production systems with correlated stochastic data: the case of Taiwanese commercial banks," Annals of Operations Research, Springer, vol. 315(2), pages 1151-1174, August.
    9. Coert Erasmus, 2014. "An Empirical Study of Bank Efficiency in South Africa Using the Standard and Alternative Approaches to Data Envelopment Analysis (DEA)," Journal of Economics and Behavioral Studies, AMH International, vol. 6(4), pages 310-317.
    10. Gerami, Javad & Mozaffari, Mohammad Reza & Wanke, Peter F. & Correa, Henrique L., 2022. "Improving information reliability of non-radial value efficiency analysis: An additive slacks based measure approach," European Journal of Operational Research, Elsevier, vol. 298(3), pages 967-978.
    11. Dan Li & Yanfeng Li & Yeming Gong & Jiawei Yang, 2021. "Estimation of bank performance from multiple perspectives: an alternative solution to the deposit dilemma," Journal of Productivity Analysis, Springer, vol. 56(2), pages 151-170, December.
    12. Wen-Min Lu & Qian Long Kweh & Kai-Chu Yang, 2022. "Multiplicative efficiency aggregation to evaluate Taiwanese local auditing institutions performance," Annals of Operations Research, Springer, vol. 315(2), pages 1243-1262, August.
    13. Zarrin, Mansour & Brunner, Jens O., 2023. "Analyzing the accuracy of variable returns to scale data envelopment analysis models," European Journal of Operational Research, Elsevier, vol. 308(3), pages 1286-1301.
    14. Jorge Antunes & Abdollah Hadi-Vencheh & Ali Jamshidi & Yong Tan & Peter Wanke, 2022. "Bank efficiency estimation in China: DEA-RENNA approach," Annals of Operations Research, Springer, vol. 315(2), pages 1373-1398, August.
    15. Yong Tan & Peter Wanke & Jorge Antunes & Ali Emrouznejad, 2021. "Unveiling endogeneity between competition and efficiency in Chinese banks: a two-stage network DEA and regression analysis," Annals of Operations Research, Springer, vol. 306(1), pages 131-171, November.
    16. Luis R. Murillo‐Zamorano, 2004. "Economic Efficiency and Frontier Techniques," Journal of Economic Surveys, Wiley Blackwell, vol. 18(1), pages 33-77, February.
    17. Paolo Liberati & Raffaele Lagravinese & Giuliano Resce, 2017. "How Does Economic Social And Cultural Status Affect The Efficiency Of Educational Attainments? A Comparative Analysis On Pisa Results," Departmental Working Papers of Economics - University 'Roma Tre' 0217, Department of Economics - University Roma Tre.
    18. Mushtaq Taleb & Ruzelan Khalid & Ali Emrouznejad & Razamin Ramli, 2023. "Environmental efficiency under weak disposability: an improved super efficiency data envelopment analysis model with application for assessment of port operations considering NetZero," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 25(7), pages 6627-6656, July.
    19. Kao, Chiang & Liu, Shiang-Tai, 2022. "Group decision making in data envelopment analysis: A robot selection application," European Journal of Operational Research, Elsevier, vol. 297(2), pages 592-599.
    20. Barros, C.P. & Emrouznejad, Ali, 2016. "Assessing productive efficiency of banks using integrated Fuzzy-DEA and bootstrapping: A case of Mozambican banksAuthor-Name: Wanke, Peter," European Journal of Operational Research, Elsevier, vol. 249(1), pages 378-389.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1776-:d:821954. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.