IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v304y2023i2p729-744.html
   My bibliography  Save this article

Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull

Author

Listed:
  • Esteve, Miriam
  • Aparicio, Juan
  • Rodriguez-Sala, Jesus J.
  • Zhu, Joe

Abstract

In the technical efficiency evaluation area, it may happen that many observations obtain a similar relative technical efficiency status, making it difficult to discriminate between them. The determination of super-efficiency has been a way of solving this problem by providing a method to differentiate between the performance of observations. Despite the existence of some approaches dealing with the notion of super-efficiency in the literature, there have been few attempts to address this problem from the standpoint of machine learning techniques. In this paper, we fill this gap by adapting Random Forest to determine super-efficiency in the context of the Free Disposal Hull (FDH) technique. The new super-efficiency approach is robust to resampling on inputs and data. Additionally, we show how the new approach could be a possible solution for dealing with the curse of dimensionality problem; typically associated with FDH. Furthermore, exploiting the adaptation of Random Forest, a new method for assessing the importance of input variables is introduced. Finally, the advantages of the proposed approach are illustrated through a real example.

Suggested Citation

  • Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
  • Handle: RePEc:eee:ejores:v:304:y:2023:i:2:p:729-744
    DOI: 10.1016/j.ejor.2022.04.024
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221722003381
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2022.04.024?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Leopold Simar & Paul Wilson, 2000. "A general methodology for bootstrapping in non-parametric frontier models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(6), pages 779-802.
    2. Agrell, Per J. & Bogetoft, Peter, 2017. "Regulatory Benchmarking: Models, Analyses and Applications," Data Envelopment Analysis Journal, now publishers, vol. 3(1-2), pages 49-91, November.
    3. S C Ray, 2008. "The directional distance function and measurement of super-efficiency: an application to airlines data," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 59(6), pages 788-797, June.
    4. Léopold Simar & Paul Wilson, 2000. "Statistical Inference in Nonparametric Frontier Models: The State of the Art," Journal of Productivity Analysis, Springer, vol. 13(1), pages 49-78, January.
    5. Kerstens, Kristiaan & O’Donnell, Christopher & Van de Woestyne, Ignace, 2019. "Metatechnology frontier and convexity: A restatement," European Journal of Operational Research, Elsevier, vol. 275(2), pages 780-792.
    6. Golany, B & Roll, Y, 1989. "An application procedure for DEA," Omega, Elsevier, vol. 17(3), pages 237-250.
    7. Kerstens, Kristiaan & Sadeghi, Jafar & Toloo, Mehdi & Van de Woestyne, Ignace, 2022. "Procedures for ranking technical and cost efficient units: With a focus on nonconvexity," European Journal of Operational Research, Elsevier, vol. 300(1), pages 269-281.
    8. Per Andersen & Niels Christian Petersen, 1993. "A Procedure for Ranking Efficient Units in Data Envelopment Analysis," Management Science, INFORMS, vol. 39(10), pages 1261-1264, October.
    9. Lee, Hsuan-Shih & Chu, Ching-Wu & Zhu, Joe, 2011. "Super-efficiency DEA in the presence of infeasibility," European Journal of Operational Research, Elsevier, vol. 212(1), pages 141-147, July.
    10. Charles, Vincent & Aparicio, Juan & Zhu, Joe, 2019. "The curse of dimensionality of decision-making units: A simple approach to increase the discriminatory power of data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 279(3), pages 929-940.
    11. Lee, Chia-Yen & Cai, Jia-Ying, 2020. "LASSO variable selection in data envelopment analysis with small datasets," Omega, Elsevier, vol. 91(C).
    12. Adler, Nicole & Berechman, Joseph, 2001. "Measuring airport quality from the airlines' viewpoint: an application of data envelopment analysis," Transport Policy, Elsevier, vol. 8(3), pages 171-181, July.
    13. Lidia Angulo-Meza & Marcos Lins, 2002. "Review of Methods for Increasing Discrimination in Data Envelopment Analysis," Annals of Operations Research, Springer, vol. 116(1), pages 225-242, October.
    14. Daniel J. Henderson & Christopher F. Parmeter, 2009. "Imposing economic constraints in nonparametric regression: survey, implementation, and extension," Advances in Econometrics, in: Nonparametric Econometric Methods, pages 433-469, Emerald Group Publishing Limited.
    15. Aparicio, Juan & Zofío, José L., 2021. "Economic cross-efficiency," Omega, Elsevier, vol. 100(C).
      • Aparicio, J. & Zofío, J.L., 2019. "Economic Cross-Efficiency," ERIM Report Series Research in Management ERS-2019-001-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    16. Valero-Carreras, Daniel & Aparicio, Juan & Guerrero, Nadia M., 2021. "Support vector frontiers: A new approach for estimating production functions through support vector machines," Omega, Elsevier, vol. 104(C).
    17. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    18. John Ruggiero, 2005. "Impact Assessment Of Input Omission On Dea," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 4(03), pages 359-368.
    19. Cláudia Araújo & Carlos Barros & Peter Wanke, 2014. "Efficiency determinants and capacity issues in Brazilian for-profit hospitals," Health Care Management Science, Springer, vol. 17(2), pages 126-138, June.
    20. Jesús T. Pastor & JosÉ L. Ruiz & Inmaculada Sirvent, 2002. "A Statistical Test for Nested Radial Dea Models," Operations Research, INFORMS, vol. 50(4), pages 728-735, August.
    21. Misiunas, Nicholas & Oztekin, Asil & Chen, Yao & Chandra, Kavitha, 2016. "DEANN: A healthcare analytic methodology of data envelopment analysis and artificial neural networks for the prediction of organ recipient functional status," Omega, Elsevier, vol. 58(C), pages 46-54.
    22. Olesen, O.B. & Ruggiero, J., 2022. "The hinging hyperplanes: An alternative nonparametric representation of a production function," European Journal of Operational Research, Elsevier, vol. 296(1), pages 254-266.
    23. Nataraja, Niranjan R. & Johnson, Andrew L., 2011. "Guidelines for using variable selection techniques in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 215(3), pages 662-669, December.
    24. R. G. Chambers & Y. Chung & R. Färe, 1998. "Profit, Directional Distance Functions, and Nerlovian Efficiency," Journal of Optimization Theory and Applications, Springer, vol. 98(2), pages 351-364, August.
    25. Joe Zhu, 2014. "Data Envelopment Analysis," International Series in Operations Research & Management Science, in: Quantitative Models for Performance Evaluation and Benchmarking, edition 3, chapter 1, pages 1-9, Springer.
    26. Banker, Rajiv D. & Chang, Hsihui, 2006. "The super-efficiency procedure for outlier identification, not for ranking efficient units," European Journal of Operational Research, Elsevier, vol. 175(2), pages 1311-1320, December.
    27. Léopold Simar & Paul W. Wilson, 1998. "Sensitivity Analysis of Efficiency Scores: How to Bootstrap in Nonparametric Frontier Models," Management Science, INFORMS, vol. 44(1), pages 49-61, January.
    28. Lozano, Sebastián & Khezri, Somayeh, 2021. "Network DEA smallest improvement approach," Omega, Elsevier, vol. 98(C).
    29. Carlos Pestana Barros & Silvestre Dumbo & Peter Wanke, 2014. "Efficiency Determinants and Capacity Issues in Angolan Insurance Companies," South African Journal of Economics, Economic Society of South Africa, vol. 82(3), pages 455-467, September.
    30. Dominique Deprins & Léopold Simar & Henry Tulkens, 2006. "Measuring Labor-Efficiency in Post Offices," Springer Books, in: Parkash Chander & Jacques Drèze & C. Knox Lovell & Jack Mintz (ed.), Public goods, environmental externalities and fiscal competition, chapter 0, pages 285-309, Springer.
    31. Homburg, Carsten, 2001. "Using data envelopment analysis to benchmark activities," International Journal of Production Economics, Elsevier, vol. 73(1), pages 51-58, August.
    32. N Adler & B Golany, 2002. "Including principal component weights to improve discrimination in data envelopment analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 53(9), pages 985-991, September.
    33. Balk, Bert M. & (René) De Koster, M.B.M. & Kaps, Christian & Zofío, José L., 2021. "An evaluation of cross-efficiency methods: With an application to warehouse performance," Applied Mathematics and Computation, Elsevier, vol. 406(C).
    34. Juo, Jia-Ching & Fu, Tsu-Tan & Yu, Ming-Miin & Lin, Yu-Hui, 2015. "Profit-oriented productivity change," Omega, Elsevier, vol. 57(PB), pages 176-187.
    35. Raymond L. Raab & Richard W. Lichty, 2002. "Identifying Subareas That Comprise A Greater Metropolitan Area: The Criterion of County Relative Efficiency," Journal of Regional Science, Wiley Blackwell, vol. 42(3), pages 579-594, August.
    36. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    37. Chen, Yao, 2005. "Measuring super-efficiency in DEA in the presence of infeasibility," European Journal of Operational Research, Elsevier, vol. 161(2), pages 545-551, March.
    38. Timo Kuosmanen & Andrew L. Johnson, 2010. "Data Envelopment Analysis as Nonparametric Least-Squares Regression," Operations Research, INFORMS, vol. 58(1), pages 149-160, February.
    39. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    40. Sarkis, Joseph, 2000. "A comparative analysis of DEA as a discrete alternative multiple criteria decision tool," European Journal of Operational Research, Elsevier, vol. 123(3), pages 543-557, June.
    41. Dyson, R. G. & Allen, R. & Camanho, A. S. & Podinovski, V. V. & Sarrico, C. S. & Shale, E. A., 2001. "Pitfalls and protocols in DEA," European Journal of Operational Research, Elsevier, vol. 132(2), pages 245-259, July.
    42. Vincent Charles & Juan Aparicio & Joe Zhu (ed.), 2020. "Data Science and Productivity Analytics," International Series in Operations Research and Management Science, Springer, number 978-3-030-43384-0, December.
    43. Mojirsheibani, M., 1997. "A consistent combined classification rule," Statistics & Probability Letters, Elsevier, vol. 36(1), pages 43-47, November.
    44. Christopher Parmeter & Kai Sun & Daniel Henderson & Subal Kumbhakar, 2014. "Estimation and inference under economic restrictions," Journal of Productivity Analysis, Springer, vol. 41(1), pages 111-129, February.
    45. William W. Cooper & Lawrence M. Seiford & Kaoru Tone, 2007. "Data Envelopment Analysis," Springer Books, Springer, edition 0, number 978-0-387-45283-8, November.
    46. Rajiv D. Banker, 1993. "Maximum Likelihood, Consistency and Data Envelopment Analysis: A Statistical Foundation," Management Science, INFORMS, vol. 39(10), pages 1265-1273, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qianying JIN & Kristiaan KERSTENS & Ignace VAN DE WOESTYNE, 2023. "Convex and Nonconvex Nonparametric Frontier-based Classification Methods for Anomaly Detection," Working Papers 2023-EQM-01, IESEG School of Management.
    2. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.
    3. Papaioannou, Grammatoula & Podinovski, Victor V., 2023. "Production technologies with ratio inputs and outputs," European Journal of Operational Research, Elsevier, vol. 310(3), pages 1164-1178.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Valero-Carreras, Daniel & Aparicio, Juan & Guerrero, Nadia M., 2021. "Support vector frontiers: A new approach for estimating production functions through support vector machines," Omega, Elsevier, vol. 104(C).
    2. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Ranking the Importance of Variables in a Nonparametric Frontier Analysis Using Unsupervised Machine Learning Techniques," Mathematics, MDPI, vol. 11(11), pages 1-24, June.
    3. Nadia M. Guerrero & Juan Aparicio & Daniel Valero-Carreras, 2022. "Combining Data Envelopment Analysis and Machine Learning," Mathematics, MDPI, vol. 10(6), pages 1-22, March.
    4. Zervopoulos, Panagiotis & Emrouznejad, Ali & Sklavos, Sokratis, 2019. "A Bayesian approach for correcting bias of data envelopment analysis estimators," MPRA Paper 91886, University Library of Munich, Germany.
    5. Sebastian Kohl & Jan Schoenfelder & Andreas Fügener & Jens O. Brunner, 2019. "The use of Data Envelopment Analysis (DEA) in healthcare with a focus on hospitals," Health Care Management Science, Springer, vol. 22(2), pages 245-286, June.
    6. Valentin Zelenyuk, 2019. "Data Envelopment Analysis and Business Analytics: The Big Data Challenges and Some Solutions," CEPA Working Papers Series WP072019, School of Economics, University of Queensland, Australia.
    7. Charles, Vincent & Aparicio, Juan & Zhu, Joe, 2019. "The curse of dimensionality of decision-making units: A simple approach to increase the discriminatory power of data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 279(3), pages 929-940.
    8. Vincent Charles & Ioannis E. Tsolas & Tatiana Gherman, 2018. "Satisficing data envelopment analysis: a Bayesian approach for peer mining in the banking sector," Annals of Operations Research, Springer, vol. 269(1), pages 81-102, October.
    9. Zelenyuk, Valentin, 2020. "Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data," European Journal of Operational Research, Elsevier, vol. 282(1), pages 172-187.
    10. Imad Bou-Hamad & Abdel Latef Anouze & Ibrahim H. Osman, 2022. "A cognitive analytics management framework to select input and output variables for data envelopment analysis modeling of performance efficiency of banks using random forest and entropy of information," Annals of Operations Research, Springer, vol. 308(1), pages 63-92, January.
    11. An, Qingxian & Tao, Xiangyang & Chen, Xiaohong, 2023. "Nested frontier-based best practice regulation under asymmetric information in a principal–agent framework," European Journal of Operational Research, Elsevier, vol. 306(1), pages 269-285.
    12. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.
    13. Toloo, Mehdi & Tone, Kaoru & Izadikhah, Mohammad, 2023. "Selecting slacks-based data envelopment analysis models," European Journal of Operational Research, Elsevier, vol. 308(3), pages 1302-1318.
    14. Lee, Chia-Yen & Cai, Jia-Ying, 2020. "LASSO variable selection in data envelopment analysis with small datasets," Omega, Elsevier, vol. 91(C).
    15. Ebrahimi, Bohlool & Dhamotharan, Lalitha & Ghasemi, Mohammad Reza & Charles, Vincent, 2022. "A cross-inefficiency approach based on the deviation variables framework," Omega, Elsevier, vol. 111(C).
    16. Anna Łozowicka & Bartłomiej Lach, 2022. "CI-DEA: A Way to Improve the Discriminatory Power of DEA—Using the Example of the Efficiency Assessment of the Digitalization in the Life of the Generation 50+," Sustainability, MDPI, vol. 14(6), pages 1-22, March.
    17. Halkos, George & Tzeremes, Nickolaos, 2010. "Measuring the effect of virtual mergers on banks’ efficiency levels:A non parametric analysis," MPRA Paper 23696, University Library of Munich, Germany.
    18. Sahoo, Biresh K. & Singh, Ramadhar & Mishra, Bineet & Sankaran, Krithiga, 2017. "Research productivity in management schools of India during 1968-2015: A directional benefit-of-doubt model analysis," Omega, Elsevier, vol. 66(PA), pages 118-139.
    19. Nataraja, Niranjan R. & Johnson, Andrew L., 2011. "Guidelines for using variable selection techniques in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 215(3), pages 662-669, December.
    20. Rafael Benítez & Vicente Coll-Serrano & Vicente J. Bolós, 2021. "deaR-Shiny: An Interactive Web App for Data Envelopment Analysis," Sustainability, MDPI, vol. 13(12), pages 1-19, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:304:y:2023:i:2:p:729-744. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.