IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v253y2017i1d10.1007_s10479-016-2382-2.html
   My bibliography  Save this article

Variable selection in data envelopment analysis via Akaike’s information criteria

Author

Listed:
  • Yongjun Li

    (University of Science and Technology of China)

  • Xiao Shi

    (Shandong University of Finance and Economics)

  • Min Yang

    (Hefei University of Technology)

  • Liang Liang

    (Hefei University of Technology)

Abstract

The decision makers always suffer from predicament in choosing appropriate variable set to evaluate/improve production efficiencies in many applications of data envelopment analysis (DEA). The selected data set may exist information redundancy. On that account, this study proposes an alternative approach to screen out proper input and output variables set for evaluation via Akaike’s information criteria (AIC) rule. This method mainly focuses on assessing the importance of subset of original variables rather than testing the marginal role of variables one by one in many other methods. In terms of the proposed approach, the most optimized variable set contains the least redundant information, which provides decision support to the decision makers. Besides, we also define redundant/cross redundant variables with the form of theorems and give the proofs subsequently. In addition, the AIC approach is firstly extended to stochastic data set to select an appropriate set of stochastic variables as well. Finally, the proposed approach has been applied to some data sets from given data and prior DEA literatures.

Suggested Citation

  • Yongjun Li & Xiao Shi & Min Yang & Liang Liang, 2017. "Variable selection in data envelopment analysis via Akaike’s information criteria," Annals of Operations Research, Springer, vol. 253(1), pages 453-476, June.
  • Handle: RePEc:spr:annopr:v:253:y:2017:i:1:d:10.1007_s10479-016-2382-2
    DOI: 10.1007/s10479-016-2382-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-016-2382-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-016-2382-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Léopold Simar & Valentin Zelenyuk, 2011. "Stochastic FDH/DEA estimators for frontier analysis," Journal of Productivity Analysis, Springer, vol. 36(1), pages 1-20, August.
    2. Wu, Y., 2008. "Simultaneous change point analysis and variable selection in a regression problem," Journal of Multivariate Analysis, Elsevier, vol. 99(9), pages 2154-2171, October.
    3. Anzanello, Michel J. & Albin, Susan L. & Chaovalitwongse, Wanpracha A., 2012. "Multicriteria variable selection for classification of production batches," European Journal of Operational Research, Elsevier, vol. 218(1), pages 97-105.
    4. Adler, Nicole & Golany, Boaz, 2001. "Evaluation of deregulated airline networks using data envelopment analysis combined with principal component analysis with an application to Western Europe," European Journal of Operational Research, Elsevier, vol. 132(2), pages 260-273, July.
    5. Qingxian An & Haoxun Chen & Jie Wu & Liang Liang, 2015. "Measuring slacks-based efficiency for commercial banks in China by using a two-stage DEA model with undesirable output," Annals of Operations Research, Springer, vol. 235(1), pages 13-35, December.
    6. Juan Du & Justin Wang & Yao Chen & Shin-Yi Chou & Joe Zhu, 2014. "Incorporating health outcomes in Pennsylvania hospital efficiency: an additive super-efficiency DEA approach," Annals of Operations Research, Springer, vol. 221(1), pages 161-172, October.
    7. C Serrano Cinca & C Mar Molinero, 2004. "Selecting DEA specifications and ranking units via PCA," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 55(5), pages 521-528, May.
    8. Léopold Simar, 2007. "How to improve the performances of DEA/FDH estimators in the presence of noise?," Journal of Productivity Analysis, Springer, vol. 28(3), pages 183-201, December.
    9. Wagner, Janet M. & Shimshak, Daniel G., 2007. "Stepwise selection of variables in data envelopment analysis: Procedures and managerial perspectives," European Journal of Operational Research, Elsevier, vol. 180(1), pages 57-67, July.
    10. Jie Wu & Zhixiang Zhou, 2015. "A mixed-objective integer DEA model," Annals of Operations Research, Springer, vol. 228(1), pages 81-95, May.
    11. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    12. Boussofiane, A. & Dyson, R. G. & Thanassoulis, E., 1991. "Applied data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 52(1), pages 1-15, May.
    13. Huang, Zhimin & Li, Susan X., 1996. "Dominance stochastic models in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 95(2), pages 390-403, December.
    14. Jenkins, Larry & Anderson, Murray, 2003. "A multivariate statistical approach to reducing the number of variables in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 147(1), pages 51-61, May.
    15. Boris Bravo-Ureta & Daniel Solís & Víctor Moreira López & José Maripani & Abdourahmane Thiam & Teodoro Rivas, 2007. "Technical efficiency in farming: a meta-regression analysis," Journal of Productivity Analysis, Springer, vol. 27(1), pages 57-72, February.
    16. Hamparsum Bozdogan, 1987. "Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 345-370, September.
    17. Hughes, Andrew & Yaisawarng, Suthathip, 2004. "Sensitivity and dimensionality tests of DEA efficiency scores," European Journal of Operational Research, Elsevier, vol. 154(2), pages 410-422, April.
    18. Hoff, Ayoe, 2007. "Second stage DEA: Comparison of approaches for modelling the DEA score," European Journal of Operational Research, Elsevier, vol. 181(1), pages 425-435, August.
    19. Sarkis, Joseph, 2000. "A comparative analysis of DEA as a discrete alternative multiple criteria decision tool," European Journal of Operational Research, Elsevier, vol. 123(3), pages 543-557, June.
    20. Dyson, R. G. & Allen, R. & Camanho, A. S. & Podinovski, V. V. & Sarrico, C. S. & Shale, E. A., 2001. "Pitfalls and protocols in DEA," European Journal of Operational Research, Elsevier, vol. 132(2), pages 245-259, July.
    21. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    22. John Ruggiero, 2005. "Impact Assessment Of Input Omission On Dea," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 4(03), pages 359-368.
    23. Jesús T. Pastor & JosÉ L. Ruiz & Inmaculada Sirvent, 2002. "A Statistical Test for Nested Radial Dea Models," Operations Research, INFORMS, vol. 50(4), pages 728-735, August.
    24. Basak, Indrani, 2002. "On the use of information criteria in analytic hierarchy process," European Journal of Operational Research, Elsevier, vol. 141(1), pages 200-216, August.
    25. Cooper, William W. & Deng, H. & Huang, Zhimin & Li, Susan X., 2004. "Chance constrained programming approaches to congestion in stochastic data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 155(2), pages 487-501, June.
    26. Laure Latruffe & Kelvin Balcombe & Sophia Davidova & Katarzyna Zawalinska, 2004. "Determinants of technical efficiency of crop and livestock farms in Poland," Applied Economics, Taylor & Francis Journals, vol. 36(12), pages 1255-1263.
    27. Liang Liang & Jie Wu & Wade D. Cook & Joe Zhu, 2008. "The DEA Game Cross-Efficiency Model and Its Nash Equilibrium," Operations Research, INFORMS, vol. 56(5), pages 1278-1288, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jamal Ouenniche & Skarleth Carrales, 2018. "Assessing efficiency profiles of UK commercial banks: a DEA analysis with regression-based feedback," Annals of Operations Research, Springer, vol. 266(1), pages 551-587, July.
    2. Hosseini, Keyvan & Stefaniec, Agnieszka, 2019. "Efficiency assessment of Iran's petroleum refining industry in the presence of unprofitable output: A dynamic two-stage slacks-based measure," Energy, Elsevier, vol. 189(C).
    3. Can Zhang & Jixia Li & Tengfei Liu & Mengzhi Xu & Huachun Wang & Xu Li, 2022. "The Spatiotemporal Evolution and Influencing Factors of the Chinese Cities’ Ecological Welfare Performance," IJERPH, MDPI, vol. 19(19), pages 1-27, October.
    4. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Ranking the Importance of Variables in a Nonparametric Frontier Analysis Using Unsupervised Machine Learning Techniques," Mathematics, MDPI, vol. 11(11), pages 1-24, June.
    5. Benítez-Peña, Sandra & Bogetoft, Peter & Romero Morales, Dolores, 2020. "Feature Selection in Data Envelopment Analysis: A Mathematical Optimization approach," Omega, Elsevier, vol. 96(C).
    6. Imad Bou-Hamad & Abdel Latef Anouze & Ibrahim H. Osman, 2022. "A cognitive analytics management framework to select input and output variables for data envelopment analysis modeling of performance efficiency of banks using random forest and entropy of information," Annals of Operations Research, Springer, vol. 308(1), pages 63-92, January.
    7. Dai, Sheng, 2023. "Variable selection in convex quantile regression: L1-norm or L0-norm regularization?," European Journal of Operational Research, Elsevier, vol. 305(1), pages 338-355.
    8. Lin, Tzu-Yu & Chiu, Sheng-Hsiung & Yang, Hai-Lan, 2022. "Performance evaluation for regional innovation systems development in China based on the two-stage SBM-DNDEA model," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    9. Villanueva-Cantillo, Jeyms & Munoz-Marquez, Manuel, 2021. "Methodology for calculating critical values of relevance measures in variable selection methods in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 290(2), pages 657-670.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qiwei Xie & Yuanyuan Li & Lizheng Wang & Chao Liu, 2018. "Improving discrimination in data envelopment analysis without losing information based on Renyi’s entropy," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 26(4), pages 1053-1068, December.
    2. Charles, Vincent & Aparicio, Juan & Zhu, Joe, 2019. "The curse of dimensionality of decision-making units: A simple approach to increase the discriminatory power of data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 279(3), pages 929-940.
    3. Jamal Ouenniche & Skarleth Carrales, 2018. "Assessing efficiency profiles of UK commercial banks: a DEA analysis with regression-based feedback," Annals of Operations Research, Springer, vol. 266(1), pages 551-587, July.
    4. Nataraja, Niranjan R. & Johnson, Andrew L., 2011. "Guidelines for using variable selection techniques in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 215(3), pages 662-669, December.
    5. Valentin Zelenyuk, 2019. "Data Envelopment Analysis and Business Analytics: The Big Data Challenges and Some Solutions," CEPA Working Papers Series WP072019, School of Economics, University of Queensland, Australia.
    6. Peyrache, Antonio & Rose, Christiern & Sicilia, Gabriela, 2020. "Variable selection in Data Envelopment Analysis," European Journal of Operational Research, Elsevier, vol. 282(2), pages 644-659.
    7. Eskelinen, Juha, 2017. "Comparison of variable selection techniques for data envelopment analysis in a retail bank," European Journal of Operational Research, Elsevier, vol. 259(2), pages 778-788.
    8. Anna Łozowicka & Bartłomiej Lach, 2022. "CI-DEA: A Way to Improve the Discriminatory Power of DEA—Using the Example of the Efficiency Assessment of the Digitalization in the Life of the Generation 50+," Sustainability, MDPI, vol. 14(6), pages 1-22, March.
    9. Liu, John S. & Lu, Louis Y.Y. & Lu, Wen-Min, 2016. "Research fronts in data envelopment analysis," Omega, Elsevier, vol. 58(C), pages 33-45.
    10. Villanueva-Cantillo, Jeyms & Munoz-Marquez, Manuel, 2021. "Methodology for calculating critical values of relevance measures in variable selection methods in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 290(2), pages 657-670.
    11. Li, Yongjun & Yang, Feng & Liang, Liang & Hua, Zhongsheng, 2009. "Allocating the fixed cost as a complement of other cost inputs: A DEA approach," European Journal of Operational Research, Elsevier, vol. 197(1), pages 389-401, August.
    12. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    13. Imad Bou-Hamad & Abdel Latef Anouze & Ibrahim H. Osman, 2022. "A cognitive analytics management framework to select input and output variables for data envelopment analysis modeling of performance efficiency of banks using random forest and entropy of information," Annals of Operations Research, Springer, vol. 308(1), pages 63-92, January.
    14. Diaz-Balteiro, Luis & Casimiro Herruzo, A. & Martinez, Margarita & Gonzalez-Pachon, Jacinto, 2006. "An analysis of productive efficiency and innovation activity using DEA: An application to Spain's wood-based industry," Forest Policy and Economics, Elsevier, vol. 8(7), pages 762-773, October.
    15. Toloo, Mehdi & Tone, Kaoru & Izadikhah, Mohammad, 2023. "Selecting slacks-based data envelopment analysis models," European Journal of Operational Research, Elsevier, vol. 308(3), pages 1302-1318.
    16. Wen-Chih Chen & Andrew Johnson, 2010. "The dynamics of performance space of Major League Baseball pitchers 1871–2006," Annals of Operations Research, Springer, vol. 181(1), pages 287-302, December.
    17. Lee, Boon L. & Wilson, Clevo & Simshauser, Paul & Majiwa, Eucabeth, 2021. "Deregulation, efficiency and policy determination: An analysis of Australia's electricity distribution sector," Energy Economics, Elsevier, vol. 98(C).
    18. Chaouk, Mohammed & Pagliari, Dr Romano & Moxon, Richard, 2020. "The impact of national macro-environment exogenous variables on airport efficiency," Journal of Air Transport Management, Elsevier, vol. 82(C).
    19. Zelenyuk, Valentin, 2020. "Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data," European Journal of Operational Research, Elsevier, vol. 282(1), pages 172-187.
    20. Sickles, Robin C. & Song, Wonho & Zelenyuk, Valentin, 2018. "Econometric Analysis of Productivity: Theory and Implementation in R," Working Papers 18-008, Rice University, Department of Economics.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:253:y:2017:i:1:d:10.1007_s10479-016-2382-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.