IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v327y2025i3p957-970.html

Detecting two-dimensional projection-efficient units in data envelopment analysis under big data scenarios

Author

Listed:
  • Xu, Shuqi
  • Zhu, Qingyuan
  • Shen, Zhiyang
  • Vardanyan, Michael
  • Pan, Yinghao

Abstract

In the age of big data, traditional estimation methods may struggle to process large datasets efficiently. Ali (1993) laid the foundation for improving efficiency assessment using Data Envelopment Analysis (DEA). Building on this work, we demonstrate how to detect two-dimensional projection-efficient units. This is achieved by projecting the multidimensional DEA production frontier onto two-dimensional subspaces and utilizing slope analysis to identify key efficient units. These units are then linked to their full-dimensional counterparts to define projection-efficient units. We propose using these key efficient units as a preliminary step to speed up the identification of full-dimensional efficient units or to estimate the relative density of datasets. Simulations show that our method reduces computation time for the two fastest approaches by an average of 54.2 % across different datasets.

Suggested Citation

  • Xu, Shuqi & Zhu, Qingyuan & Shen, Zhiyang & Vardanyan, Michael & Pan, Yinghao, 2025. "Detecting two-dimensional projection-efficient units in data envelopment analysis under big data scenarios," European Journal of Operational Research, Elsevier, vol. 327(3), pages 957-970.
  • Handle: RePEc:eee:ejores:v:327:y:2025:i:3:p:957-970
    DOI: 10.1016/j.ejor.2025.05.053
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221725004515
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2025.05.053?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Khezrimotlagh, Dariush & Zhu, Joe & Cook, Wade D. & Toloo, Mehdi, 2019. "Data envelopment analysis and big data," European Journal of Operational Research, Elsevier, vol. 274(3), pages 1047-1054.
    2. Dariush Khezrimotlagh, 2021. "Parallel Processing and Large-Scale Datasets in Data Envelopment Analysis," International Series in Operations Research & Management Science, in: Joe Zhu & Vincent Charles (ed.), Data-Enabled Analytics, pages 159-174, Springer.
    3. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    4. Chu, Junfei & Rui, Yuting & Khezrimotlagh, Dariush & Zhu, Joe, 2024. "A general computational framework and a hybrid algorithm for large-scale data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 316(2), pages 639-650.
    5. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    6. Ali, Agha Iqbal, 1993. "Streamlined computation for data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 64(1), pages 61-67, January.
    7. Richard Barr & Matthew Durchholz, 1997. "Parallel and hierarchical decomposition approaches for solving large-scale Data Envelopment Analysis models," Annals of Operations Research, Springer, vol. 73(0), pages 339-372, October.
    8. Victor V. Podinovski, 2022. "Variable and Constant Returns-to-Scale Production Technologies with Component Processes," Operations Research, INFORMS, vol. 70(2), pages 1238-1258, March.
    9. Wen-Chih Chen & Sheng-Yung Lai, 2017. "Determining radial efficiency with a large data set by solving small-size linear programs," Annals of Operations Research, Springer, vol. 250(1), pages 147-166, March.
    10. Murty, Sushama & Robert Russell, R. & Levkoff, Steven B., 2012. "On modeling pollution-generating technologies," Journal of Environmental Economics and Management, Elsevier, vol. 64(1), pages 117-135.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuanxiang Zhou & Shan Wang & Shuqi Xu & Qingyuan Zhu, 2025. "Big data in data envelopment analysis with undesirable outputs based on simulation and environmental-health matching data of Chinese industrial enterprises," Annals of Operations Research, Springer, vol. 348(1), pages 279-298, May.
    2. Chu, Junfei & Rui, Yuting & Khezrimotlagh, Dariush & Zhu, Joe, 2024. "A general computational framework and a hybrid algorithm for large-scale data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 316(2), pages 639-650.
    3. Shengqing Chang & Jingjing Ding & Chenpeng Feng & Ruifeng Wang, 2024. "A Hybrid Parallel Processing Strategy for Large-Scale DEA Computation," Computational Economics, Springer;Society for Computational Economics, vol. 63(6), pages 2325-2349, June.
    4. Khezrimotlagh, Dariush & Cook, Wade D. & Zhu, Joe, 2020. "A nonparametric framework to detect outliers in estimating production frontiers," European Journal of Operational Research, Elsevier, vol. 286(1), pages 375-388.
    5. Valentin Zelenyuk, 2019. "Data Envelopment Analysis and Business Analytics: The Big Data Challenges and Some Solutions," CEPA Working Papers Series WP072019, School of Economics, University of Queensland, Australia.
    6. Wang, Mengyuan & Chen, Ya & Zelenyuk, Valentin, 2025. "DEA for big wide data based on regularization approaches: An application to energy efficiency analysis," Energy Economics, Elsevier, vol. 152(C).
    7. Zelenyuk, Valentin, 2020. "Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data," European Journal of Operational Research, Elsevier, vol. 282(1), pages 172-187.
    8. Khezrimotlagh, Dariush & Zhu, Joe & Cook, Wade D. & Toloo, Mehdi, 2019. "Data envelopment analysis and big data," European Journal of Operational Research, Elsevier, vol. 274(3), pages 1047-1054.
    9. Wen-Chih Chen & Sheng-Yung Lai, 2017. "Determining radial efficiency with a large data set by solving small-size linear programs," Annals of Operations Research, Springer, vol. 250(1), pages 147-166, March.
    10. Jie Wu & Chao-Chao Zhang & Jun-Fei Chu & Guang-Cheng Xu & Ying-Hao Pan, 2025. "Environmental efficiency evaluation of China’s coal-fired electricity supply chain enterprises with large-scale datasets: an enhanced build hull algorithm," Annals of Operations Research, Springer, vol. 355(1), pages 391-418, December.
    11. Jin, Zhuo & Zhou, Zhixiang & Wu, Huaqing, 2025. "Enhancing digital manufacturing efficiency and dominance relation driven big Data analytics," International Journal of Production Economics, Elsevier, vol. 289(C).
    12. Tao Jie, 2020. "Parallel processing of the Build Hull algorithm to address the large-scale DEA problem," Annals of Operations Research, Springer, vol. 295(1), pages 453-481, December.
    13. Atkinson, Scott E. & Tsionas, Mike G., 2021. "Generalized estimation of productivity with multiple bad outputs: The importance of materials balance constraints," European Journal of Operational Research, Elsevier, vol. 292(3), pages 1165-1186.
    14. Idiano D’Adamo & Cinzia Daraio & Simone Di Leo & Léopold Simar, 2024. "A Flexible and Sustainable Analysis of Waste Efficiency at the European Level," Global Journal of Flexible Systems Management, Springer;Global Institute of Flexible Systems Management, vol. 25(4), pages 881-894, December.
    15. Chen, Xiaodong & Miao, Zhuang & Wu, Ge & Zhu, Pengyu, 2024. "City-level green growth accounting: Evidence from China's thirteen urban agglomerations," Renewable and Sustainable Energy Reviews, Elsevier, vol. 203(C).
    16. Tavana, Madjid & Izadikhah, Mohammad & Toloo, Mehdi & Roostaee, Razieh, 2021. "A new non-radial directional distance model for data envelopment analysis problems with negative and flexible measures," Omega, Elsevier, vol. 102(C).
    17. Pham, Manh D. & Zelenyuk, Valentin, 2019. "Weak disposability in nonparametric production analysis: A new taxonomy of reference technology sets," European Journal of Operational Research, Elsevier, vol. 274(1), pages 186-198.
    18. Ali, Agha Iqbal & Lerme, Catherine S. & Seiford, Lawrence M., 1995. "Components of efficiency evaluation in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 80(3), pages 462-473, February.
    19. Guillen, Maria D. & Charles, Vincent & Aparicio, Juan, 2025. "Enhanced efficiency assessment in manufacturing: Leveraging machine learning for improved performance analysis," Omega, Elsevier, vol. 134(C).
    20. Premachandra, I. M., 2001. "A note on DEA vs principal component analysis: An improvement to Joe Zhu's approach," European Journal of Operational Research, Elsevier, vol. 132(3), pages 553-560, August.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:327:y:2025:i:3:p:957-970. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.