IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v348y2025i1d10.1007_s10479-022-05010-w.html
   My bibliography  Save this article

Big data in data envelopment analysis with undesirable outputs based on simulation and environmental-health matching data of Chinese industrial enterprises

Author

Listed:
  • Yuanxiang Zhou

    (Anhui University of Finance and Economics)

  • Shan Wang

    (Anhui University of Finance and Economics)

  • Shuqi Xu

    (Nanjing University of Aeronautics and Astronautics)

  • Qingyuan Zhu

    (Nanjing University of Aeronautics and Astronautics)

Abstract

There is a growing enthusiasm to combine the data envelopment analysis (DEA) method with the conception of “big data.” This study proposes a new algorithm to deal with massive data for DEA with undesirable output. First, we modified the current weak disposability model to deal with undesirable output and prove its applicability. Then, we introduced the two fastest approaches (build hull approach and pre-score approach) and found that they cannot be directly employed in DEA with undesirable output because the pre-score approach may not be as powerful. We modified these two approaches to make them capable of dealing with undesirable output and integrated them by proposing a new algorithm. In practice, we used simulated data and data of Chinese industrial enterprises including environmental and health indicators, and all the above data meet the requirements of the model. The results show that our approach can not only be applied to deal with undesirable output in DEA, but it also can perform better than the build hull approach in some situations. Our approach also benefits the pre-score approach if it cannot include total efficiency units. Finally, five datasets were collected or generated to test the capability of our approach.

Suggested Citation

  • Yuanxiang Zhou & Shan Wang & Shuqi Xu & Qingyuan Zhu, 2025. "Big data in data envelopment analysis with undesirable outputs based on simulation and environmental-health matching data of Chinese industrial enterprises," Annals of Operations Research, Springer, vol. 348(1), pages 279-298, May.
  • Handle: RePEc:spr:annopr:v:348:y:2025:i:1:d:10.1007_s10479-022-05010-w
    DOI: 10.1007/s10479-022-05010-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-022-05010-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-022-05010-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tao Jie, 2020. "Parallel processing of the Build Hull algorithm to address the large-scale DEA problem," Annals of Operations Research, Springer, vol. 295(1), pages 453-481, December.
    2. Khezrimotlagh, Dariush & Zhu, Joe & Cook, Wade D. & Toloo, Mehdi, 2019. "Data envelopment analysis and big data," European Journal of Operational Research, Elsevier, vol. 274(3), pages 1047-1054.
    3. Timo Kuosmanen, 2005. "Weak Disposability in Nonparametric Production Analysis with Undesirable Outputs," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 87(4), pages 1077-1082.
    4. Golany, B & Roll, Y, 1989. "An application procedure for DEA," Omega, Elsevier, vol. 17(3), pages 237-250.
    5. Hampf, Benjamin & Rødseth, Kenneth Løvold, 2015. "Carbon dioxide emission standards for U.S. power plants: An efficiency analysis perspective," Energy Economics, Elsevier, vol. 50(C), pages 140-153.
    6. Hampf, Benjamin & Rødseth, Kenneth Løvold, 2015. "Carbon dioxode emission standards for U.S. power plants: An efficiency analysis perspective," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 77009, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
    7. Rolf Färe & Shawna Grosskopf, 2003. "Nonparametric Productivity Analysis with Undesirable Outputs: Comment," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 85(4), pages 1070-1074.
    8. Leleu, Hervé, 2013. "Shadow pricing of undesirable outputs in nonparametric analysis," European Journal of Operational Research, Elsevier, vol. 231(2), pages 474-480.
    9. Sebastián Lozano & Gabriel Villa, 2004. "Centralized Resource Allocation Using Data Envelopment Analysis," Journal of Productivity Analysis, Springer, vol. 22(1), pages 143-161, July.
    10. R. Robert Russell & William Schworm, 2018. "Technological inefficiency indexes: a binary taxonomy and a generic theorem," Journal of Productivity Analysis, Springer, vol. 49(1), pages 17-23, February.
    11. Chien-Ming Chen, 2014. "Evaluating eco-efficiency with data envelopment analysis: an analytical reexamination," Annals of Operations Research, Springer, vol. 214(1), pages 49-71, March.
    12. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    13. Kao, Chiang & Hwang, Shiuh-Nan, 2021. "Measuring the effects of undesirable outputs on the efficiency of production units," European Journal of Operational Research, Elsevier, vol. 292(3), pages 996-1003.
    14. Fare, Rolf, 1989. "Multilateral Productivity Comparisons When Some Outputs Are Undesirable: A Nonparametric Approach," The Review of Economics and Statistics, MIT Press, vol. 71(1), pages 90-98, February.
    15. Atakelty Hailu & Terrence S. Veeman, 2001. "Non-parametric Productivity Analysis with Undesirable Outputs: An Application to the Canadian Pulp and Paper Industry," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 83(3), pages 605-616.
    16. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    17. Seiford, Lawrence M. & Zhu, Joe, 2002. "Modeling undesirable factors in efficiency evaluation," European Journal of Operational Research, Elsevier, vol. 142(1), pages 16-20, October.
    18. Podinovski, Victor V. & Kuosmanen, Timo, 2011. "Modelling weak disposability in data envelopment analysis under relaxed convexity assumptions," European Journal of Operational Research, Elsevier, vol. 211(3), pages 577-585, June.
    19. Ali, Agha Iqbal, 1993. "Streamlined computation for data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 64(1), pages 61-67, January.
    20. Chen, Shiyi & Santos-Paulino, Amelia U., 2013. "Energy consumption restricted productivity re-estimates and industrial sustainability analysis in post-reform China," Energy Policy, Elsevier, vol. 57(C), pages 52-60.
    21. Jun-Fei Chu & Jie Wu & Ma-Lin Song, 2018. "An SBM-DEA model with parallel computing design for environmental efficiency evaluation in the big data context: a transportation system application," Annals of Operations Research, Springer, vol. 270(1), pages 105-124, November.
    22. Richard Barr & Matthew Durchholz, 1997. "Parallel and hierarchical decomposition approaches for solving large-scale Data Envelopment Analysis models," Annals of Operations Research, Springer, vol. 73(0), pages 339-372, October.
    23. Zelenyuk, Valentin, 2020. "Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data," European Journal of Operational Research, Elsevier, vol. 282(1), pages 172-187.
    24. Jeanneaux, Philippe & Latruffe, Laure, 2016. "Modelling pollution-generating technologies in performance benchmarking: Recent developments, limits and future prospects in the nonparametric frameworkAuthor-Name: Dakpo, K. Hervé," European Journal of Operational Research, Elsevier, vol. 250(2), pages 347-359.
    25. Murty, Sushama & Robert Russell, R. & Levkoff, Steven B., 2012. "On modeling pollution-generating technologies," Journal of Environmental Economics and Management, Elsevier, vol. 64(1), pages 117-135.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abad, Arnaud & Briec, Walter, 2019. "On the axiomatic of pollution-generating technologies: Non-parametric production analysis," European Journal of Operational Research, Elsevier, vol. 277(1), pages 377-390.
    2. Pham, Manh D. & Zelenyuk, Valentin, 2019. "Weak disposability in nonparametric production analysis: A new taxonomy of reference technology sets," European Journal of Operational Research, Elsevier, vol. 274(1), pages 186-198.
    3. Chu, Junfei & Shao, Caifeng & Emrouznejad, Ali & Wu, Jie & Yuan, Zhe, 2021. "Performance evaluation of organizations considering economic incentives for emission reduction: A carbon emission permit trading approach," Energy Economics, Elsevier, vol. 101(C).
    4. Wang, Ke & Wei, Yi-Ming & Huang, Zhimin, 2018. "Environmental efficiency and abatement efficiency measurements of China's thermal power industry: A data envelopment analysis based materials balance approach," European Journal of Operational Research, Elsevier, vol. 269(1), pages 35-50.
    5. Cherchye, Laurens & Rock, Bram De & Walheer, Barnabé, 2015. "Multi-output efficiency with good and bad outputs," European Journal of Operational Research, Elsevier, vol. 240(3), pages 872-881.
    6. Andreas Eder, 2022. "Environmental efficiency measurement when producers control pollutants under heterogeneous conditions: a generalization of the materials balance approach," Journal of Productivity Analysis, Springer, vol. 57(2), pages 157-176, April.
    7. Andreas Eder, 2021. "Environmental efficiency measurement when producers control pollutants under heterogeneous conditions: a generalization of the materials balance approach," Working Papers 752021, University of Natural Resources and Life Sciences, Vienna, Department of Economics and Social Sciences, Institute for Sustainable Economic Development.
    8. repec:zbw:inwedp:752021 is not listed on IDEAS
    9. Barnabé Walheer, 2020. "Output, input, and undesirable output interconnections in data envelopment analysis: convexity and returns-to-scale," Annals of Operations Research, Springer, vol. 284(1), pages 447-467, January.
    10. Jeanneaux, Philippe & Latruffe, Laure, 2016. "Modelling pollution-generating technologies in performance benchmarking: Recent developments, limits and future prospects in the nonparametric frameworkAuthor-Name: Dakpo, K. Hervé," European Journal of Operational Research, Elsevier, vol. 250(2), pages 347-359.
    11. Guillen, Maria D. & Aparicio, Juan & Kapelko, Magdalena & Esteve, Miriam, 2025. "Measuring environmental inefficiency through machine learning: An approach based on efficiency analysis trees and by-production technology," European Journal of Operational Research, Elsevier, vol. 321(2), pages 529-542.
    12. Beltrán-Esteve, Mercedes & Picazo-Tadeo, Andrés J., 2017. "Assessing environmental performance in the European Union: Eco-innovation versus catching-up," Energy Policy, Elsevier, vol. 104(C), pages 240-252.
    13. Ke Wang & Yi-Ming Wei & Zhimin Huang, 2017. "Environmental efficiency and abatement efficiency measurements of China¡¯s thermal power industry: A data envelopment analysis based materials balance approach," CEEP-BIT Working Papers 108, Center for Energy and Environmental Policy Research (CEEP), Beijing Institute of Technology.
    14. Fukuyama, Hirofumi & Matousek, Roman & Tzeremes, Nickolaos G., 2024. "A unified framework for nonperforming loan modeling in bank production: An application of data envelopment analysis," Omega, Elsevier, vol. 126(C).
    15. Mahnaz Maghbouli & Azam Pourhabib Yekta & Josef Jablonsky, 2024. "Weak disposability in the non-parametric analysis of efficiency. An interval abatement factor," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 34(3), pages 193-203.
    16. Roshdi, Israfil & Hasannasab, Maryam & Margaritis, Dimitris & Rouse, Paul, 2018. "Generalised weak disposability and efficiency measurement in environmental technologies," European Journal of Operational Research, Elsevier, vol. 266(3), pages 1000-1012.
    17. Ke Wang & Zhifu Mi & Yi‐Ming Wei, 2019. "Will Pollution Taxes Improve Joint Ecological and Economic Efficiency of Thermal Power Industry in China?: A DEA‐Based Materials Balance Approach," Journal of Industrial Ecology, Yale University, vol. 23(2), pages 389-401, April.
    18. Sahoo, Biresh K. & Luptacik, Mikulas & Mahlberg, Bernhard, 2011. "Alternative measures of environmental technology structure in DEA: An application," European Journal of Operational Research, Elsevier, vol. 215(3), pages 750-762, December.
    19. Shen, Zhiyang & Bai, Kaixuan & Hong, Tianyang & Balezentis, Tomas, 2021. "Evaluation of carbon shadow price within a non-parametric meta-frontier framework: The case of OECD, ASEAN and BRICS," Applied Energy, Elsevier, vol. 299(C).
    20. Cui, Qiang, 2021. "A data-based comparison of the five undesirable output disposability approaches in airline environmental efficiency," Socio-Economic Planning Sciences, Elsevier, vol. 74(C).
    21. Aparicio, Juan & Kapelko, Magdalena & Zofío, José L., 2020. "The measurement of environmental economic inefficiency with pollution-generating technologies," Resource and Energy Economics, Elsevier, vol. 62(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:348:y:2025:i:1:d:10.1007_s10479-022-05010-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.