IDEAS home Printed from https://ideas.repec.org/a/gam/jlands/v13y2024i3p327-d1350794.html
   My bibliography  Save this article

Effects of Big Data on PM 2.5 : A Study Based on Double Machine Learning

Author

Listed:
  • Xinyu Wei

    (School of Economics and Management, Tongji University, Shanghai 200092, China)

  • Mingwang Cheng

    (School of Economics and Management, Tongji University, Shanghai 200092, China)

  • Kaifeng Duan

    (School of Economics and Management, Fuzhou University, Fuzhou 350108, China)

  • Xiangxing Kong

    (School of Economics and Management, Tongji University, Shanghai 200092, China)

Abstract

The critical role of high-quality urban development and scientific land use in leveraging big data for air quality enhancement is paramount. The application of machine learning for causal inferences in research related to big data development and air pollution presents considerable potential. This study employs a double machine learning model to explore the impact of big data development on the PM 2.5 concentration in 277 prefecture-level cities across China. This analysis is grounded in the quasi-natural experiment named the National Big Data Comprehensive Pilot Zone. The findings reveal a significant inverse relationship between big data development and PM 2.5 levels, with a correlation coefficient of −0.0149, a result consistently supported by various robustness checks. Further mechanism analyses elucidate that big data development markedly diminishes PM 2.5 levels through the avenues of enhanced urban development and land use planning. The examination of heterogeneity underscores big data’s suppressive effect on PM 2.5 levels across central, eastern, and western regions, as well as in both resource-dependent and non-resource-dependent cities, albeit with varying degrees of significance. This study offers policy recommendations for the formulation and execution of big data policies, emphasizing the importance of acknowledging local variances and the structural nuances of urban economies.

Suggested Citation

  • Xinyu Wei & Mingwang Cheng & Kaifeng Duan & Xiangxing Kong, 2024. "Effects of Big Data on PM 2.5 : A Study Based on Double Machine Learning," Land, MDPI, vol. 13(3), pages 1-21, March.
  • Handle: RePEc:gam:jlands:v:13:y:2024:i:3:p:327-:d:1350794
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2073-445X/13/3/327/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2073-445X/13/3/327/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    2. Helmut Farbmacher & Martin Huber & Lukáš Lafférs & Henrika Langen & Martin Spindler, 2022. "Causal mediation analysis with double machine learning [Mediation analysis via potential outcomes models]," The Econometrics Journal, Royal Economic Society, vol. 25(2), pages 277-300.
    3. Yao Zhao & Xuena Kong & Mahmood Ahmad & Zahoor Ahmed, 2023. "Digital Economy, Industrial Structure, and Environmental Quality: Assessing the Roles of Educational Investment, Green Innovation, and Economic Globalization," Sustainability, MDPI, vol. 15(3), pages 1-24, January.
    4. Qiuqiu Guo & Xiaoyu Ma, 2023. "How Does the Digital Economy Affect Sustainable Urban Development? Empirical Evidence from Chinese Cities," Sustainability, MDPI, vol. 15(5), pages 1-21, February.
    5. Jie Li & Zhengchuan Sun & Jie Zhou & Yaya Sow & Xufeng Cui & Haipeng Chen & Qianling Shen, 2023. "The Impact of the Digital Economy on Carbon Emissions from Cultivated Land Use," Land, MDPI, vol. 12(3), pages 1-18, March.
    6. Jianing Pang & Yimeng Zhang & Fangyi Jiao, 2023. "The Impact of the Digital Economy on Transformation and Upgrading of Industrial Structure: A Perspective Based on the “Poverty Trap”," Sustainability, MDPI, vol. 15(20), pages 1-20, October.
    7. Ning Hui & Qian Yu & Yu Gu, 2023. "Does the Digital Economy Improve the Innovation Efficiency of the Manufacturing Industry? Evidence in Provincial Data from China," Sustainability, MDPI, vol. 15(13), pages 1-17, July.
    8. Menggen Chen & Songyangyang Zhao & Jiawen Wang, 2023. "The Impact of the Digital Economy on Regional Carbon Emissions: Evidence from China," Sustainability, MDPI, vol. 15(20), pages 1-34, October.
    9. Yang, Jui-Chung & Chuang, Hui-Ching & Kuan, Chung-Ming, 2020. "Double machine learning with gradient boosting and its application to the Big N audit quality effect," Journal of Econometrics, Elsevier, vol. 216(1), pages 268-283.
    10. Huaxue Zhao & Yu Cheng & Ruijing Zheng, 2022. "Impact of the Digital Economy on PM 2.5 : Experience from the Middle and Lower Reaches of the Yellow River Basin," IJERPH, MDPI, vol. 19(24), pages 1-20, December.
    11. Ying Tang & Menghan Chen, 2022. "The Impact Mechanism and Spillover Effect of Digital Rural Construction on the Efficiency of Green Transformation for Cultivated Land Use in China," IJERPH, MDPI, vol. 19(23), pages 1-18, December.
    12. Xing Zhang & Jian Zhong & Huanfang Wang, 2023. "Does the Development of Digital Economy Affect Environmental Pollution?," Sustainability, MDPI, vol. 15(12), pages 1-18, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jonathan Fuhr & Philipp Berens & Dominik Papies, 2024. "Estimating Causal Effects with Double Machine Learning -- A Method Evaluation," Papers 2403.14385, arXiv.org.
    2. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    3. Yong Bian & Xiqian Wang & Qin Zhang, 2023. "How Does China's Household Portfolio Selection Vary with Financial Inclusion?," Papers 2311.01206, arXiv.org.
    4. Huber, Martin & Meier, Jonas & Wallimann, Hannes, 2022. "Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 22-39.
    5. Rahul Singh & Liyuan Xu & Arthur Gretton, 2020. "Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves," Papers 2010.04855, arXiv.org, revised Oct 2022.
    6. Bilgin, Rumeysa, 2023. "The Selection Of Control Variables In Capital Structure Research With Machine Learning," SocArXiv e26qf, Center for Open Science.
    7. Rahul Singh & Liyuan Xu & Arthur Gretton, 2021. "Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves," Papers 2111.03950, arXiv.org, revised Jul 2023.
    8. Jie Li & Zhengchuan Sun & Qin Gao & Yanbin Qi, 2024. "Evaluation of Cropland Utilization Eco-Efficiency and Influencing Factors in Primary Grain-Producing Regions of China," Agriculture, MDPI, vol. 14(2), pages 1-18, February.
    9. AmirEmad Ghassami & Andrew Ying & Ilya Shpitser & Eric Tchetgen Tchetgen, 2021. "Minimax Kernel Machine Learning for a Class of Doubly Robust Functionals with Application to Proximal Causal Inference," Papers 2104.02929, arXiv.org, revised Mar 2022.
    10. Nora Bearth & Michael Lechner, 2024. "Causal Machine Learning for Moderation Effects," Papers 2401.08290, arXiv.org, revised Apr 2024.
    11. Wang, Xiqian & Bian, Yong & Zhang, Qin, 2023. "The effect of cooking fuel choice on the elderly’s well-being: Evidence from two non-parametric methods," Energy Economics, Elsevier, vol. 125(C).
    12. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    13. Nicolaj N. Mühlbach, 2020. "Tree-based Synthetic Control Methods: Consequences of moving the US Embassy," CREATES Research Papers 2020-04, Department of Economics and Business Economics, Aarhus University.
    14. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    15. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    16. Ruoxuan Xiong & Allison Koenecke & Michael Powell & Zhu Shen & Joshua T. Vogelstein & Susan Athey, 2021. "Federated Causal Inference in Heterogeneous Observational Data," Papers 2107.11732, arXiv.org, revised Apr 2023.
    17. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    18. Jelena Bradic & Weijie Ji & Yuqian Zhang, 2021. "High-dimensional Inference for Dynamic Treatment Effects," Papers 2110.04924, arXiv.org, revised May 2023.
    19. Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
    20. Zhenxiang Cao & Liqing Peng, 2023. "The Impact of Digital Economics on Environmental Quality: A System Dynamics Approach," SAGE Open, , vol. 13(4), pages 21582440231, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jlands:v:13:y:2024:i:3:p:327-:d:1350794. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.