IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2504.12450.html
   My bibliography  Save this paper

Can Moran Eigenvectors Improve Machine Learning of Spatial Data? Insights from Synthetic Data Validation

Author

Listed:
  • Ziqi Li
  • Zhan Peng

Abstract

Moran Eigenvector Spatial Filtering (ESF) approaches have shown promise in accounting for spatial effects in statistical models. Can this extend to machine learning? This paper examines the effectiveness of using Moran Eigenvectors as additional spatial features in machine learning models. We generate synthetic datasets with known processes involving spatially varying and nonlinear effects across two different geometries. Moran Eigenvectors calculated from different spatial weights matrices, with and without a priori eigenvector selection, are tested. We assess the performance of popular machine learning models, including Random Forests, LightGBM, XGBoost, and TabNet, and benchmark their accuracies in terms of cross-validated R2 values against models that use only coordinates as features. We also extract coefficients and functions from the models using GeoShapley and compare them with the true processes. Results show that machine learning models using only location coordinates achieve better accuracies than eigenvector-based approaches across various experiments and datasets. Furthermore, we discuss that while these findings are relevant for spatial processes that exhibit positive spatial autocorrelation, they do not necessarily apply when modeling network autocorrelation and cases with negative spatial autocorrelation, where Moran Eigenvectors would still be useful.

Suggested Citation

  • Ziqi Li & Zhan Peng, 2025. "Can Moran Eigenvectors Improve Machine Learning of Spatial Data? Insights from Synthetic Data Validation," Papers 2504.12450, arXiv.org.
  • Handle: RePEc:arx:papers:2504.12450
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2504.12450
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Daniel Griffith, 2006. "Hidden negative spatial autocorrelation," Journal of Geographical Systems, Springer, vol. 8(4), pages 335-355, October.
    2. Yongwan Chun, 2008. "Modeling network autocorrelation within migration flows by eigenvector spatial filtering," Journal of Geographical Systems, Springer, vol. 10(4), pages 317-344, December.
    3. A. Stewart Fotheringham & Wenbai Yang & Wei Kang, 2017. "Multiscale Geographically Weighted Regression (MGWR)," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 107(6), pages 1247-1265, November.
    4. Daniel A. Griffith, 2019. "Negative Spatial Autocorrelation: One of the Most Neglected Concepts in Spatial Statistics," Stats, MDPI, vol. 2(3), pages 1-28, August.
    5. Di Zhu & Fan Zhang & Shengyin Wang & Yaoli Wang & Ximeng Cheng & Zhou Huang & Yu Liu, 2020. "Understanding Place Characteristics in Geographic Contexts through Graph Convolutional Neural Networks," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 110(2), pages 408-420, March.
    6. Ziqi Li, 2024. "GeoShapley: A Game Theory Approach to Measuring Spatial Effects in Machine Learning Models," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 114(7), pages 1365-1385, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daisuke Murakami & Daniel Griffith, 2015. "Random effects specifications in eigenvector spatial filtering: a simulation study," Journal of Geographical Systems, Springer, vol. 17(4), pages 311-331, October.
    2. Lan Hu & Yongwan Chun & Daniel A. Griffith, 2020. "Uncovering a positive and negative spatial autocorrelation mixture pattern: a spatial analysis of breast cancer incidences in Broward County, Florida, 2000–2010," Journal of Geographical Systems, Springer, vol. 22(3), pages 291-308, July.
    3. Yanzhao Wang & Jianfei Cao, 2023. "Examining the Effects of Socioeconomic Development on Fine Particulate Matter (PM2.5) in China’s Cities Based on Spatial Autocorrelation Analysis and MGWR Model," IJERPH, MDPI, vol. 20(4), pages 1-23, February.
    4. Jinzhao Song & Qing Feng & Xiaoping Wang & Hanliang Fu & Wei Jiang & Baiyu Chen, 2018. "Spatial Association and Effect Evaluation of CO 2 Emission in the Chengdu-Chongqing Urban Agglomeration: Quantitative Evidence from Social Network Analysis," Sustainability, MDPI, vol. 11(1), pages 1-19, December.
    5. Wang, Xiaoxi & Zhang, Yaojun & Yu, Danlin & Qi, Jinghan & Li, Shujing, 2022. "Investigating the spatiotemporal pattern of urban vibrancy and its determinants: Spatial big data analyses in Beijing, China," Land Use Policy, Elsevier, vol. 119(C).
    6. Hengyu Gu & Hanchen Yu & Mehak Sachdeva & Ye Liu, 2021. "Analyzing the distribution of researchers in China: An approach using multiscale geographically weighted regression," Growth and Change, Wiley Blackwell, vol. 52(1), pages 443-459, March.
    7. Shichao Lu & Zhihua Zhang & M. James C. Crabbe & Prin Suntichaikul, 2024. "Effects of Urban Land-Use Planning on Housing Prices in Chiang Mai, Thailand," Land, MDPI, vol. 13(8), pages 1-13, July.
    8. Jin, Peizhen & Mangla, Sachin Kumar & Song, Malin, 2021. "Moving towards a sustainable and innovative city: Internal urban traffic accessibility and high-level innovation based on platform monitoring data," International Journal of Production Economics, Elsevier, vol. 235(C).
    9. Daniel A. Griffith & Yongwan Chun & Jan Hauke, 2022. "A Moran eigenvector spatial filtering specification of entropy measures," Papers in Regional Science, Wiley Blackwell, vol. 101(1), pages 259-279, February.
    10. Daniel A. Griffith & Manfred M. Fischer, 2016. "Constrained Variants of the Gravity Model and Spatial Dependence: Model Specification and Estimation Issues," Advances in Spatial Science, in: Roberto Patuelli & Giuseppe Arbia (ed.), Spatial Econometric Interaction Modelling, chapter 0, pages 37-66, Springer.
    11. Aurélien Fichet de Clairfontaine & Manfred Fischer & Rafael Lata & Manfred Paier, 2015. "Barriers to cross-region research and development collaborations in Europe: evidence from the fifth European Framework Programme," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 54(2), pages 577-590, March.
    12. Chunfang Zhao & Yingliang Wu & Yunfeng Chen & Guohua Chen, 2023. "Multiscale Effects of Hedonic Attributes on Airbnb Listing Prices Based on MGWR: A Case Study of Beijing, China," Sustainability, MDPI, vol. 15(2), pages 1-21, January.
    13. Qing Luo & Daniel A. Griffith & Huayi Wu, 2019. "Spatial autocorrelation for massive spatial data: verification of efficiency and statistical power asymptotics," Journal of Geographical Systems, Springer, vol. 21(2), pages 237-269, June.
    14. Li Gao & Mingjing Huang & Wuping Zhang & Lei Qiao & Guofang Wang & Xumeng Zhang, 2021. "Comparative Study on Spatial Digital Mapping Methods of Soil Nutrients Based on Different Geospatial Technologies," Sustainability, MDPI, vol. 13(6), pages 1-19, March.
    15. Oshan, Taylor M., 2022. "Spatial Interaction Modeling," OSF Preprints m3ah8, Center for Open Science.
    16. Rene Westerholt & Enrico Steiger & Bernd Resch & Alexander Zipf, 2016. "Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-31, September.
    17. Tamara Mata & Carlos Llano, 2013. "Social networks and trade of services: modelling interregional flows with spatial and network autocorrelation effects," Journal of Geographical Systems, Springer, vol. 15(3), pages 319-367, July.
    18. Li, Mengya & Kwan, Mei-Po & Hu, Wenyan & Li, Rui & Wang, Jun, 2023. "Examining the effects of station-level factors on metro ridership using multiscale geographically weighted regression," Journal of Transport Geography, Elsevier, vol. 113(C).
    19. Kirillov, Andrew, 2021. "A study on spatial autocorrelation: Case of Russian regional inflation," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 64, pages 5-22.
    20. Zhimin Hong & Zhiwen Wang & Huhu Wang & Ruoxuan Wang, 2025. "A scale-adaptive estimation for mixed geographically and temporally weighted regression models," Journal of Geographical Systems, Springer, vol. 27(1), pages 85-111, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2504.12450. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.