IDEAS home Printed from https://ideas.repec.org/a/kap/jrefec/v68y2024i2d10.1007_s11146-022-09915-y.html
   My bibliography  Save this article

Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach

Author

Listed:
  • Juergen Deppner

    (University of Regensburg, IRE|BS International Real Estate Business School)

  • Marcelo Cajias

    (University of Regensburg, IRE|BS International Real Estate Business School
    PATRIZIA AG)

Abstract

Data-driven machine learning algorithms have initiated a paradigm shift in hedonic house price and rent modeling through their ability to capture highly complex and non-monotonic relationships. Their superior accuracy compared to parametric model alternatives has been demonstrated repeatedly in the literature. However, the statistical independence of the data implicitly assumed by resampling-based error estimates is unlikely to hold in a real estate context as price-formation processes in property markets are inherently spatial, which leads to spatial dependence structures in the data. When performing conventional cross-validation techniques for model selection and model assessment, spatial dependence between training and test data may lead to undetected overfitting and overoptimistic perception of predictive power. This study sheds light on the bias in cross-validation errors of tree-based algorithms induced by spatial autocorrelation and proposes a bias-reduced spatial cross-validation strategy. The findings confirm that error estimates from non-spatial resampling methods are overly optimistic, whereas spatially conscious techniques are more dependable and can increase generalizability. As accurate and unbiased error estimates are crucial to automated valuation methods, our results prove helpful for applications including, but not limited to, mass appraisal, credit risk management, portfolio allocation and investment decision making.

Suggested Citation

  • Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
  • Handle: RePEc:kap:jrefec:v:68:y:2024:i:2:d:10.1007_s11146-022-09915-y
    DOI: 10.1007/s11146-022-09915-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11146-022-09915-y
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11146-022-09915-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Pace, R. Kelley & Barry, Ronald & Gilley, Otis W. & Sirmans, C. F., 2000. "A method for spatial-temporal forecasting with an application to real estate prices," International Journal of Forecasting, Elsevier, vol. 16(2), pages 229-246.
    2. Sören Gröbel & Lorenz Thomschke, 2018. "Hedonic pricing and the spatial structure of housing data – an application to Berlin," Journal of Property Research, Taylor & Francis Journals, vol. 35(3), pages 185-208, July.
    3. Füss, Roland & Koller, Jan A., 2016. "The role of spatial and temporal structure for residential rent predictions," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1352-1368.
    4. Allan Din & Martin Hoesli & Andre Bender, 2001. "Environmental Variables and Real Estate Prices," Urban Studies, Urban Studies Journal Limited, vol. 38(11), pages 1989-2000, October.
    5. Kelejian, Harry H & Prucha, Ingmar R, 1998. "A Generalized Spatial Two-Stage Least Squares Procedure for Estimating a Spatial Autoregressive Model with Autoregressive Disturbances," The Journal of Real Estate Finance and Economics, Springer, vol. 17(1), pages 99-121, July.
    6. Steven C. Bourassa & Eva Cantoni & Martin Hoesli, 2010. "Predicting House Prices with Spatial Dependence: A Comparison of Alternative Methods," Journal of Real Estate Research, American Real Estate Society, vol. 32(2), pages 139-160.
    7. Alexander N. Bogin & Jessica Shui, 2020. "Correction to: Appraisal Accuracy and Automated Valuation Models in Rural Areas," The Journal of Real Estate Finance and Economics, Springer, vol. 61(4), pages 730-731, November.
    8. Jozef Zurada & Alan S. Levitan & Jian Guan, 2011. "A Comparison of Regression and Artificial Intelligence Methods in a Mass Appraisal Context," Journal of Real Estate Research, American Real Estate Society, vol. 33(3), pages 349-388.
    9. Can, Ayse, 1992. "Specification and estimation of hedonic housing price models," Regional Science and Urban Economics, Elsevier, vol. 22(3), pages 453-474, September.
    10. G. Stacy Sirmans & John D. Benjamin, 1991. "Determinants of Market Rent," Journal of Real Estate Research, American Real Estate Society, vol. 6(3), pages 357-380.
    11. G. Stacy Sirmans & C.F. Sirmans & John D. Benjamin, 1989. "Determining Apartment Rent: The Value of Amenities, Services, and External Factors," Journal of Real Estate Research, American Real Estate Society, vol. 4(2), pages 33-44.
    12. Anselin, Luc & Bera, Anil K. & Florax, Raymond & Yoon, Mann J., 1996. "Simple diagnostic tests for spatial dependence," Regional Science and Urban Economics, Elsevier, vol. 26(1), pages 77-104, February.
    13. W.J. McCluskey & M. McCord & P.T. Davis & M. Haran & D. McIlhatton, 2013. "Prediction accuracy in mass appraisal: a comparison of modern approaches," Journal of Property Research, Taylor & Francis Journals, vol. 30(4), pages 239-265, December.
    14. James P. LeSage, 2014. "What Regional Scientists Need to Know about Spatial Econometrics," The Review of Regional Studies, Southern Regional Science Association, vol. 44(1), pages 13-32, Spring.
    15. Jorge Iván Pérez-Rave & Juan Carlos Correa-Morales & Favián González-Echavarría, 2019. "A machine learning approach to big data regression analysis of real estate prices for inferential and predictive purposes," Journal of Property Research, Taylor & Francis Journals, vol. 36(1), pages 59-96, January.
    16. Bourassa, Steven C. & Hoesli, Martin & Peng, Vincent S., 2003. "Do housing submarkets really matter?," Journal of Housing Economics, Elsevier, vol. 12(1), pages 12-28, March.
    17. Basu, Sabyasachi & Thibodeau, Thomas G, 1998. "Analysis of Spatial Autocorrelation in House Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 17(1), pages 61-85, July.
    18. Schratz, Patrick & Muenchow, Jannes & Iturritxa, Eugenia & Richter, Jakob & Brenning, Alexander, 2019. "Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data," Ecological Modelling, Elsevier, vol. 406(C), pages 109-120.
    19. Pace, R Kelley & Gilley, Otis W, 1997. "Using the Spatial Configuration of the Data to Improve Estimation," The Journal of Real Estate Finance and Economics, Springer, vol. 14(3), pages 333-340, May.
    20. Steven Peterson & Albert B. Flanagan, 2009. "Neural Network Hedonic Pricing Models in Mass Real Estate Appraisal," Journal of Real Estate Research, American Real Estate Society, vol. 31(2), pages 147-164.
    21. James Valente & ShanShan Wu & Alan Gelfand & C.F. Sirmans, 2005. "Apartment Rent Prediction Using Spatial Modeling," Journal of Real Estate Research, American Real Estate Society, vol. 27(1), pages 105-136.
    22. A. F. Militino & M. D. Ugarte & L. García-Reinaldos, 2004. "Alternative Models for Describing Spatial Dependence among Dwelling Selling Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 29(2), pages 193-209, September.
    23. Charles F. Manski, 1993. "Identification of Endogenous Social Effects: The Reflection Problem," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 60(3), pages 531-542.
    24. Winky K.O. Ho & Bo-Sin Tang & Siu Wai Wong, 2021. "Predicting property prices with machine learning algorithms," Journal of Property Research, Taylor & Francis Journals, vol. 38(1), pages 48-70, January.
    25. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    26. Alexander N. Bogin & Jessica Shui, 2020. "Appraisal Accuracy and Automated Valuation Models in Rural Areas," The Journal of Real Estate Finance and Economics, Springer, vol. 60(1), pages 40-52, February.
    27. Bradford Case & John Clapp & Robin Dubin & Mauricio Rodriguez, 2004. "Modeling Spatial and Temporal House Price Patterns: A Comparison of Four Models," The Journal of Real Estate Finance and Economics, Springer, vol. 29(2), pages 167-191, September.
    28. Seungwoo Chin & Matthew E. Kahn & Hyungsik Roger Moon, 2020. "Estimating the Gains from New Rail Transit Investment: A Machine Learning Tree Approach," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 48(3), pages 886-914, September.
    29. Steven Bourassa & Eva Cantoni & Martin Hoesli, 2007. "Spatial Dependence, Housing Submarkets, and House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 35(2), pages 143-160, August.
    30. Allen, Marcus T & Springer, Thomas M & Waller, Neil G, 1995. "Implicit Pricing across Residential Rental Submarkets," The Journal of Real Estate Finance and Economics, Springer, vol. 11(2), pages 137-151, September.
    31. Liv Osland, 2010. "An Application of Spatial Econometrics in Relation to Hedonic House Price Modelling," Journal of Real Estate Research, American Real Estate Society, vol. 32(3), pages 289-320.
    32. R. Kelley Pace & Darren Hayunga, 2020. "Examining the Information Content of Residuals from Hedonic and Spatial Models Using Trees and Forests," The Journal of Real Estate Finance and Economics, Springer, vol. 60(1), pages 170-180, February.
    33. K.C. Lam & C.Y. Yu & C.K. Lam, 2009. "Support vector machine and entropy based decision support system for property valuation," Journal of Property Research, Taylor & Francis Journals, vol. 26(3), pages 213-233, August.
    34. Can, Ayse & Megbolugbe, Isaac, 1997. "Spatial Dependence and House Price Index Construction," The Journal of Real Estate Finance and Economics, Springer, vol. 14(1-2), pages 203-222, Jan.-Marc.
    35. Elaine M. Worzala & Margarita Lenk & Ana Silva, 1995. "An Exploration of Neural Networks and Its Application to Real Estate Valuation," Journal of Real Estate Research, American Real Estate Society, vol. 10(2), pages 185-202.
    36. Hu, Lirong & He, Shenjing & Han, Zixuan & Xiao, He & Su, Shiliang & Weng, Min & Cai, Zhongliang, 2019. "Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies," Land Use Policy, Elsevier, vol. 82(C), pages 657-673.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dieudonné Tchuente & Serge Nyawa, 2022. "Real estate price estimation in French cities using geocoding and machine learning," Annals of Operations Research, Springer, vol. 308(1), pages 571-608, January.
    2. Füss, Roland & Koller, Jan A., 2016. "The role of spatial and temporal structure for residential rent predictions," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1352-1368.
    3. Steven Bourassa & Eva Cantoni & Martin Hoesli, 2007. "Spatial Dependence, Housing Submarkets, and House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 35(2), pages 143-160, August.
    4. Xiaolong Liu, 2013. "Spatial and Temporal Dependence in House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 47(2), pages 341-369, August.
    5. Steven C. Bourassa & Eva Cantoni & Martin Hoesli, 2005. "Spatial Dependence, Housing Submarkets, and House Prices," FAME Research Paper Series rp151, International Center for Financial Asset Management and Engineering.
    6. Yunlong Gong & Peter Boelhouwer & Jan de Haan, 2014. "Spatial Dependence in House Prices: Evidence from China's Interurban Housing Market," ERSA conference papers ersa14p448, European Regional Science Association.
    7. Antonio Páez & Fei Long & Steven Farber, 2008. "Moving Window Approaches for Hedonic Price Estimation: An Empirical Comparison of Modelling Techniques," Urban Studies, Urban Studies Journal Limited, vol. 45(8), pages 1565-1581, July.
    8. Kiefer, Hua, 2011. "The house price determination process: Rational expectations with a spatial context," Journal of Housing Economics, Elsevier, vol. 20(4), pages 249-266.
    9. Jamie Spinney & Pavlos Kanaroglou & Darren Scott, 2011. "Exploring Spatial Dynamics with Land Price Indexes," Urban Studies, Urban Studies Journal Limited, vol. 48(4), pages 719-735, March.
    10. Tien Foo Sing & Jesse Jingye Yang & Shi Ming Yu, 2022. "Boosted Tree Ensembles for Artificial Intelligence Based Automated Valuation Models (AI-AVM)," The Journal of Real Estate Finance and Economics, Springer, vol. 65(4), pages 649-674, November.
    11. David Maddison, 2009. "A Spatio‐temporal Model of Farmland Values," Journal of Agricultural Economics, Wiley Blackwell, vol. 60(1), pages 171-189, February.
    12. Jos魍ar𨁍ontero-Lorenzo & Beatriz Larraz-Iribas, 2012. "Space-time approach to commercial property prices valuation," Applied Economics, Taylor & Francis Journals, vol. 44(28), pages 3705-3715, October.
    13. Eddie Chi Man Hui & Cong Liang & Ziyou Wang & Yuan Wang, 2016. "The roles of developer’s status and competitive intensity in presale pricing in a residential market: A study of the spatio-temporal model in Hangzhou, China," Urban Studies, Urban Studies Journal Limited, vol. 53(6), pages 1203-1224, May.
    14. LE GALLO, Julie, 2000. "Econométrie spatiale 1 -Autocorrélation spatiale," LATEC - Document de travail - Economie (1991-2003) 2000-05, LATEC, Laboratoire d'Analyse et des Techniques EConomiques, CNRS UMR 5118, Université de Bourgogne.
    15. Alice Barreca & Elena Fregonara & Diana Rolando, 2021. "EPC Labels and Building Features: Spatial Implications over Housing Prices," Sustainability, MDPI, vol. 13(5), pages 1-21, March.
    16. Rocco Curto & Elena Fregonara, 2019. "Monitoring and Analysis of the Real Estate Market in a Social Perspective: Results from the Turin’s (Italy) Experience," Sustainability, MDPI, vol. 11(11), pages 1-22, June.
    17. Mark D. Ecker & Victor De Oliveira, 2007. "Bayesian Spatial Modeling of Housing Prices Subject to a Localized Externality," Working Papers 0030, College of Business, University of Texas at San Antonio.
    18. Sebastian Gnat, 2021. "Property Mass Valuation on Small Markets," Land, MDPI, vol. 10(4), pages 1-14, April.
    19. Bing Zhu & Roland Füss & Nico Rottke, 2011. "The Predictive Power of Anisotropic Spatial Correlation Modeling in Housing Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 42(4), pages 542-565, May.
    20. Daikun Wang & Victor Jing Li, 2019. "Mass Appraisal Models of Real Estate in the 21st Century: A Systematic Literature Review," Sustainability, MDPI, vol. 11(24), pages 1-14, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jrefec:v:68:y:2024:i:2:d:10.1007_s11146-022-09915-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.