IDEAS home Printed from https://ideas.repec.org/a/eee/lauspo/v107y2021ics0264837721001988.html
   My bibliography  Save this article

Deriving adequate sample sizes for ANN-based modelling of real estate valuation tasks by complexity analysis

Author

Listed:
  • Horvath, Sabine
  • Soot, Matthias
  • Zaddach, Sebastian
  • Neuner, Hans
  • Weitkamp, Alexandra

Abstract

Property valuation in areas with few transactions on basis of a linear regression fails due to a not sufficient number of purchasing cases. One approach which is enhancing the available data set is to evaluate these purchasing cases together with a neighbouring submarket. However, it leads to non-linearities. Consequently, non-linear models for a cross-submarket real estate valuation are required to obtain reasonable results. We focus in this contribution on non-linear modelling on basis of artificial neural networks (ANN). A prerequisite for these procedures is an adequate sample size. We present a new approach based on the aggregation of submarkets additional to the markets with few transactions at the expense of increasing complexity of the model required. The cross-submarket ANN estimation aims to reach accuracies comparable to local property valuation procedures in a first step and in further consequence to enable a reasonable estimation in areas with few transactions. We introduce an extended Kalman filter (EKF) estimation procedure for the ANN parameters and compare it to the standard optimisation procedure Levenberg Marquardt (LM) as well as to the multiple linear regression. Thus, German spatial and functional submarkets are aggregated. For the spatially aggregated data set, the ANN estimation leads to improved results. The ANN estimation of the functionally aggregated data appears deceptively simple due to too small samples not representing the sampling density. The question arises, what are adequate sample sizes regarding the complexity of the unknown relationship. We purpose a model complexity analysis procedure based on resampling and the structural risk minimisation theory and derive a minimum sample size for the spatially aggregated data. Only for the EKF computations, this minimum sample size is reached due to less variance of the ANN estimations. Generally, the EKF computation leads to a better ANN performance in contrast to LM. Finally, the spatial cross-submarket ANN estimation reaches accuracies of local property valuation procedures.

Suggested Citation

  • Horvath, Sabine & Soot, Matthias & Zaddach, Sebastian & Neuner, Hans & Weitkamp, Alexandra, 2021. "Deriving adequate sample sizes for ANN-based modelling of real estate valuation tasks by complexity analysis," Land Use Policy, Elsevier, vol. 107(C).
  • Handle: RePEc:eee:lauspo:v:107:y:2021:i:c:s0264837721001988
    DOI: 10.1016/j.landusepol.2021.105475
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0264837721001988
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.landusepol.2021.105475?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jozef Zurada & Alan S. Levitan & Jian Guan, 2011. "A Comparison of Regression and Artificial Intelligence Methods in a Mass Appraisal Context," Journal of Real Estate Research, American Real Estate Society, vol. 33(3), pages 349-388.
    2. Gaetano Lisi, 2013. "On the Functional Form of the Hedonic Price Function: A Matching-theoretic Model and Empirical Evidence," International Real Estate Review, Global Social Science Institute, vol. 16(2), pages 189-207.
    3. Halvorsen, Robert & Pollakowski, Henry O., 1981. "Choice of functional form for hedonic price equations," Journal of Urban Economics, Elsevier, vol. 10(1), pages 37-49, July.
    4. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, American Real Estate Society, vol. 22(3), pages 313-336.
    5. Allan Din & Martin Hoesli & Andre Bender, 2001. "Environmental Variables and Real Estate Prices," Urban Studies, Urban Studies Journal Limited, vol. 38(11), pages 1989-2000, October.
    6. Elaine M. Worzala & Margarita Lenk & Ana Silva, 1995. "An Exploration of Neural Networks and Its Application to Real Estate Valuation," Journal of Real Estate Research, American Real Estate Society, vol. 10(2), pages 185-202.
    7. Hu, Lirong & He, Shenjing & Han, Zixuan & Xiao, He & Su, Shiliang & Weng, Min & Cai, Zhongliang, 2019. "Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies," Land Use Policy, Elsevier, vol. 82(C), pages 657-673.
    8. Angelos Mimis & Antonis Rovolis & Marianthi Stamou, 2013. "Property valuation with artificial neural network: the case of Athens," Journal of Property Research, Taylor & Francis Journals, vol. 30(2), pages 128-143, June.
    9. Alwosheel, Ahmad & van Cranenburgh, Sander & Chorus, Caspar G., 2018. "Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis," Journal of choice modelling, Elsevier, vol. 28(C), pages 167-182.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sisman, S. & Aydinoglu, A.C., 2022. "Improving performance of mass real estate valuation through application of the dataset optimization and Spatially Constrained Multivariate Clustering Analysis," Land Use Policy, Elsevier, vol. 119(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    2. Hyunsoo Kim & Youngwoo Kwon & Yeol Choi, 2020. "Assessing the Impact of Public Rental Housing on the Housing Prices in Proximity: Based on the Regional and Local Level of Price Prediction Models Using Long Short-Term Memory (LSTM)," Sustainability, MDPI, vol. 12(18), pages 1-25, September.
    3. Jose Torres-Pruñonosa & Pablo García-Estévez & Camilo Prado-Román, 2021. "Artificial Neural Network, Quantile and Semi-Log Regression Modelling of Mass Appraisal in Housing," Mathematics, MDPI, vol. 9(7), pages 1-16, April.
    4. Tien Foo Sing & Jesse Jingye Yang & Shi Ming Yu, 2022. "Boosted Tree Ensembles for Artificial Intelligence Based Automated Valuation Models (AI-AVM)," The Journal of Real Estate Finance and Economics, Springer, vol. 65(4), pages 649-674, November.
    5. Ali Azadeh & Mohammad Sheikhalishahi & Ali Boostani, 2014. "A Flexible Neuro-Fuzzy Approach for Improvement of Seasonal Housing Price Estimation in Uncertain and Non-Linear Environments," South African Journal of Economics, Economic Society of South Africa, vol. 82(4), pages 567-582, December.
    6. Maurizio d’Amato, 2007. "Comparing Rough Set Theory with Multiple Regression Analysis as Automated Valuation Methodologies," International Real Estate Review, Global Social Science Institute, vol. 10(2), pages 42-65.
    7. Jose Torres-Pruñonosa & Pablo García-Estévez & Josep Maria Raya & Camilo Prado-Román, 2022. "How on Earth Did Spanish Banking Sell the Housing Stock?," SAGE Open, , vol. 12(1), pages 21582440221, March.
    8. Camilo Serrano & Martin Hoesli, 2010. "Are Securitized Real Estate Returns more Predictable than Stock Returns?," The Journal of Real Estate Finance and Economics, Springer, vol. 41(2), pages 170-192, August.
    9. Jos魍ar𨁍ontero-Lorenzo & Beatriz Larraz-Iribas, 2012. "Space-time approach to commercial property prices valuation," Applied Economics, Taylor & Francis Journals, vol. 44(28), pages 3705-3715, October.
    10. Anthony Amoah & Peter G. Moffatt, 2017. "Estimating demand for reliable piped-water services in urban Ghana: An application of competing valuation approaches," University of East Anglia School of Economics Working Paper Series 2017-01, School of Economics, University of East Anglia, Norwich, UK..
    11. George H. Lentz & Ko Wang, 1998. "Residential Appraisal and the Lending Process: A Survey of Issues," Journal of Real Estate Research, American Real Estate Society, vol. 15(1), pages 11-40.
    12. Antipov, Evgeny & Pokryshevskaya, Elena, 2010. "Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics," MPRA Paper 27645, University Library of Munich, Germany.
    13. William Cheung & Lewen Guo & Yuichiro Kawaguchi, 2021. "Automated valuation model for residential rental markets: evidence from Japan," Journal of Spatial Econometrics, Springer, vol. 2(1), pages 1-34, December.
    14. Mehmet Emin Tabar & Aziz Sisman & Yasemin Sisman, 2023. "A Real Estate Appraisal Model with Artificial Neural Networks and Fuzzy Logic: A Local Case Study of Samsun City," International Real Estate Review, Global Social Science Institute, vol. 26(4), pages 565-581.
    15. Dieudonné Tchuente & Serge Nyawa, 2022. "Real estate price estimation in French cities using geocoding and machine learning," Annals of Operations Research, Springer, vol. 308(1), pages 571-608, January.
    16. Susanna Levantesi & Gabriella Piscopo, 2020. "The Importance of Economic Variables on London Real Estate Market: A Random Forest Approach," Risks, MDPI, vol. 8(4), pages 1-17, October.
    17. Yan Kestens & Marius Thériault & François Des Rosiers, 2004. "The Impact of Surrounding Land Use and Vegetation on Single-Family House Prices," Environment and Planning B, , vol. 31(4), pages 539-567, August.
    18. Pierluigi Morano & Paolo Rosato & Francesco Tajani & Benedetto Manganelli & Felicia Di Liddo, 2019. "Contextualized Property Market Models vs. Generalized Mass Appraisals: An Innovative Approach," Sustainability, MDPI, vol. 11(18), pages 1-28, September.
    19. Steven Peterson & Albert B. Flanagan, 2009. "Neural Network Hedonic Pricing Models in Mass Real Estate Appraisal," Journal of Real Estate Research, American Real Estate Society, vol. 31(2), pages 147-164.
    20. Craig Ellis & Patrick J. Wilson & Ralf Zurbruegg, 2007. "Real Estate ‘Value’ Stocks and International Diversification," Journal of Property Research, Taylor & Francis Journals, vol. 24(3), pages 265-287, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:lauspo:v:107:y:2021:i:c:s0264837721001988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joice Jiang (email available below). General contact details of provider: https://www.journals.elsevier.com/land-use-policy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.