IDEAS home Printed from https://ideas.repec.org/p/grz/wpaper/2019-02.html
   My bibliography  Save this paper

Metrics for Evaluating the Performance of Automated Valuation Models

Author

Listed:
  • Miriam Steurer

    (University of Graz, Austria)

  • Robert Hill

    (University of Graz, Austria)

Abstract

Automated Valuation Models (AVMs) based on Machine Learning (ML) algorithms are widely used for the prediction of house prices. While there is consensus in the literature that cross-validation (CV) should be used for model selection in this context, the question of which performance metrics to use is generally neglected. Here we collect the most commonly used metrics from the AVM literature and elsewhere, and evaluate them with respect to two symmetry conditions: symmetry with respect to prediction error rates and symmetry with respect to the treatment of actual and predicted values. While none of the commonly used metrics satisfy both conditions, we propose a number of new metrics that do. We also show how popular existing metrics can be altered so that they adhere to these conditions. To illustrate our findings we compare the performance of 5 ML-based AVMs and find, that the most popular metrics in the AVM literature can generate misleading results. A different picture emerges when the full set of metrics is considered, and especially when we focus on four key metrics with the best symmetry properties.

Suggested Citation

  • Miriam Steurer & Robert Hill, 2019. "Metrics for Evaluating the Performance of Automated Valuation Models," Graz Economics Papers 2019-02, University of Graz, Department of Economics.
  • Handle: RePEc:grz:wpaper:2019-02
    as

    Download full text from publisher

    File URL: http://www100.uni-graz.at/vwlwww/forschung/RePEc/wpaper/2019-02.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Voyant, Cyril & Notton, Gilles & Kalogirou, Soteris & Nivet, Marie-Laure & Paoli, Christophe & Motte, Fabrice & Fouilloy, Alexis, 2017. "Machine learning methods for solar radiation forecasting: A review," Renewable Energy, Elsevier, vol. 105(C), pages 569-582.
    3. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    4. Rainer Schulz & Martin Wersing & Axel Werwatz, 2014. "Automated valuation modelling: a specification exercise," Journal of Property Research, Taylor & Francis Journals, vol. 31(2), pages 131-153, June.
    5. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    6. Shiller, Robert J & Weiss, Allan N, 1999. "Evaluating Real Estate Valuation Systems," The Journal of Real Estate Finance and Economics, Springer, vol. 18(2), pages 147-161, March.
    7. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    8. Jurado, Sergio & Nebot, Àngela & Mugica, Fransisco & Avellana, Narcís, 2015. "Hybrid methodologies for electricity load forecasting: Entropy-based feature selection with machine learning and soft computing techniques," Energy, Elsevier, vol. 86(C), pages 276-291.
    9. Antipov, Evgeny & Pokryshevskaya, Elena, 2010. "Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics," MPRA Paper 27645, University Library of Munich, Germany.
    10. Alexander N. Bogin & Jessica Shui, 2018. "Appraisal Accuracy, Automated Valuation Models, And Credit Modeling in Rural Areas," FHFA Staff Working Papers 18-03, Federal Housing Finance Agency.
    11. He, Qianchuan & Kong, Linglong & Wang, Yanhua & Wang, Sijian & Chan, Timothy A. & Holland, Eric, 2016. "Regularized quantile regression under heterogeneous sparsity with application to quantitative genetic traits," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 222-239.
    12. D. S.P. Rao (ed.), 2009. "Purchasing Power Parities of Currencies," Books, Edward Elgar Publishing, number 3725.
    13. Foley, Aoife M. & Leahy, Paul G. & Marvuglia, Antonino & McKeogh, Eamon J., 2012. "Current methods and advances in forecasting of wind power generation," Renewable Energy, Elsevier, vol. 37(1), pages 1-8.
    14. Makridakis, Spyros, 1993. "Accuracy measures: theoretical and practical concerns," International Journal of Forecasting, Elsevier, vol. 9(4), pages 527-529, December.
    15. Li, Yanting & He, Yong & Su, Yan & Shu, Lianjie, 2016. "Forecasting the daily power output of a grid-connected photovoltaic system based on multivariate adaptive regression splines," Applied Energy, Elsevier, vol. 180(C), pages 392-401.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lan, Hai & Yin, He & Hong, Ying-Yi & Wen, Shuli & Yu, David C. & Cheng, Peng, 2018. "Day-ahead spatio-temporal forecasting of solar irradiation along a navigation route," Applied Energy, Elsevier, vol. 211(C), pages 15-27.
    2. Coelho, Vitor N. & Coelho, Igor M. & Coelho, Bruno N. & Reis, Agnaldo J.R. & Enayatifar, Rasul & Souza, Marcone J.F. & Guimarães, Frederico G., 2016. "A self-adaptive evolutionary fuzzy model for load forecasting problems on smart grid environment," Applied Energy, Elsevier, vol. 169(C), pages 567-584.
    3. Pedro, Hugo T.C. & Lim, Edwin & Coimbra, Carlos F.M., 2018. "A database infrastructure to implement real-time solar and wind power generation intra-hour forecasts," Renewable Energy, Elsevier, vol. 123(C), pages 513-525.
    4. Alysha M De Livera, 2010. "Automatic forecasting with a modified exponential smoothing state space framework," Monash Econometrics and Business Statistics Working Papers 10/10, Monash University, Department of Econometrics and Business Statistics.
    5. Blaskowitz, Oliver & Herwartz, Helmut, 2011. "On economic evaluation of directional forecasts," International Journal of Forecasting, Elsevier, vol. 27(4), pages 1058-1065, October.
    6. Eduardo Correia & Rodrigo Calili & José Francisco Pessanha & Maria Fatima Almeida, 2023. "Definition of Regulatory Targets for Electricity Non-Technical Losses: Proposition of an Automatic Model-Selection Technique for Panel Data Regressions," Energies, MDPI, vol. 16(6), pages 1-22, March.
    7. Santamaría-Bonfil, G. & Reyes-Ballesteros, A. & Gershenson, C., 2016. "Wind speed forecasting for wind farms: A method based on support vector regression," Renewable Energy, Elsevier, vol. 85(C), pages 790-809.
    8. Wasilewski, J. & Baczynski, D., 2017. "Short-term electric energy production forecasting at wind power plants in pareto-optimality context," Renewable and Sustainable Energy Reviews, Elsevier, vol. 69(C), pages 177-187.
    9. Wang, Kejun & Qi, Xiaoxia & Liu, Hongda, 2019. "A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network," Applied Energy, Elsevier, vol. 251(C), pages 1-1.
    10. Ruhnau, Oliver & Hennig, Patrick & Madlener, Reinhard, 2020. "Economic implications of forecasting electricity generation from variable renewable energy sources," Renewable Energy, Elsevier, vol. 161(C), pages 1318-1327.
    11. Da Liu & Kun Sun & Han Huang & Pingzhou Tang, 2018. "Monthly Load Forecasting Based on Economic Data by Decomposition Integration Theory," Sustainability, MDPI, vol. 10(9), pages 1-22, September.
    12. Van Belle, Jente & Guns, Tias & Verbeke, Wouter, 2021. "Using shared sell-through data to forecast wholesaler demand in multi-echelon supply chains," European Journal of Operational Research, Elsevier, vol. 288(2), pages 466-479.
    13. Zhou, Yi & Zhou, Nanrun & Gong, Lihua & Jiang, Minlin, 2020. "Prediction of photovoltaic power output based on similar day analysis, genetic algorithm and extreme learning machine," Energy, Elsevier, vol. 204(C).
    14. Su, Miaomiao & Wang, Qihua, 2022. "A convex programming solution based debiased estimator for quantile with missing response and high-dimensional covariables," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    15. Sabarathinam Srinivasan & Suresh Kumarasamy & Zacharias E. Andreadakis & Pedro G. Lind, 2023. "Artificial Intelligence and Mathematical Models of Power Grids Driven by Renewable Energy Sources: A Survey," Energies, MDPI, vol. 16(14), pages 1-56, July.
    16. Kim, Sungil & Kim, Heeyoung, 2016. "A new metric of absolute percentage error for intermittent demand forecasts," International Journal of Forecasting, Elsevier, vol. 32(3), pages 669-679.
    17. Yagli, Gokhan Mert & Yang, Dazhi & Srinivasan, Dipti, 2019. "Automatic hourly solar forecasting using machine learning models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 105(C), pages 487-498.
    18. Heo, Jae & Jung, Jaehoon & Kim, Byungil & Han, SangUk, 2020. "Digital elevation model-based convolutional neural network modeling for searching of high solar energy regions," Applied Energy, Elsevier, vol. 262(C).
    19. Ngoc-Lan Huynh, Anh & Deo, Ravinesh C. & Ali, Mumtaz & Abdulla, Shahab & Raj, Nawin, 2021. "Novel short-term solar radiation hybrid model: Long short-term memory network integrated with robust local mean decomposition," Applied Energy, Elsevier, vol. 298(C).
    20. Yagli, Gokhan Mert & Yang, Dazhi & Gandhi, Oktoviano & Srinivasan, Dipti, 2020. "Can we justify producing univariate machine-learning forecasts with satellite-derived solar irradiance?," Applied Energy, Elsevier, vol. 259(C).

    More about this item

    Keywords

    Performance metric; Automated valuation model (AVM); Appraisal; Prediction error; Model selection;
    All these keywords.

    JEL classification:

    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:grz:wpaper:2019-02. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Michael Scholz (email available below). General contact details of provider: https://edirc.repec.org/data/vgrazat.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.