IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v66y2025i1d10.1007_s10614-024-10703-4.html
   My bibliography  Save this article

A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction

Author

Listed:
  • Cihan Çılgın

    (Bolu Abant İzzet Baysal University)

  • Hadi Gökçen

    (Gazi University)

Abstract

Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.

Suggested Citation

  • Cihan Çılgın & Hadi Gökçen, 2025. "A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction," Computational Economics, Springer;Society for Computational Economics, vol. 66(1), pages 127-178, July.
  • Handle: RePEc:kap:compec:v:66:y:2025:i:1:d:10.1007_s10614-024-10703-4
    DOI: 10.1007/s10614-024-10703-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-024-10703-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-024-10703-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Mehmet DURKAYA & Rahmi YAMAK, 2004. "Türkiye''de Konut Piyasasının Talep Yönlü Analizi," Iktisat Isletme ve Finans, Bilgesel Yayincilik, vol. 19(217), pages 75-83.
    2. repec:eme:jpvi00:14635789810205128 is not listed on IDEAS
    3. Chung Chun Lin & Satish B. Mohan, 2011. "Effectiveness comparison of the residential property mass appraisal methodologies in the USA," International Journal of Housing Markets and Analysis, Emerald Group Publishing Limited, vol. 4(3), pages 224-243, August.
    4. Yilmazer, Seckin & Kocaman, Sultan, 2020. "A mass appraisal assessment study using machine learning based on multiple regression and random forest," Land Use Policy, Elsevier, vol. 99(C).
    5. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, Taylor & Francis Journals, vol. 22(3), pages 313-336, January.
    6. Jasmina Ćetković & Slobodan Lakić & Marijana Lazarevska & Miloš Žarković & Saša Vujošević & Jelena Cvijović & Mladen Gogić, 2018. "Assessment of the Real Estate Market Value in the European Market by Artificial Neural Networks Application," Complexity, Hindawi, vol. 2018, pages 1-10, January.
    7. Allan Din & Martin Hoesli & Andre Bender, 2001. "Environmental Variables and Real Estate Prices," Urban Studies, Urban Studies Journal Limited, vol. 38(11), pages 1989-2000, October.
    8. Mateusz Tomal, 2021. "Housing market heterogeneity and cluster formation: evidence from Poland," International Journal of Housing Markets and Analysis, Emerald Group Publishing Limited, vol. 14(5), pages 1166-1185, March.
    9. Gang-Zhi Fan & Seow Eng Ong & Hian Chye Koh, 2006. "Determinants of House Price: A Decision Tree Approach," Urban Studies, Urban Studies Journal Limited, vol. 43(12), pages 2301-2315, November.
    10. Joseph Awoamim Yacim & Douw Gert Brand Boshoff, 2018. "Impact of Artificial Neural Networks Training Algorithms on Accurate Prediction of Property Values," Journal of Real Estate Research, Taylor & Francis Journals, vol. 40(3), pages 375-418, July.
    11. Sun, Xiaolei & Liu, Mingxi & Sima, Zeqian, 2020. "A novel cryptocurrency price trend forecasting model based on LightGBM," Finance Research Letters, Elsevier, vol. 32(C).
    12. Agostino Valier, 2020. "Who performs better? AVMs vs hedonic models," Journal of Property Investment & Finance, Emerald Group Publishing Limited, vol. 38(3), pages 213-225, March.
    13. Elena G. Irwin, 2002. "The Effects of Open Space on Residential Property Values," Land Economics, University of Wisconsin Press, vol. 78(4), pages 465-480.
    14. Hinrichs, Nils & Kolbe, Jens & Werwatz, Axel, 2020. "AVM and high dimensional data: Do ridge, the lasso or the elastic net provide an "automated" solution?," FORLand Working Papers 22 (2020), Humboldt University Berlin, DFG Research Unit 2569 FORLand "Agricultural Land Markets – Efficiency and Regulation".
    15. Jozef Zurada & Alan S. Levitan & Jian Guan, 2011. "A Comparison of Regression and Artificial Intelligence Methods in a Mass Appraisal Context," Journal of Real Estate Research, American Real Estate Society, vol. 33(3), pages 349-388.
    16. Sheppard, Stephen, 1999. "Hedonic analysis of housing markets," Handbook of Regional and Urban Economics, in: P. C. Cheshire & E. S. Mills (ed.), Handbook of Regional and Urban Economics, edition 1, volume 3, chapter 41, pages 1595-1635, Elsevier.
    17. D.H. Jenkins & O.M. Lewis & N. Almond & S.A. Gronow & J.A. Ware, 1999. "Towards an intelligent residential appraisal model," Journal of Property Research, Taylor & Francis Journals, vol. 16(1), pages 67-90, January.
    18. Elli Pagourtzi & Vassilis Assimakopoulos & Thomas Hatzichristos & Nick French, 2003. "Real estate appraisal: a review of valuation methods," Journal of Property Investment & Finance, Emerald Group Publishing Limited, vol. 21(4), pages 383-401, August.
    19. Elena G. Irwin & Nancy E. Bockstael, 2001. "The Problem of Identifying Land Use Spillovers: Measuring the Effects of Open Space on Residential Property Values," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 83(3), pages 698-704.
    20. Bourassa, Steven C. & Hamelink, Foort & Hoesli, Martin & MacGregor, Bryan D., 1999. "Defining Housing Submarkets," Journal of Housing Economics, Elsevier, vol. 8(2), pages 160-183, June.
    21. Sander, Heather & Polasky, Stephen & Haight, Robert G., 2010. "The value of urban tree cover: A hedonic property price model in Ramsey and Dakota Counties, Minnesota, USA," Ecological Economics, Elsevier, vol. 69(8), pages 1646-1656, June.
    22. Cathy Maugis & Gilles Celeux & Marie-Laure Martin-Magniette, 2009. "Variable Selection for Clustering with Gaussian Mixture Models," Biometrics, The International Biometric Society, vol. 65(3), pages 701-709, September.
    23. Manuel Landajo & Celia Bilbao & Amelia Bilbao, 2012. "Nonparametric neural network modeling of hedonic prices in the housing market," Empirical Economics, Springer, vol. 42(3), pages 987-1009, June.
    24. repec:eme:ijhma0:17538271111153013 is not listed on IDEAS
    25. Abhijat Arun Abhyankar & Harish Kumar Singla, 2021. "Comparing predictive performance of general regression neural network (GRNN) and hedonic regression model for factors affecting housing prices in “Pune-India”," International Journal of Housing Markets and Analysis, Emerald Group Publishing Limited, vol. 15(2), pages 451-477, June.
    26. repec:eme:jpvi00:14635789710163775 is not listed on IDEAS
    27. Chung Chun Lin & Satish B. Mohan, 2011. "Effectiveness comparison of the residential property mass appraisal methodologies in the USA," International Journal of Housing Markets and Analysis, Emerald Group Publishing Limited, vol. 4(3), pages 224-243, August.
    28. Volkan Sevinç, 2022. "Determining the Flat Sales Prices by Flat Characteristics Using Bayesian Network Models," Computational Economics, Springer;Society for Computational Economics, vol. 59(2), pages 549-577, February.
    29. Miriam Steurer & Robert J. Hill & Norbert Pfeifer, 2021. "Metrics for evaluating the performance of machine learning based automated valuation models," Journal of Property Research, Taylor & Francis Journals, vol. 38(2), pages 99-129, April.
    30. Jozef Zurada & Alan Levitan & Jian Guan, 2011. "A Comparison of Regression and Artificial Intelligence Methods in a Mass Appraisal Context," Journal of Real Estate Research, Taylor & Francis Journals, vol. 33(3), pages 349-388, January.
    31. Bucholtz, Shawn & Geoghegan, Jacqueline & Lynch, Lori, 2003. "Capitalization of Open Spaces into Housing Values and the Residential Property Tax Revenue Impacts of Agricultural Easement Programs," Agricultural and Resource Economics Review, Northeastern Agricultural and Resource Economics Association, vol. 32(01), pages 1-13, April.
    32. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    33. W.J. McCluskey & M. McCord & P.T. Davis & M. Haran & D. McIlhatton, 2013. "Prediction accuracy in mass appraisal: a comparison of modern approaches," Journal of Property Research, Taylor & Francis Journals, vol. 30(4), pages 239-265, December.
    34. Acharya, Gayatri & Bennett, Lynne Lewis, 2001. "Valuing Open Space and Land-Use Patterns in Urban Watersheds," The Journal of Real Estate Finance and Economics, Springer, vol. 22(2-3), pages 221-237, March-May.
    35. Susanna Levantesi & Gabriella Piscopo, 2020. "The Importance of Economic Variables on London Real Estate Market: A Random Forest Approach," Risks, MDPI, vol. 8(4), pages 1-17, October.
    36. Mahdieh Yazdani, 2021. "Machine Learning, Deep Learning, and Hedonic Methods for Real Estate Price Prediction," Papers 2110.07151, arXiv.org.
    37. Süreyya Özöğür Akyüz & Birsen Eygi Erdogan & Özlem Yıldız & Pınar Karadayı Ataş, 2023. "A Novel Hybrid House Price Prediction Model," Computational Economics, Springer;Society for Computational Economics, vol. 62(3), pages 1215-1232, October.
    38. Matthias Schonlau & Rosie Yuyan Zou, 2020. "The random forest algorithm for statistical learning," Stata Journal, StataCorp LLC, vol. 20(1), pages 3-29, March.
    39. K.C. Lam & C.Y. Yu & C.K. Lam, 2009. "Support vector machine and entropy based decision support system for property valuation," Journal of Property Research, Taylor & Francis Journals, vol. 26(3), pages 213-233, August.
    40. repec:eme:ijhma0:ijhma-01-2021-0003 is not listed on IDEAS
    41. William McCluskey & Sarabjot Anand, 1999. "The application of intelligent hybrid techniques for the mass appraisal of residential properties," Journal of Property Investment & Finance, Emerald Group Publishing Limited, vol. 17(3), pages 218-239, August.
    42. Evren Ozus & Vedia Dokmeci & Gulay Kiroglu & Guldehan Egdemir, 2007. "Spatial Analysis of Residential Prices in Istanbul," European Planning Studies, Taylor & Francis Journals, vol. 15(5), pages 707-721, June.
    43. repec:eme:ijhma0:ijhma-09-2020-0114 is not listed on IDEAS
    44. Luca Rampini & Fulvio Re Cecconi, 2021. "Artificial intelligence algorithms to predict Italian real estate market prices," Journal of Property Investment & Finance, Emerald Group Publishing Limited, vol. 40(6), pages 588-611, December.
    45. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    46. Elaine M. Worzala & Margarita Lenk & Ana Silva, 1995. "An Exploration of Neural Networks and Its Application to Real Estate Valuation," Journal of Real Estate Research, American Real Estate Society, vol. 10(2), pages 185-202.
    47. Steven Peterson & Albert Flanagan, 2009. "Neural Network Hedonic Pricing Models in Mass Real Estate Appraisal," Journal of Real Estate Research, Taylor & Francis Journals, vol. 31(2), pages 147-164, January.
    48. Steven Peterson & Albert B. Flanagan, 2009. "Neural Network Hedonic Pricing Models in Mass Real Estate Appraisal," Journal of Real Estate Research, American Real Estate Society, vol. 31(2), pages 147-164.
    49. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, American Real Estate Society, vol. 22(3), pages 313-336.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Silviu-Ionuț BĂBȚAN, 2024. "AUTOMATED EVALUATION MODELS in real estate market: A comparative analysis between linear regression and XGBoost," Annales Universitatis Apulensis Series Oeconomica, Faculty of Sciences, "1 Decembrie 1918" University, Alba Iulia, vol. 2(26), pages 1-3.
    2. Juergen Deppner & Benedict Ahlefeldt-Dehn & Eli Beracha & Wolfgang Schaefers, 2025. "Boosting the Accuracy of Commercial Real Estate Appraisals: An Interpretable Machine Learning Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 71(2), pages 314-351, August.
    3. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    4. Jose Torres-Pruñonosa & Pablo García-Estévez & Josep Maria Raya & Camilo Prado-Román, 2022. "How on Earth Did Spanish Banking Sell the Housing Stock?," SAGE Open, , vol. 12(1), pages 21582440221, March.
    5. Jose A. Rodriguez-Serrano, 2025. "Prototype-based learning for real estate valuation: a machine learning model that explains prices," Annals of Operations Research, Springer, vol. 344(1), pages 287-311, January.
    6. Jay Mittal, 2017. "Valuing Visual Accessibility of Scenic Landscapes in a Single Family Housing Market: A Spatial Hedonic Approach," ERES eres2017_1, European Real Estate Society (ERES).
    7. Behr,Daniela Monika & Chen,Lixue & Goel,Ankita & Haider,Khondoker Tanveer & Sandeep Singh & Zaman,Asad, 2023. "Estimating House Prices in Emerging Markets and Developing Economies : A Big Data Approach," Policy Research Working Paper Series 10301, The World Bank.
    8. Anderson, Soren T. & West, Sarah E., 2006. "Open space, residential property values, and spatial context," Regional Science and Urban Economics, Elsevier, vol. 36(6), pages 773-789, November.
    9. Daikun Wang & Victor Jing Li, 2019. "Mass Appraisal Models of Real Estate in the 21st Century: A Systematic Literature Review," Sustainability, MDPI, vol. 11(24), pages 1-14, December.
    10. Dieudonné Tchuente & Serge Nyawa, 2022. "Real estate price estimation in French cities using geocoding and machine learning," Annals of Operations Research, Springer, vol. 308(1), pages 571-608, January.
    11. Tien Foo Sing & Jesse Jingye Yang & Shi Ming Yu, 2022. "Boosted Tree Ensembles for Artificial Intelligence Based Automated Valuation Models (AI-AVM)," The Journal of Real Estate Finance and Economics, Springer, vol. 65(4), pages 649-674, November.
    12. Cho, Seong-Hoon & Poudyal, Neelam C. & Roberts, Roland K., 2008. "Spatial analysis of the amenity value of green open space," Ecological Economics, Elsevier, vol. 66(2-3), pages 403-416, June.
    13. Zipp, Katherine Y. & Lewis, David J. & Provencher, Bill, 2017. "Does the conservation of land reduce development? An econometric-based landscape simulation with land market feedbacks," Journal of Environmental Economics and Management, Elsevier, vol. 81(C), pages 19-37.
    14. Seong-Hoon Cho & Dayton Lambert & Seung Kim & Roland Roberts & William Park, 2011. "Relationship between value of open space and distance from housing locations within a community," Journal of Geographical Systems, Springer, vol. 13(4), pages 393-414, December.
    15. Moritz Stang & Bastian Krämer & Cathrine Nagl & Wolfgang Schäfers, 2023. "From human business to machine learning—methods for automating real estate appraisals and their practical implications [Vom Vergleichswertverfahren zum maschinellen Lernen – Methoden zur automatisi," Zeitschrift für Immobilienökonomie (German Journal of Real Estate Research), Springer;Gesellschaft für Immobilienwirtschaftliche Forschung e. V., vol. 9(2), pages 81-108, October.
    16. Felix Lorenz & Jonas Willwersch & Marcelo Cajias & Franz Fuerst, 2023. "Interpretable machine learning for real estate market analysis," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 51(5), pages 1178-1208, September.
    17. JunJie Wu & Wenchao Xu & Ralph Alig, 2016. "How Do the Location, Size and Budget of Open Space Conservation Affect Land Values?," The Journal of Real Estate Finance and Economics, Springer, vol. 52(1), pages 73-97, January.
    18. Jose Torres-Pruñonosa & Pablo García-Estévez & Camilo Prado-Román, 2021. "Artificial Neural Network, Quantile and Semi-Log Regression Modelling of Mass Appraisal in Housing," Mathematics, MDPI, vol. 9(7), pages 1-16, April.
    19. Mariusz Doszyń, 2024. "Might expert knowledge improve econometric real estate mass appraisal?," The Journal of Real Estate Finance and Economics, Springer, vol. 69(4), pages 719-740, November.
    20. Bingzi Jin & Xiaojie Xu, 2025. "Predictions of residential property price indices for China via machine learning models," Quality & Quantity: International Journal of Methodology, Springer, vol. 59(2), pages 1481-1513, April.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:66:y:2025:i:1:d:10.1007_s10614-024-10703-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.