IDEAS home Printed from https://ideas.repec.org/a/wly/jjmath/v2022y2022i1n2068353.html

A Study on the Prediction of House Price Index in First‐Tier Cities in China Based on Heterogeneous Integrated Learning Model

Author

Listed:
  • Yaqi Mao
  • Yonghui Duan
  • Yibin Guo
  • Xiang Wang
  • Shen Gao

Abstract

To address the difficulty of low prediction accuracy, insufficient model stability, and certain lag associated with a single machine learning model in the prediction of house price, this paper proposes a multimodel fusion house price prediction model based on stacking integrated learning. Firstly, web search data affecting house prices were collected by web crawler technology, and Spearman correlation analysis was performed on the attribute set to reduce its complexity and establish a prediction index system for four first‐tier cities in China. Secondly, with the goal of improving accuracy, diversity, and generalization ability, the types of base learners as well as metalearners are determined, and the parameters of the base learners are optimized using the grey wolf optimization algorithm to produce the GWO‐stacking model, and the experimental results from four datasets demonstrate that the model has high prediction accuracy. Finally, to solve the issue of unintelligible black boxes in machine learning models, we have used the state‐of‐the‐art interpretation method SHAP combined with the LightGBM algorithm to interpret the model, and the result can be used as a basis for real estate policy planning and adjustment and even guide the demand of home buyers, thus improving the efficiency and effectiveness of government policy making.

Suggested Citation

  • Yaqi Mao & Yonghui Duan & Yibin Guo & Xiang Wang & Shen Gao, 2022. "A Study on the Prediction of House Price Index in First‐Tier Cities in China Based on Heterogeneous Integrated Learning Model," Journal of Mathematics, John Wiley & Sons, vol. 2022(1).
  • Handle: RePEc:wly:jjmath:v:2022:y:2022:i:1:n:2068353
    DOI: 10.1155/2022/2068353
    as

    Download full text from publisher

    File URL: https://doi.org/10.1155/2022/2068353
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2022/2068353?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jorge Chica-Olmo, 2007. "Prediction of Housing Location Price by a Multivariate Spatial Method: Cokriging," Journal of Real Estate Research, American Real Estate Society, vol. 29(1), pages 95-114.
    2. repec:hin:complx:5287263 is not listed on IDEAS
    3. Chambers, Daniel N., 1992. "The racial housing price differential and racially transitional neighborhoods," Journal of Urban Economics, Elsevier, vol. 32(2), pages 214-232, September.
    4. Kelvin J. Lancaster, 1966. "A New Approach to Consumer Theory," Journal of Political Economy, University of Chicago Press, vol. 74(2), pages 132-132.
    5. Linlin Zhao & Jasper Mbachu & Zhansheng Liu, 2019. "Exploring the Trend of New Zealand Housing Prices to Support Sustainable Development," Sustainability, MDPI, vol. 11(9), pages 1-18, April.
    6. Park, Sungjun & Kim, Jinsoo, 2018. "The effect of interest in renewable energy on US household electricity consumption: An analysis using Google Trends data," Renewable Energy, Elsevier, vol. 127(C), pages 1004-1010.
    7. Eli Beracha & M. Babajide Wintoki, 2013. "Forecasting Residential Real Estate Price Changes from Online Search Activity," Journal of Real Estate Research, Taylor & Francis Journals, vol. 35(3), pages 283-312, January.
    8. Jorge Chica-Olmo, 2007. "Prediction of Housing Location Price by a Multivariate Spatial Method: Cokriging," Journal of Real Estate Research, Taylor & Francis Journals, vol. 29(1), pages 91-114, January.
    9. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    10. Eli Beracha & M. Babajide Wintoki, 2013. "Forecasting Residential Real Estate Price Changes from Online Search Activity," Journal of Real Estate Research, American Real Estate Society, vol. 35(3), pages 283-312.
    11. Gang-Zhi Fan & Seow Eng Ong & Hian Chye Koh, 2006. "Determinants of House Price: A Decision Tree Approach," Urban Studies, Urban Studies Journal Limited, vol. 43(12), pages 2301-2315, November.
    12. Cankun Wei & Meichen Fu & Li Wang & Hanbing Yang & Feng Tang & Yuqing Xiong, 2022. "The Research Development of Hedonic Price Model-Based Real Estate Appraisal in the Era of Big Data," Land, MDPI, vol. 11(3), pages 1-30, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cankun Wei & Meichen Fu & Li Wang & Hanbing Yang & Feng Tang & Yuqing Xiong, 2022. "The Research Development of Hedonic Price Model-Based Real Estate Appraisal in the Era of Big Data," Land, MDPI, vol. 11(3), pages 1-30, February.
    2. Ken Wong & Max Kwong & Paul Luk & Michael Cheng, 2023. "A robust textual analysis of the dynamics of Hong Kong property market," Pacific Economic Review, Wiley Blackwell, vol. 28(3), pages 314-346, August.
    3. José-María Montero & Román Mínguez & Gema Fernández-Avilés, 2018. "Housing price prediction: parametric versus semi-parametric spatial hedonic models," Journal of Geographical Systems, Springer, vol. 20(1), pages 27-55, January.
    4. Jan-Peter Kucklick & Jennifer Priefer & Daniel Beverungen & Oliver Müller, 2023. "Elucidating the Predictive Power of Search and Experience Qualities for Pricing of Complex Goods – A Machine Learning-based Study on Real Estate Appraisal," Working Papers Dissertations 112, Paderborn University, Faculty of Business Administration and Economics.
    5. Jennifer Priefer & Jan-Peter Kucklick & Daniel Beverungen & Oliver Müller, 2025. "Elucidating the Predictive Power of Search and Experience Qualities for Pricing of Complex Goods: A Machine Learning-based Study on Real Estate Appraisal," Working Papers Dissertations 138, Paderborn University, Faculty of Business Administration and Economics.
    6. Reza Amindarbari & Perver Baran & Ross K. Meentemeyer, 2023. "Spatially disaggregated simulation of interactions between home prices and land-use change," Environment and Planning B, , vol. 50(7), pages 1879-1897, September.
    7. Hillis, Holly & Karwowski, Nicole, 2025. "The Impact of Conservation Easements on Housing Prices in US Counties," 2025 AAEA & WAEA Joint Annual Meeting, July 27-29, 2025, Denver, CO 361122, Agricultural and Applied Economics Association.
    8. Jorge Onrubia & Fernando Pinto & María del Carmen Rodado Ruíz, 2025. "From digital search to deed: Forecasting UK housing purchases in Spain using Google Trends across the Brexit disruption," Working Papers 2025-08, FEDEA.
    9. Stig Vinther Møller & Thomas Pedersen & Erik Christian Montes Schütte & Allan Timmermann, 2024. "Search and Predictability of Prices in the Housing Market," Management Science, INFORMS, vol. 70(1), pages 415-438, January.
    10. Guarini, Maria Rosaria & Roma, Antonella & Sabatelli, Emma & Segura-de-la-Cal, Alejandro, 2025. "Intrinsic and extrinsic attributes in real estate pricing: Insights for sustainable urban planning strategies," Land Use Policy, Elsevier, vol. 153(C).
    11. David Coble & Pablo Pincheira, 2021. "Forecasting building permits with Google Trends," Empirical Economics, Springer, vol. 61(6), pages 3315-3345, December.
    12. Biktimirov, Ernest N. & Sokolyk, Tatyana & Ayanso, Anteneh, 2024. "Unpacking the relation between media sentiment and house prices: A topic modeling approach," Journal of Housing Economics, Elsevier, vol. 66(C).
    13. Wang, Kailai & Lim, Gino J. & Race, Bruce & Zhang, Yunpeng (Jack) & Gao, Lu & Qiao, Fengxiang (George), 2025. "Examining spatial patterns and economic interactions of logistics activities across three Texas metropolitan areas," Journal of Transport Geography, Elsevier, vol. 123(C).
    14. Behr,Daniela Monika & Chen,Lixue & Goel,Ankita & Haider,Khondoker Tanveer & Sandeep Singh & Zaman,Asad, 2023. "Estimating House Prices in Emerging Markets and Developing Economies : A Big Data Approach," Policy Research Working Paper Series 10301, The World Bank.
    15. Vandenbergh, Joren, 2024. "Seller and search behavior in the Belgian housing market," Journal of Housing Economics, Elsevier, vol. 64(C).
    16. Chauvet, Marcelle & Gabriel, Stuart & Lutz, Chandler, 2016. "Mortgage default risk: New evidence from internet search queries," Journal of Urban Economics, Elsevier, vol. 96(C), pages 91-111.
    17. George H. Lentz & Ko Wang, 1998. "Residential Appraisal and the Lending Process: A Survey of Issues," Journal of Real Estate Research, American Real Estate Society, vol. 15(1), pages 11-40.
    18. Sergiy Saydometov & Sanjiv Sabherwal & Ramya Rajajagadeesan Aroul, 2020. "Sentiment and its asymmetric effect on housing returns," Review of Financial Economics, John Wiley & Sons, vol. 38(4), pages 580-600, October.
    19. Frederico Mira Godinho & Katja Neugebauer, 2025. "House Hunting High and Low: Constructing a Housing Search Index for Portugal," Working Papers w202507, Banco de Portugal, Economics and Research Department.
    20. Gao, Qishuo & Shi, Vivien & Pettit, Christopher & Han, Hoon, 2022. "Property valuation using machine learning algorithms on statistical areas in Greater Sydney, Australia," Land Use Policy, Elsevier, vol. 123(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jjmath:v:2022:y:2022:i:1:n:2068353. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://onlinelibrary.wiley.com/journal/1469 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.