Printed from https://ideas.repec.org/a/ire/issued/v28n042025p505-527.html

Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam

Author

Listed:
  • Minh-Thang Ha

    (Hung Yen University of Technology and Education)

  • Thi-Cham Nguyen

    (Haiphong University of Medicine and Pharmacy)

  • Thanh-Huyen Pham

    (Halong University)

  • Van-Hau Nguyen

    (Hung Yen University of Technology and Education)

Abstract

This study develops a comprehensive machine learning (ML) framework for house price prediction in Vietnam by utilizing a dataset of 28,156 property listings from a real estate website. We employ rigorous data preprocessing, feature engineering, and comparative analysis of ML algorithms, including CatBoost, XGBoost, and random forests. The results demonstrate the superiority of ensemble methods, with CatBoost achieving the highest performance on the main dataset (R² = 0.510, RMSE = 17.614). Regional analyses in Hanoi and Ho Chi Minh City reveal the adaptability of the models to local market dynamics. A Shapley additive explanations analysis reveals key drivers of house prices, such as area, population density, and property-specific attributes. The findings contribute to the academic understanding of real estate valuation and provide actionable insights for policymakers, investors, and other stakeholders. This study lays the groundwork for developing automated valuation models and their practical implementation, exemplified by a website application. By harnessing ML and data-driven insights, this research advances transparent, efficient, and informed decision-making in the real estate sector in Vietnam, while offering a robust methodology for house price prediction in emerging markets.
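
The abstract reports model quality as R² and RMSE. As a reading aid, the sketch below shows how these two metrics are conventionally defined. It is not the authors' code: the function names `rmse` and `r_squared` and the sample values are assumptions made for illustration only, and the paper's own evaluation pipeline may differ.

```python
import math
from typing import Sequence


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared error, in the same units as the target."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all targets are identical")
    return 1.0 - ss_res / ss_tot


# Hypothetical prices and predictions (illustrative values, not from the paper).
actual = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 37.0]

print("RMSE:", rmse(actual, predicted))
print("R^2: ", r_squared(actual, predicted))
```

Under these definitions, an R² of 0.510 would mean the model accounts for roughly half of the variance in listed prices, while the RMSE is expressed in the price unit used by the dataset, which the abstract does not state.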

Suggested Citation

  • Minh-Thang Ha & Thi-Cham Nguyen & Thanh-Huyen Pham & Van-Hau Nguyen, 2025. "Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam," International Real Estate Review, Global Social Science Institute, vol. 28(4), pages 505-527.
  • Handle: RePEc:ire:issued:v:28:n:04:2025:p:505-527
    DOI: 10.53383/100412

    Download full text from publisher

    File URL: https://doi.org/10.53383/100412
    File Function: Full text
    Download Restriction: no


    References listed on IDEAS

    1. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, American Real Estate Society, vol. 22(3), pages 313-336.
    2. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, Taylor & Francis Journals, vol. 22(3), pages 313-336, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sagi, Alon & Gal, Avigdor & Broitman, Dani & Czamanski, Daniel, 2024. "An unsupervised machine learning approach to the spatial analysis of urban systems through neighbourhoods’ dynamics," Land Use Policy, Elsevier, vol. 144(C).
    2. Demetriou, Demetris, 2016. "The assessment of land valuation in land consolidation schemes: The need for a new land valuation framework," Land Use Policy, Elsevier, vol. 54(C), pages 487-498.
    3. Cihan Çılgın & Hadi Gökçen, 2025. "A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction," Computational Economics, Springer;Society for Computational Economics, vol. 66(1), pages 127-178, July.
    4. Silviu-Ionuț BĂBȚAN, 2024. "AUTOMATED EVALUATION MODELS in real estate market: A comparative analysis between linear regression and XGBoost," Annales Universitatis Apulensis Series Oeconomica, Faculty of Sciences, "1 Decembrie 1918" University, Alba Iulia, vol. 2(26), pages 1-3.
    5. Kitova, Olga & Dyakonova, Ludmila & Savinova, Victoria, 2020. "Prediction of Socio-Economic Indicators of the Megapolis Development on the Basis of the Intellectual Forecasting Information System “SHM Horizon”," MPRA Paper 104234, University Library of Munich, Germany, revised 19 Nov 2020.
    6. Maurizio d’Amato, 2007. "Comparing Rough Set Theory with Multiple Regression Analysis as Automated Valuation Methodologies," International Real Estate Review, Global Social Science Institute, vol. 10(2), pages 42-65.
    7. Jose Torres-Pruñonosa & Pablo García-Estévez & Josep Maria Raya & Camilo Prado-Román, 2022. "How on Earth Did Spanish Banking Sell the Housing Stock?," SAGE Open, vol. 12(1), pages 21582440221, March.
    8. Alla Koblyakova & Larisa Fleishman & Orly Furman, 2022. "Accuracy of Households’ Dwelling Valuations, Housing Demand and Mortgage Decisions: Israeli Case," The Journal of Real Estate Finance and Economics, Springer, vol. 65(1), pages 48-74, July.
    9. Mehmet Emin Tabar & Aziz Sisman & Yasemin Sisman, 2023. "A Real Estate Appraisal Model with Artificial Neural Networks and Fuzzy Logic: A Local Case Study of Samsun City," International Real Estate Review, Global Social Science Institute, vol. 26(4), pages 569-585.
    10. Jose Torres-Pruñonosa & Pablo García-Estévez & Camilo Prado-Román, 2021. "Artificial Neural Network, Quantile and Semi-Log Regression Modelling of Mass Appraisal in Housing," Mathematics, MDPI, vol. 9(7), pages 1-16, April.
    11. Kuan-Lun Pan & Hsiao Jung Teng & Shih-Yuan Lin & Yu En Cheng, 2021. "An Empirical Method for Decomposing the Contributions of Land and Building Values to Housing Value," International Real Estate Review, Global Social Science Institute, vol. 24(3), pages 385-403.
    12. Antipov, Evgeny & Pokryshevskaya, Elena, 2010. "Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics," MPRA Paper 27645, University Library of Munich, Germany.
    13. Demetriou, Demetris, 2018. "Automating the land valuation process carried out in land consolidation schemes," Land Use Policy, Elsevier, vol. 75(C), pages 21-32.
    14. Christian Pierdzioch, 2012. "Macroeconomic Factors and the German Real Estate Market: A Stock-Market-Based Forecasting Experiment," Review of Economics & Finance, Better Advances Press, Canada, vol. 2, pages 87-96, May.
    15. Renigier-Biłozor, Małgorzata & Źróbek, Sabina & Walacik, Marek & Borst, Richard & Grover, Richard & d’Amato, Maurizio, 2022. "International acceptance of automated modern tools use must-have for sustainable real estate market development," Land Use Policy, Elsevier, vol. 113(C).
    16. repec:ire:issued:v:26:n:04:2023:p:565-581 is not listed on IDEAS
    17. repec:ipg:wpaper:2014-473 is not listed on IDEAS
    18. Bovkir, Rabia & Aydinoglu, Arif Cagdas, 2018. "Providing land value information from geographic data infrastructure by using fuzzy logic analysis approach," Land Use Policy, Elsevier, vol. 78(C), pages 46-60.
    19. Plakandaras, Vasilios & Gupta, Rangan & Gogas, Periklis & Papadimitriou, Theophilos, 2015. "Forecasting the U.S. real house price index," Economic Modelling, Elsevier, vol. 45(C), pages 259-267.
    20. Susanna Levantesi & Gabriella Piscopo, 2020. "The Importance of Economic Variables on London Real Estate Market: A Random Forest Approach," Risks, MDPI, vol. 8(4), pages 1-17, October.
    21. Yan Kestens & Marius Thériault & François Des Rosiers, 2004. "The Impact of Surrounding Land Use and Vegetation on Single-Family House Prices," Environment and Planning B, vol. 31(4), pages 539-567, August.
    22. Wang, Dan & Tang, Yu-Ting & He, Jun & Yang, Fei & Robinson, Darren, 2021. "Generalized models to predict the lower heating value (LHV) of municipal solid waste (MSW)," Energy, Elsevier, vol. 216(C).
