IDEAS home Printed from https://ideas.repec.org/a/ire/issued/v28n042025p505-527.html

Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam

Author

Listed:
  • Minh-Thang Ha

    (Hung Yen University of Technology and Education)

  • Thi-Cham Nguyen

    (Haiphong University of Medicine and Pharmacy)

  • Thanh-Huyen Pham

    (Halong University)

  • Van-Hau Nguyen

    (Hung Yen University of Technology and Education)

Abstract

This study develops a comprehensive machine learning (ML) framework for house price prediction in Vietnam by utilizing a dataset of 28,156 property listings from a real estate website. We employ rigorous data preprocessing, feature engineering, and comparative analysis of ML algorithms, including CatBoost, XGBoost, and random forests. The results demonstrate the superiority of ensemble methods, with CatBoost achieving the highest performance on the main dataset (R² = 0.510, RMSE = 17.614). Regional analyses in Hanoi and Ho Chi Minh City reveal the adaptability of the models for local market dynamics. A Shapley additive explanations analysis reveals key drivers of house prices, such as area, population density, and property-specific attributes. The findings contribute to the academic understanding of real estate valuation and provide actionable insights for policymakers, investors, and other stakeholders. This study lays the groundwork for developing automated valuation models and their practical implementation, exemplified by a website application. By harnessing ML and data-driven insights, this research advances transparent, efficient, and informed decision-making in the real estate sector in Vietnam, while offering a robust methodology for house price prediction in emerging markets.

Suggested Citation

  • Minh-Thang Ha & Thi-Cham Nguyen & Thanh-Huyen Pham & Van-Hau Nguyen, 2025. "Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam," International Real Estate Review, Global Social Science Institute, vol. 28(4), pages 505-527.
  • Handle: RePEc:ire:issued:v:28:n:04:2025:p:505-527
    DOI: 10.53383/100412
    as

    Download full text from publisher

    File URL: https://doi.org/10.53383/100412
    File Function: Full text
    Download Restriction: no

    File URL: https://libkey.io/10.53383/100412?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, Taylor & Francis Journals, vol. 22(3), pages 313-336, January.
    2. Nghiep Nguyen & Al Cripps, 2001. "Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks," Journal of Real Estate Research, American Real Estate Society, vol. 22(3), pages 313-336.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sagi, Alon & Gal, Avigdor & Broitman, Dani & Czamanski, Daniel, 2024. "An unsupervised machine learning approach to the spatial analysis of urban systems through neighbourhoods’ dynamics," Land Use Policy, Elsevier, vol. 144(C).
    2. Demetriou, Demetris, 2016. "The assessment of land valuation in land consolidation schemes: The need for a new land valuation framework," Land Use Policy, Elsevier, vol. 54(C), pages 487-498.
    3. Cihan Çılgın & Hadi Gökçen, 2025. "A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction," Computational Economics, Springer;Society for Computational Economics, vol. 66(1), pages 127-178, July.
    4. Silviu-Ionuț BĂBȚAN, 2024. "AUTOMATED EVALUATION MODELS in real estate market: A comparative analysis between linear regression and XGBoost," Annales Universitatis Apulensis Series Oeconomica, Faculty of Sciences, "1 Decembrie 1918" University, Alba Iulia, vol. 2(26), pages 1-3.
    5. Kitova, Olga & Dyakonova, Ludmila & Savinova, Victoria, 2020. "Prediction of Socio-Economic Indicators of the Megapolis Development on the Basis of the Intellectual Forecasting Information System “SHM Horizon”," MPRA Paper 104234, University Library of Munich, Germany, revised 19 Nov 2020.
    6. Tien Foo Sing & Jesse Jingye Yang & Shi Ming Yu, 2022. "Boosted Tree Ensembles for Artificial Intelligence Based Automated Valuation Models (AI-AVM)," The Journal of Real Estate Finance and Economics, Springer, vol. 65(4), pages 649-674, November.
    7. Maurizio d’Amato, 2007. "Comparing Rough Set Theory with Multiple Regression Analysis as Automated Valuation Methodologies," International Real Estate Review, Global Social Science Institute, vol. 10(2), pages 42-65.
    8. Jose Torres-Pruñonosa & Pablo García-Estévez & Josep Maria Raya & Camilo Prado-Román, 2022. "How on Earth Did Spanish Banking Sell the Housing Stock?," SAGE Open, , vol. 12(1), pages 21582440221, March.
    9. Shawn L. Robey & Mark A McKnight & Misty R. Price & Rachel N. Coleman, 2019. "Considerations for a Regression-Based Real Estate Valuation and Appraisal Model: A Pilot Study," Accounting and Finance Research, Sciedu Press, vol. 8(2), pages 1-99, May.
    10. Koen Koning & Tatiana Filatova & Okmyung Bin, 2018. "Improved Methods for Predicting Property Prices in Hazard Prone Dynamic Markets," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 69(2), pages 247-263, February.
    11. Craig Ellis & Patrick J. Wilson & Ralf Zurbruegg, 2007. "Real Estate ‘Value’ Stocks and International Diversification," Journal of Property Research, Taylor & Francis Journals, vol. 24(3), pages 265-287, September.
    12. Plakandaras, Vasilios & Gupta, Rangan & Gogas, Periklis & Papadimitriou, Theophilos, 2015. "Forecasting the U.S. real house price index," Economic Modelling, Elsevier, vol. 45(C), pages 259-267.
    13. Alla Koblyakova & Larisa Fleishman & Orly Furman, 2022. "Accuracy of Households’ Dwelling Valuations, Housing Demand and Mortgage Decisions: Israeli Case," The Journal of Real Estate Finance and Economics, Springer, vol. 65(1), pages 48-74, July.
    14. Gang-Zhi Fan & Seow Eng Ong & Hian Chye Koh, 2006. "Determinants of House Price: A Decision Tree Approach," Urban Studies, Urban Studies Journal Limited, vol. 43(12), pages 2301-2315, November.
    15. Mehmet Emin Tabar & Aziz Sisman & Yasemin Sisman, 2023. "A Real Estate Appraisal Model with Artificial Neural Networks and Fuzzy Logic: A Local Case Study of Samsun City," International Real Estate Review, Global Social Science Institute, vol. 26(4), pages 569-585.
    16. Wan, Wayne Xinwei & Lindenthal, Thies, 2022. "Towards accountability in machine learning applications: A system-testing approach," ZEW Discussion Papers 22-001, ZEW - Leibniz Centre for European Economic Research.
    17. Jose Torres-Pruñonosa & Pablo García-Estévez & Camilo Prado-Román, 2021. "Artificial Neural Network, Quantile and Semi-Log Regression Modelling of Mass Appraisal in Housing," Mathematics, MDPI, vol. 9(7), pages 1-16, April.
    18. Camilo Serrano & Martin Hoesli, 2010. "Are Securitized Real Estate Returns more Predictable than Stock Returns?," The Journal of Real Estate Finance and Economics, Springer, vol. 41(2), pages 170-192, August.
    19. Kuan-Lun Pan & Hsiao Jung Teng & Shih-Yuan Lin & Yu En Cheng, 2021. "An Empirical Method for Decomposing the Contributions of Land and Building Values to Housing Value," International Real Estate Review, Global Social Science Institute, vol. 24(3), pages 385-403.
    20. Antipov, Evgeny & Pokryshevskaya, Elena, 2010. "Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics," MPRA Paper 27645, University Library of Munich, Germany.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ire:issued:v:28:n:04:2025:p:505-527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: IRER Graduate Assistant/Webmaster (email available below). General contact details of provider: https://www.gssinst.org/gssinst/index.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.