IDEAS home Printed from https://ideas.repec.org/a/ire/issued/v28n042025p505-527.html

Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam

Author

Listed:
  • Minh-Thang Ha

    (Hung Yen University of Technology and Education)

  • Thi-Cham Nguyen

    (Haiphong University of Medicine and Pharmacy)

  • Thanh-Huyen Pham

    (Halong University)

  • Van-Hau Nguyen

    (Hung Yen University of Technology and Education)

Abstract

This study develops a comprehensive machine learning (ML) framework for house price prediction in Vietnam by utilizing a dataset of 28,156 property listings from a real estate website. We employ rigorous data preprocessing, feature engineering, and comparative analysis of ML algorithms, including CatBoost, XGBoost, and random forests. The results demonstrate the superiority of ensemble methods, with CatBoost achieving the highest performance on the main dataset (R² = 0.510, RMSE = 17.614). Regional analyses in Hanoi and Ho Chi Minh City reveal the adaptability of the models for local market dynamics. A Shapley additive explanations analysis reveals key drivers of house prices, such as area, population density, and property-specific attributes. The findings contribute to the academic understanding of real estate valuation and provide actionable insights for policymakers, investors, and other stakeholders. This study lays the groundwork for developing automated valuation models and their practical implementation, exemplified by a website application. By harnessing ML and data-driven insights, this research advances transparent, efficient, and informed decision-making in the real estate sector in Vietnam, while offering a robust methodology for house price prediction in emerging markets.

Suggested Citation

  • Minh-Thang Ha & Thi-Cham Nguyen & Thanh-Huyen Pham & Van-Hau Nguyen, 2025. "Using Machine Learning Regression Algorithms to Predict House Prices in Vietnam," International Real Estate Review, Global Social Science Institute, vol. 28(4), pages 505-527.
  • Handle: RePEc:ire:issued:v:28:n:04:2025:p:505-527
    DOI: 10.53383/100412
    as

    Download full text from publisher

    File URL: https://doi.org/10.53383/100412
    File Function: Full text
    Download Restriction: no

    File URL: https://libkey.io/10.53383/100412?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ire:issued:v:28:n:04:2025:p:505-527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: IRER Graduate Assistant/Webmaster (email available below). General contact details of provider: https://www.gssinst.org/gssinst/index.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.