IDEAS home Printed from https://ideas.repec.org/h/spr/prbchp/978-981-95-9113-8_31.html

Application of Machine Learning for Condominium Price Prediction Using Real-Time Web Scraped Data: Evidence from Hanoi’s Emerging Market

Author

Listed:
  • Binh Minh An Nguyen

    (Hanoi University of Industry, School of Economics)

  • Tuan Khai Duong

    (Hanoi University of Industry, School of Economics)

  • Tung Lam Nguyen

    (Hanoi University of Industry, School of Economics)

  • Hung Cuong Tran

    (Hanoi University of Industry, School of Information and Communication Technology)

Abstract

Real estate valuation and prediction in emerging markets often rely heavily on subjective judgment. The study examined current valuation practices at three real estate agencies in Hanoi. The findings show inefficiencies and biases in current practices. To address these challenges, this research aims to develop a data-driven framework that integrates real-time web-scraped data to predict condominium prices in Hanoi, Vietnam. The data set was scraped from the most popular proptech platform, batdongsan.com, from 18/08/2025 to 21/11/2025. The scraped data contains 22,565 observations and 16 explanatory variables. The research proposed deploying three ensemble learning models – Random Forest, XGBoost, and CatBoost – as suitable methods for the data conditions. The performance of these models was evaluated using RMSE, MAE, MSE, and R2. The research highlighted the need for modernization and transparency in property valuation in Vietnam and advocated data-driven practices.

Suggested Citation

  • Binh Minh An Nguyen & Tuan Khai Duong & Tung Lam Nguyen & Hung Cuong Tran, 2026. "Application of Machine Learning for Condominium Price Prediction Using Real-Time Web Scraped Data: Evidence from Hanoi’s Emerging Market," Springer Proceedings in Business and Economics,, Springer.
  • Handle: RePEc:spr:prbchp:978-981-95-9113-8_31
    DOI: 10.1007/978-981-95-9113-8_31
    as

    Download full text from publisher

    To our knowledge, this item is not available for download. To find whether it is available, there are three options:
    1. Check below whether another version of this item is available online.
    2. Check on the provider's web page whether it is in fact available.
    3. Perform a
    for a similarly titled item that would be available.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:prbchp:978-981-95-9113-8_31. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.