IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v11y2020i2d10.1007_s13198-020-00946-3.html
   My bibliography  Save this article

Big data analytics predicting real estate prices

Author

Listed:
  • Archana Singh

    (Amity University)

  • Apoorva Sharma

    (Amity University)

  • Gaurav Dubey

    (ABES Engineering College)

Abstract

The enormous data generated on daily basis amounts to big data technologies. This large amounts of data have knowledge and hidden patterns. Real estate turning out to be another biggest application in big data. The emphasis of this paper is to map the process involved in taking large amounts of data to predict the price of a house in real estate. The real estate sounds to be a long-term investment. In this paper, the housing Sale Data from Ames, Iowa is considered for the timeframe 2006–2010 with a view to construct relevant models to estimate the final sale price of a house. Due to high number of explanatory variables several models such as linear regression, random forest and gradient boosting models have been used as tools for feature selection to determine the statistically significant characteristics that influence the final sale price of a house. It has been observed that out of all the models, the gradient boosting model returned the efficient results.

Suggested Citation

  • Archana Singh & Apoorva Sharma & Gaurav Dubey, 2020. "Big data analytics predicting real estate prices," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 11(2), pages 208-219, July.
  • Handle: RePEc:spr:ijsaem:v:11:y:2020:i:2:d:10.1007_s13198-020-00946-3
    DOI: 10.1007/s13198-020-00946-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-020-00946-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-020-00946-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sonka, Steve, 2014. "Big Data and the Ag Sector: More than Lots of Numbers," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 17(1), pages 1-20, February.
    2. Peng Wang & Myounggu Kang, 2014. "An empirical analysis on the housing prices in the Pearl River Delta Economic Region of China," International Journal of Urban Sciences, Taylor & Francis Journals, vol. 18(1), pages 103-114, March.
    3. David Wheeler & Michael Tiefelsdorf, 2005. "Multicollinearity and correlation among local regression coefficients in geographically weighted regression," Journal of Geographical Systems, Springer, vol. 7(2), pages 161-187, June.
    4. Archana Singh & Ajay Rana & Jayanthi Ranjan, 2015. "Proposed analytical customer centric model for an automobile industry," International Journal of Data Mining, Modelling and Management, Inderscience Enterprises Ltd, vol. 7(4), pages 314-330.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jungsun Kim & Jaewoong Won & Hyeongsoon Kim & Joonghyeok Heo, 2021. "Machine-Learning-Based Prediction of Land Prices in Seoul, South Korea," Sustainability, MDPI, vol. 13(23), pages 1-14, November.
    2. Maral Taşcılar & Kerem Yavuz Arslanlı, 2022. "Forecasting commercial real estate indicators under COVID-19 by adopting human activity using social big data," Asia-Pacific Journal of Regional Science, Springer, vol. 6(3), pages 1111-1132, October.
    3. Cankun Wei & Meichen Fu & Li Wang & Hanbing Yang & Feng Tang & Yuqing Xiong, 2022. "The Research Development of Hedonic Price Model-Based Real Estate Appraisal in the Era of Big Data," Land, MDPI, vol. 11(3), pages 1-30, February.
    4. Marco Locurcio & Pierluigi Morano & Francesco Tajani & Felicia Di Liddo, 2020. "An Innovative GIS-Based Territorial Information Tool for the Evaluation of Corporate Properties: An Application to the Italian Context," Sustainability, MDPI, vol. 12(14), pages 1-29, July.
    5. Silva, Diego S. & Yamashita, Gabrielli Harumi & Cortimiglia, Marcelo Nogueira & Brust-Renck, Priscila G. & ten Caten, Carla Schwengber, 2022. "Are we ready to assess digital readiness? Exploring digital implications for social progress from the Network Readiness Index," Technology in Society, Elsevier, vol. 68(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chih-Hao Wang & Na Chen, 2021. "A multi-objective optimization approach to balancing economic efficiency and equity in accessibility to multi-use paths," Transportation, Springer, vol. 48(4), pages 1967-1986, August.
    2. Tsimpanos, Apostolos & Tsimbos, Cleon & Kalogirou, Stamatis, 2018. "Assessing spatial variation and heterogeneity of fertility in Greece at local authority level," MPRA Paper 100406, University Library of Munich, Germany.
    3. Diana Gutiérrez Posada & Fernando Rubiera Morollón & Ana Viñuela, 2018. "Ageing Places in an Ageing Country: The Local Dynamics of the Elderly Population in Spain," Tijdschrift voor Economische en Sociale Geografie, Royal Dutch Geographical Society KNAG, vol. 109(3), pages 332-349, July.
    4. Marco Helbich & Wolfgang Brunauer & Eric Vaz & Peter Nijkamp, 2014. "Spatial Heterogeneity in Hedonic House Price Models: The Case of Austria," Urban Studies, Urban Studies Journal Limited, vol. 51(2), pages 390-411, February.
    5. Daniel Acheampong & Tanya Benford, 2020. "Alternative Treatment Of Contribution In Aid Of Construction: The Impact On Investor-Owned Utility Plant Asset Replacement," Accounting & Taxation, The Institute for Business and Finance Research, vol. 12(1), pages 81-96.
    6. Yu, Haitao & Peng, Zhong-Ren, 2019. "Exploring the spatial variation of ridesourcing demand and its relationship to built environment and socioeconomic factors with the geographically weighted Poisson regression," Journal of Transport Geography, Elsevier, vol. 75(C), pages 147-163.
    7. Kristoffer B. Birkeland & Allan D. D'Silva & Roland Füss & Are Oust, 2021. "The Predictability of House Prices: "Human Against Machine"," International Real Estate Review, Global Social Science Institute, vol. 24(2), pages 139-183.
    8. Hoehun Ha & Wei Tu, 2018. "An Ecological Study on the Spatially Varying Relationship between County-Level Suicide Rates and Altitude in the United States," IJERPH, MDPI, vol. 15(4), pages 1-16, April.
    9. Alexis Comber & Paul Harris, 2018. "Geographically weighted elastic net logistic regression," Journal of Geographical Systems, Springer, vol. 20(4), pages 317-341, October.
    10. Whitaker, Stephan D., 2018. "Big Data versus a survey," The Quarterly Review of Economics and Finance, Elsevier, vol. 67(C), pages 285-296.
    11. Oshan, Taylor M., 2022. "Navigating the methodological landscape in spatial analysis: a comment on ‘A Route Map for Successful Applications of Geographically-Weighted Regression’," OSF Preprints rckzj, Center for Open Science.
    12. Hyunwoo Lim & Minyoung Park, 2019. "Modeling the Spatial Dimensions of Warehouse Rent Determinants: A Case Study of Seoul Metropolitan Area, South Korea," Sustainability, MDPI, vol. 12(1), pages 1-17, December.
    13. Carla Shoff & Tse-Chuan Yang, 2012. "Spatially varying predictors of teenage birth rates among counties in the United States," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 27(14), pages 377-418.
    14. Liukuan Zhang & Xiaoxiao Shi & Qing Chang, 2022. "Exploring Adaptive UHI Mitigation Solutions by Spatial Heterogeneity of Land Surface Temperature and Its Relationship to Urban Morphology in Historical Downtown Blocks, Beijing," Land, MDPI, vol. 11(4), pages 1-24, April.
    15. Ibeas, Ángel & Cordera, Ruben & dell'Olio, Luigi & Moura, Jose Luis, 2011. "Modelling demand in restricted parking zones," Transportation Research Part A: Policy and Practice, Elsevier, vol. 45(6), pages 485-498, July.
    16. Ciriaci, Daria & Palma, Daniela, 2010. "Geography, environmental efficiency and Italian economic growth: a spatially-adapted Environmental Kuznets Curve," MPRA Paper 22899, University Library of Munich, Germany.
    17. David C Wheeler, 2007. "Diagnostic Tools and a Remedial Method for Collinearity in Geographically Weighted Regression," Environment and Planning A, , vol. 39(10), pages 2464-2481, October.
    18. Wolfert, Sjaak & Ge, Lan & Verdouw, Cor & Bogaardt, Marc-Jeroen, 2017. "Big Data in Smart Farming – A review," Agricultural Systems, Elsevier, vol. 153(C), pages 69-80.
    19. Mauricio Sarrias, 2020. "Random Parameters and Spatial Heterogeneity using Rchoice in R," REGION, European Regional Science Association, vol. 7, pages 1-19.
    20. Ge, Lan & Bogaardt, Marc-Jeroen, 2015. "Bites into the Bits: Governance of Data Harvesting Initiatives in Agrifood Chains," 148th Seminar, November 30-December 1, 2015, The Hague, The Netherlands 229261, European Association of Agricultural Economists.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:11:y:2020:i:2:d:10.1007_s13198-020-00946-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.