IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v7y2022i12p170-d986226.html
   My bibliography  Save this article

Identifying and Classifying Urban Data Sources for Machine Learning-Based Sustainable Urban Planning and Decision Support Systems Development

Author

Listed:
  • Stéphane C. K. Tékouabou

    (Center of Urban Systems (CUS), Mohammed VI Polytechnic University (UM6P), Hay Moulay Rachid, Ben Guerir 43150, Morocco)

  • Jérôme Chenal

    (Center of Urban Systems (CUS), Mohammed VI Polytechnic University (UM6P), Hay Moulay Rachid, Ben Guerir 43150, Morocco
    Urban and Regional Planning Community (CEAT), Ecole Polytechnique Federale de Lausanne (EPFL), 1015 Lausanne, Switzerland)

  • Rida Azmi

    (Center of Urban Systems (CUS), Mohammed VI Polytechnic University (UM6P), Hay Moulay Rachid, Ben Guerir 43150, Morocco)

  • Hamza Toulni

    (EIGSI, 282 Route of the Oasis, Mâarif, Casablanca 20140, Morocco
    LISTD Laboratory, Department of Computer Sciences, Mines School of Rabat, Av Hadj Ahmed Cherkaoui, Agdal, P.O. 753, Rabat 10090, Morocco)

  • El Bachir Diop

    (Center of Urban Systems (CUS), Mohammed VI Polytechnic University (UM6P), Hay Moulay Rachid, Ben Guerir 43150, Morocco)

  • Anastasija Nikiforova

    (Institute of Computer Science, Faculty of Science and Technology, University of Tartu, Narva mnt 18, 51009 Tartu, Estonia
    European Open Science Cloud Task Force “FAIR Metrics and Data Quality”, 1050 Brussels, Belgium)

Abstract

With the increase in the amount and variety of data that are constantly produced, collected, and exchanged between systems, the efficiency and accuracy of solutions/services that use data as input may suffer if an inappropriate or inaccurate technique, method, or tool is chosen to deal with them. This paper presents a global overview of urban data sources and structures used to train machine learning (ML) algorithms integrated into urban planning decision support systems (DSS). It contributes to a common understanding of choosing the right urban data for a given urban planning issue, i.e., their type, source and structure, for more efficient use in training ML models. For the purpose of this study, we conduct a systematic literature review (SLR) of all relevant peer-reviewed studies available in the Scopus database. More precisely, 248 papers were found to be relevant with their further analysis using a text-mining approach to determine (a) the main urban data sources used for ML modeling, (b) the most popular approaches used in relevant urban planning and urban problem-solving studies and their relationship to the type of data source used, and (c) the problems commonly encountered in their use. After classifying them, we identified the strengths and weaknesses of data sources depending on several predefined factors. We found that the data mainly come from two main categories of sources, namely (1) sensors and (2) statistical surveys, including social network data. They can be classified as (a) opportunistic or (b) non-opportunistic depending on the process of data acquisition, collection, and storage. Data sources are closely correlated with their structure and potential urban planning issues to be addressed. Almost all urban data have an indexed structure and, in particular, either attribute tables for statistical survey data and data from simple sensors (e.g., climate and pollution sensors) or vectors, mostly obtained from satellite images after large-scale spatio-temporal analysis. The paper also provides a discussion of the potential opportunities, emerging issues, and challenges that urban data sources face and should overcome to better catalyze intelligent/smart planning. This should contribute to the general understanding of the data, their sources and the challenges to be faced and overcome by those seeking data and integrating them into smart applications and urban-planning processes.

Suggested Citation

  • Stéphane C. K. Tékouabou & Jérôme Chenal & Rida Azmi & Hamza Toulni & El Bachir Diop & Anastasija Nikiforova, 2022. "Identifying and Classifying Urban Data Sources for Machine Learning-Based Sustainable Urban Planning and Decision Support Systems Development," Data, MDPI, vol. 7(12), pages 1-19, November.
  • Handle: RePEc:gam:jdataj:v:7:y:2022:i:12:p:170-:d:986226
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/7/12/170/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/7/12/170/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ma, Jun & Cheng, Jack C.P. & Jiang, Feifeng & Chen, Weiwei & Zhang, Jingcheng, 2020. "Analyzing driving factors of land values in urban scale based on big data and non-linear machine learning techniques," Land Use Policy, Elsevier, vol. 94(C).
    2. Idan Porat & Dalit Shach-Pinsly, 2021. "Building morphometric analysis as a tool for urban renewal: Identifying post-Second World War mass public housing development potential," Environment and Planning B, , vol. 48(2), pages 248-264, February.
    3. Eren Erman Ozguven & Mark W. Horner & Ayberk Kocatepe & Jean Michael Marcelin & Yassir Abdelrazig & Thobias Sando & Ren Moses, 2016. "Metadata-based Needs Assessment for Emergency Transportation Operations with a Focus on an Aging Population: A Case Study in Florida," Transport Reviews, Taylor & Francis Journals, vol. 36(3), pages 383-412, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qing Lu & Jing Ning & Hong You & Liyan Xu, 2023. "Urban Intensity in Theory and Practice: Empirical Determining Mechanism of Floor Area Ratio and Its Deviation from the Classic Location Theories in Beijing," Land, MDPI, vol. 12(2), pages 1-16, February.
    2. Kopczewska, Katarzyna & Ćwiakowski, Piotr, 2021. "Spatio-temporal stability of housing submarkets. Tracking spatial location of clusters of geographically weighted regression estimates of price determinants," Land Use Policy, Elsevier, vol. 103(C).
    3. Siyoon Kwon & Hyoseob Noh & Il Won Seo & Sung Hyun Jung & Donghae Baek, 2021. "Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis," IJERPH, MDPI, vol. 18(3), pages 1-26, January.
    4. Seyed Vahid Razavi-Termeh & Abolghasem Sadeghi-Niaraki & Farbod Farhangi & Soo-Mi Choi, 2021. "COVID-19 Risk Mapping with Considering Socio-Economic Criteria Using Machine Learning Algorithms," IJERPH, MDPI, vol. 18(18), pages 1-21, September.
    5. Sun, Jianing & Zhou, Tao & Wang, Di, 2022. "Relationships between urban form and air quality: A reconsideration based on evidence from China’s five urban agglomerations during the COVID-19 pandemic," Land Use Policy, Elsevier, vol. 118(C).
    6. Abioye, Olumide F. & Dulebenets, Maxim A. & Ozguven, Eren Erman & Moses, Ren & Boot, Walter R. & Sando, Thobias, 2020. "Assessing perceived driving difficulties under emergency evacuation for vulnerable population groups," Socio-Economic Planning Sciences, Elsevier, vol. 72(C).
    7. Jiang, Feifeng & Ma, Jun & Li, Zheng & Ding, Yuexiong, 2022. "Prediction of energy use intensity of urban buildings using the semi-supervised deep learning model," Energy, Elsevier, vol. 249(C).
    8. Doan, Quang Cuong, 2023. "Determining the optimal land valuation model: A case study of Hanoi, Vietnam," Land Use Policy, Elsevier, vol. 127(C).
    9. Zhenwei Wang & Xiaochun Wang & Zijin Dong & Lisan Li & Wangjun Li & Shicheng Li, 2023. "More Urban Elderly Care Facilities Should Be Placed in Densely Populated Areas for an Aging Wuhan of China," Land, MDPI, vol. 12(1), pages 1-13, January.
    10. Cankun Wei & Meichen Fu & Li Wang & Hanbing Yang & Feng Tang & Yuqing Xiong, 2022. "The Research Development of Hedonic Price Model-Based Real Estate Appraisal in the Era of Big Data," Land, MDPI, vol. 11(3), pages 1-30, February.
    11. Sisman, S. & Aydinoglu, A.C., 2022. "Improving performance of mass real estate valuation through application of the dataset optimization and Spatially Constrained Multivariate Clustering Analysis," Land Use Policy, Elsevier, vol. 119(C).
    12. Kim, Daehwan & Seo, Ducksu & Kwon, Youngsang, 2021. "Novel trends in SNS customers in food and beverage patronage: An empirical study of metropolitan cities in South Korea," Land Use Policy, Elsevier, vol. 101(C).
    13. Hadas Shadar & Dalit Shach-Pinsly, 2022. "From Public Housing to Private Housing: Neglect of Urban Qualities during the Urban Regeneration Process," Land, MDPI, vol. 11(6), pages 1-17, June.
    14. Sesil Koutra & Christos S. Ioakimidis, 2022. "Unveiling the Potential of Machine Learning Applications in Urban Planning Challenges," Land, MDPI, vol. 12(1), pages 1-19, December.
    15. Dulebenets, Maxim A. & Abioye, Olumide F. & Ozguven, Eren Erman & Moses, Ren & Boot, Walter R. & Sando, Thobias, 2019. "Development of statistical models for improving efficiency of emergency evacuation in areas with vulnerable population," Reliability Engineering and System Safety, Elsevier, vol. 182(C), pages 233-249.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:7:y:2022:i:12:p:170-:d:986226. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.