Printed from https://ideas.repec.org/a/sae/envirb/v51y2024i1p216-233.html

Prediction of residential and non-residential building usage in Germany based on a novel nationwide reference data set

Author

Listed:
  • André Hartmann
  • Martin Behnisch
  • Robert Hecht
  • Gotthard Meinel

Abstract

Building usage is an important variable in modelling the energetic, material and social properties of a building stock. Gathering this data on a large geographical scale and at the necessary temporal and spatial resolution, that is, at the level of individual buildings, is a challenging task. Machine learning algorithms such as Random Forest have proven useful in predicting building-related features, but they often rely on training sets of limited geographic scope, for example, cities. This study presents a workflow for predicting the semantic attribute of usage at the level of individual buildings. Based on screening data from the earlier ENOB:dataNWG project, a novel building ground-truth data set distributed across Germany, a Random Forest algorithm is used to assess how the German building stock can be classified according to its residential or non-residential use. Different sampling strategies were applied in order to find a robust evaluation metric for the classifier. Furthermore, the relevance of the feature set is highlighted, and it is examined whether regional differences in classification quality exist. Results show that classifying building footprints as residential or non-residential has good prospects, with an AUC of up to 0.9.
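
For readers unfamiliar with the AUC figure quoted above, the following minimal sketch shows one standard way to compute the area under the ROC curve: as the share of (positive, negative) pairs that a classifier ranks correctly. It is not the authors' code. The function name `roc_auc`, the encoding 1 = residential and 0 = non-residential, and the eight scores are all assumptions made up for illustration, and no Random Forest is trained here.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U).

    labels: 1 = residential, 0 = non-residential (hypothetical encoding).
    scores: classifier scores, higher meaning "more likely residential".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up scores for eight buildings, for illustration only.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
print(roc_auc(labels, scores))  # 15 of 16 pairs ranked correctly: 0.9375
```

Under this reading, an AUC of 0.9 means that a randomly chosen residential building receives a higher score than a randomly chosen non-residential one about 90% of the time, while 0.5 corresponds to random ranking.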

Suggested Citation

  • André Hartmann & Martin Behnisch & Robert Hecht & Gotthard Meinel, 2024. "Prediction of residential and non-residential building usage in Germany based on a novel nationwide reference data set," Environment and Planning B, vol. 51(1), pages 216-233, January.
  • Handle: RePEc:sae:envirb:v:51:y:2024:i:1:p:216-233
    DOI: 10.1177/23998083231175680

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/23998083231175680
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David Frantz & Franz Schug & Dominik Wiedenhofer & André Baumgart & Doris Virág & Sam Cooper & Camila Gómez-Medina & Fabian Lehmann & Thomas Udelhoven & Sebastian Linden & Patrick Hostert & Helmut Hab, 2023. "Unveiling patterns in human dominated landscapes through mapping the mass of US built structures," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Hertrich Markus, 2019. "A Novel Housing Price Misalignment Indicator for Germany," German Economic Review, De Gruyter, vol. 20(4), pages 759-794, December.
    3. Fareniuk Yana & Zatonatska Tetiana & Kovalenko Oksana & Dluhopolskyi Oleksandr, 2022. "Customer churn prediction model: a case of the telecommunication market," Economics, Sciendo, vol. 10(2), pages 109-130, December.
    4. Colosimo Bianca Maria & Moya Ester Gutierrez & Moroni Giovanni & Petrò Stefano, 2008. "Statistical Sampling Strategies for Geometric Tolerance Inspection by CMM," Stochastics and Quality Control, De Gruyter, vol. 23(1), pages 109-121, January.
    5. Yazdanie, Mashael & Densing, Martin & Wokaun, Alexander, 2017. "Cost optimal urban energy systems planning in the context of national energy policies: A case study for the city of Basel," Energy Policy, Elsevier, vol. 110(C), pages 176-190.
    6. Sürücü, Lütfi & YIKILMAZ, İbrahim & MASLAKÇI, Ahmet, 2022. "Exploratory Factor Analysis (EFA) in Quantitative Researches and Practical Considerations," OSF Preprints fgd4e, Center for Open Science.
    7. Hatem Jemmali & Mohamed Salah Matoussi, 2012. "A Multidimensional Analysis of Water Poverty at A Local Scale- Application of Improved Water Poverty Index for Tunisia," Working Papers 730, Economic Research Forum, revised 2012.
    8. Xia, Huosong & Wang, Yuan & Zhang, Justin Zuopeng & Zheng, Leven J. & Kamal, Muhammad Mustafa & Arya, Varsha, 2023. "COVID-19 fake news detection: A hybrid CNN-BiLSTM-AM model," Technological Forecasting and Social Change, Elsevier, vol. 195(C).
    9. Sandra Hadam, 2023. "Experimentelle georeferenzierte Bevölkerungszahl auf Basis der Bevölkerungsfortschreibung und Mobilfunkdaten [Experimental georeferenced population figure based on intercensal population updates an," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 17(1), pages 35-69, March.
    10. Katharina Bohnenberger, 2020. "Money, Vouchers, Public Infrastructures? A Framework for Sustainable Welfare Benefits," Sustainability, MDPI, vol. 12(2), pages 1-30, January.
    11. Abhinash Jenasamanta & Subrajeet Mohapatra, 2022. "An automated system for the assessment and grading of adolescent delinquency using a machine learning-based soft voting framework," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-11, December.
    12. Pattravadee Ploykitikoon & Charles M. Weber, 2019. "Knowledge Pathways and Performance: An Empirical Study of the National Laboratories in a Technology Latecomer Country," International Journal of Innovation and Technology Management (IJITM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 1-37, May.
    13. Kiatkulchai Jitt-Aer & Graham Wall & Dylan Jones & Richard Teeuw, 2022. "Use of GIS and dasymetric mapping for estimating tsunami-affected population to facilitate humanitarian relief logistics: a case study from Phuket, Thailand," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 113(1), pages 185-211, August.
    14. Gweneth Leigh & Milica Muminovic & Rachel Davey, 2023. "Enjoyed by Jack but Endured by Jill: An Exploratory Case Study Examining Differences in Adolescent Design Preferences and Perceived Impacts of a Secondary Schoolyard," IJERPH, MDPI, vol. 20(5), pages 1-14, February.
    15. Pacheco, Joaquín & Casado, Silvia & Porras, Santiago, 2013. "Exact methods for variable selection in principal component analysis: Guide functions and pre-selection," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 95-111.
    16. Thomas Wiedmann & Guangwu Chen & Anne Owen & Manfred Lenzen & Michael Doust & John Barrett & Kristian Steele, 2021. "Three‐scope carbon emission inventories of global cities," Journal of Industrial Ecology, Yale University, vol. 25(3), pages 735-750, June.
    17. Soomauroo, Zakia & Blechinger, Philipp & Creutzig, Felix, 2023. "Electrifying public transit benefits public finances in small island developing states," Transport Policy, Elsevier, vol. 138(C), pages 45-59.
    18. Jérome SARACCO & Marie CHAVENT & Vanessa KUENTZ, 2010. "Clustering of categorical variables around latent variables," Cahiers du GREThA (2007-2019) 2010-02, Groupe de Recherche en Economie Théorique et Appliquée (GREThA).
    19. Petros Kalakonas & Vitor Silva & Amaryllis Mouyiannou & Anirudh Rao, 2020. "Exploring the impact of epistemic uncertainty on a regional probabilistic seismic risk assessment model," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 104(1), pages 997-1020, October.
    20. Véronique Cariou & Stéphane Verdun & Emmanuelle Diaz & El Qannari & Evelyne Vigneau, 2009. "Comparison of three hypothesis testing approaches for the selection of the appropriate number of clusters of variables," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 3(3), pages 227-241, December.
