Printed from https://ideas.repec.org/a/gam/jdataj/v11y2026i1p16-d1838909.html

A BIM-Derived Synthetic Point Cloud (SPC) Dataset for Construction Scene Component Segmentation

Author

Listed:
  • Yiquan Zou

    (School of Civil Engineering, Architecture and Environment, Hubei University of Technology, 28 Nanli Road, Wuhan 430068, China)

  • Tianxiang Liang

    (School of Civil Engineering, Architecture and Environment, Hubei University of Technology, 28 Nanli Road, Wuhan 430068, China)

  • Wenxuan Chen

    (School of Civil Engineering, Architecture and Environment, Hubei University of Technology, 28 Nanli Road, Wuhan 430068, China)

  • Zhixiang Ren

    (School of Civil Engineering, Architecture and Environment, Hubei University of Technology, 28 Nanli Road, Wuhan 430068, China)

  • Yuhan Wen

    (School of Civil Engineering, Architecture and Environment, Hubei University of Technology, 28 Nanli Road, Wuhan 430068, China)

Abstract

In intelligent construction and BIM–Reality integration applications, high-quality, large-scale construction scene point cloud data with component-level semantic annotations constitute a fundamental basis for three-dimensional semantic understanding and automated analysis. However, point clouds acquired from real construction sites commonly suffer from high labeling costs, severe occlusion, and unstable data distributions. Existing public datasets remain insufficient in terms of scale, component coverage, and annotation consistency, limiting their suitability for data-driven approaches. To address these challenges, this paper constructs and releases a BIM-derived synthetic construction scene point cloud dataset, termed the Synthetic Point Cloud (SPC), targeting component-level point cloud semantic segmentation and related research tasks. The dataset is generated from publicly available BIM models through physics-based virtual LiDAR scanning, producing multi-view and multi-density three-dimensional point clouds while automatically inheriting component-level semantic labels from BIM without any manual intervention. The SPC dataset comprises 132 virtual scanning scenes, with an overall scale of approximately 8.75 × 10^9 points, covering typical construction components such as walls, columns, beams, and slabs. By systematically configuring scanning viewpoints, sampling densities, and occlusion conditions, the dataset introduces rich geometric and spatial distribution diversity. This paper presents a comprehensive description of the SPC data generation pipeline, semantic mapping strategy, virtual scanning configurations, and data organization scheme, followed by statistical analysis and technical validation in terms of point cloud scale evolution, spatial coverage characteristics, and component-wise semantic distributions. Furthermore, baseline experiments on component-level point cloud semantic segmentation are provided.
The results demonstrate that models trained solely on the SPC dataset can achieve stable and engineering-meaningful component-level predictions on real construction point clouds, validating the dataset’s usability in virtual-to-real research scenarios. As a scalable and reproducible BIM-derived point cloud resource, the SPC dataset offers a unified data foundation and experimental support for research on construction scene point cloud semantic segmentation, virtual-to-real transfer learning, scan-to-BIM updating, and intelligent construction monitoring.
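
The idea of virtual scanning with inherited labels can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes each BIM component is reduced to an axis-aligned box carrying its class name, and it replaces the physics-based LiDAR model with plain ray casting on a regular angular grid. The names Box, ray_box_distance, and virtual_scan, the toy scene, and all parameter values are hypothetical. Each ray keeps only its nearest hit, so occluded surfaces produce no points, and every point takes the label of the component it hit.

```python
import math
from typing import NamedTuple, Optional


class Box(NamedTuple):
    """Hypothetical stand-in for a BIM component: a labeled axis-aligned box."""
    label: str
    lo: tuple  # (x, y, z) minimum corner
    hi: tuple  # (x, y, z) maximum corner


def ray_box_distance(origin, direction, box) -> Optional[float]:
    """Slab test: distance along the ray to the box, or None if the ray misses."""
    t_near, t_far = 0.0, math.inf
    for o, d, lo, hi in zip(origin, direction, box.lo, box.hi):
        if abs(d) < 1e-12:
            # Ray is parallel to this pair of faces: it must start between them.
            if o < lo or o > hi:
                return None
            continue
        t0, t1 = (lo - o) / d, (hi - o) / d
        if t0 > t1:
            t0, t1 = t1, t0
        t_near, t_far = max(t_near, t0), min(t_far, t1)
        if t_near > t_far:
            return None
    return t_near


def virtual_scan(origin, boxes, n_azimuth=90, n_elevation=30):
    """Cast rays on an azimuth/elevation grid; return (x, y, z, label) tuples."""
    points = []
    for i in range(n_azimuth):
        az = 2 * math.pi * i / n_azimuth
        for j in range(n_elevation):
            # Assumed vertical field of view: -45 to +45 degrees.
            el = math.radians(-45 + 90 * (j + 0.5) / n_elevation)
            d = (math.cos(el) * math.cos(az),
                 math.cos(el) * math.sin(az),
                 math.sin(el))
            best = None  # nearest hit so far: (distance, label)
            for box in boxes:
                t = ray_box_distance(origin, d, box)
                if t is not None and (best is None or t < best[0]):
                    best = (t, box.label)
            if best is not None:
                t, label = best
                points.append((origin[0] + t * d[0],
                               origin[1] + t * d[1],
                               origin[2] + t * d[2],
                               label))
    return points


# Toy scene: a floor slab, a wall, and a column standing in front of the wall.
scene = [
    Box("slab", (-10.0, -10.0, -0.2), (10.0, 10.0, 0.0)),
    Box("wall", (5.0, -10.0, 0.0), (5.2, 10.0, 3.0)),
    Box("column", (2.0, -0.2, 0.0), (2.4, 0.2, 3.0)),
]
points = virtual_scan((0.0, 0.0, 1.5), scene)
```

Raising n_azimuth and n_elevation, or moving the scanner origin, changes sampling density and viewpoint, which is the kind of variation the abstract describes; a real implementation would cast rays against triangle meshes exported from the BIM model rather than boxes.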

Suggested Citation

  • Yiquan Zou & Tianxiang Liang & Wenxuan Chen & Zhixiang Ren & Yuhan Wen, 2026. "A BIM-Derived Synthetic Point Cloud (SPC) Dataset for Construction Scene Component Segmentation," Data, MDPI, vol. 11(1), pages 1-19, January.
  • Handle: RePEc:gam:jdataj:v:11:y:2026:i:1:p:16-:d:1838909

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/11/1/16/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/11/1/16/
    Download Restriction: no

