
Data-driven estimation of building energy consumption with multi-source heterogeneous data

Author

Listed:
  • Pan, Yue
  • Zhang, Limao

Abstract

For better energy evaluation and management, a categorical boosting (CatBoost)-based predictive method is presented to estimate building energy consumption accurately by learning from large volumes of multi-source heterogeneous data collected from buildings. Specifically, the recently developed CatBoost model, an ensemble learning method, is well suited to handling categorical variables and producing reliable results. As a case study, the proposed method is validated on a multi-dimensional dataset on Seattle's building energy performance provided by the city's government, with the aim of estimating the weather-normalized site energy use intensity of buildings and characterizing its non-linear relationship with 12 other potentially influential features. Results from 5-fold cross-validation show that the model predicts the value of energy intensity with high precision, reaching an R² of 0.897 and outperforming popular machine learning algorithms including random forest and gradient boosting decision tree. Based on a defined threshold, the predicted values can be classified as normal or abnormal energy consumption, reaching an accuracy of 99.32% for outlier detection, which helps flag potential risks at an early stage and develop strategies to enhance energy efficiency. Moreover, results from the established model can be interpreted objectively, suggesting that features concerning physical and energy characteristics contribute more to energy estimation than environmental features. Since these results explain building energy consumption and efficiency in a data-driven manner, they can ultimately guide building owners and designers in designing and renovating buildings to achieve better energy-conserving performance.
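The evaluation procedure described in the abstract (5-fold cross-validation scored by R², followed by threshold-based labelling of predictions as normal or abnormal) can be illustrated with a minimal sketch. This is not the authors' implementation: a one-feature least-squares line stands in for CatBoost, the data are synthetic, and the function names and the threshold value of 150.0 are hypothetical choices made only for illustration.

```python
import random


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k near-equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_line(xs, ys):
    """Stand-in regressor (one-feature least squares), NOT CatBoost."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    intercept = my - slope * mx
    return lambda x: slope * x + intercept


def cross_validate(xs, ys, k=5, threshold=150.0):
    """Return mean R^2 and mean normal/abnormal accuracy over k folds.

    A sample is labelled 'abnormal' when its value exceeds `threshold`
    (a hypothetical cut-off; the paper's threshold is not given here).
    """
    r2s, accs = [], []
    for held_out in k_fold_indices(len(xs), k):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        model = fit_line([xs[i] for i in train], [ys[i] for i in train])
        y_true = [ys[i] for i in held_out]
        y_pred = [model(xs[i]) for i in held_out]
        r2s.append(r2_score(y_true, y_pred))
        # Accuracy: predicted and true values fall on the same side of the threshold.
        hits = sum((t > threshold) == (p > threshold)
                   for t, p in zip(y_true, y_pred))
        accs.append(hits / len(held_out))
    return sum(r2s) / k, sum(accs) / k


# Synthetic data for illustration only (not the Seattle dataset).
rng = random.Random(42)
xs = [rng.uniform(0, 100) for _ in range(200)]
ys = [2.0 * x + 10.0 + rng.gauss(0, 5) for x in xs]

mean_r2, mean_acc = cross_validate(xs, ys, k=5, threshold=150.0)
print(f"mean R^2 = {mean_r2:.3f}, mean threshold accuracy = {mean_acc:.3f}")
```

To mirror the paper's setup more closely, the stand-in `fit_line` would be replaced by a gradient-boosting regressor that handles categorical features, with the 12 input features taken from the actual dataset.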

Suggested Citation

  • Pan, Yue & Zhang, Limao, 2020. "Data-driven estimation of building energy consumption with multi-source heterogeneous data," Applied Energy, Elsevier, vol. 268(C).
  • Handle: RePEc:eee:appene:v:268:y:2020:i:c:s0306261920304773
    DOI: 10.1016/j.apenergy.2020.114965

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261920304773
    Download Restriction: Full text for ScienceDirect subscribers only



    Citations


    Cited by:

    1. Zhang, Yan & Teoh, Bak Koon & Wu, Maozhi & Chen, Jiayu & Zhang, Limao, 2023. "Data-driven estimation of building energy consumption and GHG emissions using explainable artificial intelligence," Energy, Elsevier, vol. 262(PA).
    2. Shen, Yuxuan & Pan, Yue, 2023. "BIM-supported automatic energy performance analysis for green building design using explainable machine learning and multi-objective optimization," Applied Energy, Elsevier, vol. 333(C).
    3. Yin, Sihua & Yang, Haidong & Xu, Kangkang & Zhu, Chengjiu & Zhang, Shaqing & Liu, Guosheng, 2022. "Dynamic real–time abnormal energy consumption detection and energy efficiency optimization analysis considering uncertainty," Applied Energy, Elsevier, vol. 307(C).
    4. Li, Renzheng & Hong, Jichao & Zhang, Huaqin & Chen, Xinbo, 2022. "Data-driven battery state of health estimation based on interval capacity for real-world electric vehicles," Energy, Elsevier, vol. 257(C).
    5. Varlamis, Iraklis & Sardianos, Christos & Chronis, Christos & Dimitrakopoulos, George & Himeur, Yassine & Alsalemi, Abdullah & Bensaali, Faycal & Amira, Abbes, 2022. "Smart fusion of sensor data and human feedback for personalized energy-saving recommendations," Applied Energy, Elsevier, vol. 305(C).
    6. Cai, Wei & Wen, Xiaodong & Li, Chaoen & Shao, Jingjing & Xu, Jianguo, 2023. "Predicting the energy consumption in buildings using the optimized support vector regression model," Energy, Elsevier, vol. 273(C).
    7. Rosenfelder, Markus & Wussow, Moritz & Gust, Gunther & Cremades, Roger & Neumann, Dirk, 2021. "Predicting residential electricity consumption using aerial and street view images," Applied Energy, Elsevier, vol. 301(C).
    8. Guo, Jing & Lin, Penghui & Zhang, Limao & Pan, Yue & Xiao, Zhonghua, 2023. "Dynamic adaptive encoder-decoder deep learning networks for multivariate time series forecasting of building energy consumption," Applied Energy, Elsevier, vol. 350(C).
    9. Sun, Jian & Liu, Gang & Sun, Boyang & Xiao, Gang, 2021. "Light-stacking strengthened fusion based building energy consumption prediction framework via variable weight feature selection," Applied Energy, Elsevier, vol. 303(C).
    10. Zhang, Xiang & Rasmussen, Christoffer & Saelens, Dirk & Roels, Staf, 2022. "Time-dependent solar aperture estimation of a building: Comparing grey-box and white-box approaches," Renewable and Sustainable Energy Reviews, Elsevier, vol. 161(C).
    11. Qu, Pengfei & Zhang, Limao & Zhu, Qizhi & Wu, Maozhi, 2023. "Probabilistic reliability assessment of twin tunnels considering fluid–solid coupling with physics-guided machine learning," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    12. Simon Wenninger & Christian Wiethe, 2021. "Benchmarking Energy Quantification Methods to Predict Heating Energy Performance of Residential Buildings in Germany," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 63(3), pages 223-242, June.
    13. Alexandru G. Berciu & Eva H. Dulf & Dan D. Micu, 2022. "Improving the Efficiency of Electricity Consumption by Applying Real-Time Fuzzy and Fractional Control," Mathematics, MDPI, vol. 10(20), pages 1-16, October.
    14. Razak Olu-Ajayi & Hafiz Alaka & Hakeem Owolabi & Lukman Akanbi & Sikiru Ganiyu, 2023. "Data-Driven Tools for Building Energy Consumption Prediction: A Review," Energies, MDPI, vol. 16(6), pages 1-20, March.
    15. Duan, Haiyan & Chen, Siyan & Song, Junnian, 2022. "Characterizing regional building energy consumption under joint climatic and socioeconomic impacts," Energy, Elsevier, vol. 245(C).
    16. Ye, Zhongnan & Cheng, Kuangly & Hsu, Shu-Chien & Wei, Hsi-Hsien & Cheung, Clara Man, 2021. "Identifying critical building-oriented features in city-block-level building energy consumption: A data-driven machine learning approach," Applied Energy, Elsevier, vol. 301(C).
    17. Yingyue Li & Hongjun Li & Rui Miao & He Qi & Yi Zhang, 2023. "Energy–Environment–Economy (3E) Analysis of the Performance of Introducing Photovoltaic and Energy Storage Systems into Residential Buildings: A Case Study in Shenzhen, China," Sustainability, MDPI, vol. 15(11), pages 1-25, June.
    18. Chunyan Wang & Hanying Jiang & Hao Wu & Yi Liu & Siyue Guo & Ming Xu, 2023. "Scaling in urban building energy use and its influencing factors," Journal of Industrial Ecology, Yale University, vol. 27(4), pages 1076-1088, August.
    19. Luka Djordjević & Jasmina Pekez & Borivoj Novaković & Mihalj Bakator & Mića Djurdjev & Dragan Ćoćkalo & Saša Jovanović, 2023. "Increasing Energy Efficiency of Buildings in Serbia—A Case of an Urban Neighborhood," Sustainability, MDPI, vol. 15(7), pages 1-20, April.
    20. Pan, Yue & Qin, Jianjun, 2022. "A novel probabilistic modeling framework for wind speed with highlight of extremes under data discrepancy and uncertainty," Applied Energy, Elsevier, vol. 326(C).
    21. Li, Qing & Zhang, Lianying & Zhang, Limao & Wu, Xianguo, 2021. "Optimizing energy efficiency and thermal comfort in building green retrofit," Energy, Elsevier, vol. 237(C).
    22. Jeeyoung Lim & Joseph J. Kim & Sunkuk Kim, 2021. "A Holistic Review of Building Energy Efficiency and Reduction Based on Big Data," Sustainability, MDPI, vol. 13(4), pages 1-18, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Venkatraj, V. & Dixit, M.K., 2022. "Challenges in implementing data-driven approaches for building life cycle energy assessment: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 160(C).
    2. Zhong, Hai & Wang, Jiajun & Jia, Hongjie & Mu, Yunfei & Lv, Shilei, 2019. "Vector field-based support vector regression for building energy consumption prediction," Applied Energy, Elsevier, vol. 242(C), pages 403-414.
    3. Tran, Duc-Hoc & Luong, Duc-Long & Chou, Jui-Sheng, 2020. "Nature-inspired metaheuristic ensemble model for forecasting energy consumption in residential buildings," Energy, Elsevier, vol. 191(C).
    4. Fan, Cheng & Xiao, Fu & Yan, Chengchu & Liu, Chengliang & Li, Zhengdao & Wang, Jiayuan, 2019. "A novel methodology to explain and evaluate data-driven building energy performance models based on interpretable machine learning," Applied Energy, Elsevier, vol. 235(C), pages 1551-1560.
    5. Fathi, Soheil & Srinivasan, Ravi & Fenner, Andriel & Fathi, Sahand, 2020. "Machine learning applications in urban building energy performance forecasting: A systematic review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 133(C).
    6. Jason Runge & Radu Zmeureanu, 2021. "A Review of Deep Learning Techniques for Forecasting Energy Use in Buildings," Energies, MDPI, vol. 14(3), pages 1-26, January.
    7. Wang, Zeyu & Liu, Jian & Zhang, Yuanxin & Yuan, Hongping & Zhang, Ruixue & Srinivasan, Ravi S., 2021. "Practical issues in implementing machine-learning models for building energy efficiency: Moving beyond obstacles," Renewable and Sustainable Energy Reviews, Elsevier, vol. 143(C).
    8. Li, Xinyi & Yao, Runming, 2020. "A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour," Energy, Elsevier, vol. 212(C).
    9. Ding, Zhikun & Chen, Weilin & Hu, Ting & Xu, Xiaoxiao, 2021. "Evolutionary double attention-based long short-term memory model for building energy prediction: Case study of a green building," Applied Energy, Elsevier, vol. 288(C).
    10. Fan, Cheng & Xiao, Fu & Song, Mengjie & Wang, Jiayuan, 2019. "A graph mining-based methodology for discovering and visualizing high-level knowledge for building energy management," Applied Energy, Elsevier, vol. 251(C), pages 1-1.
    11. Deb, Chirag & Dai, Zhonghao & Schlueter, Arno, 2021. "A machine learning-based framework for cost-optimal building retrofit," Applied Energy, Elsevier, vol. 294(C).
    12. Jason Runge & Radu Zmeureanu, 2019. "Forecasting Energy Use in Buildings Using Artificial Neural Networks: A Review," Energies, MDPI, vol. 12(17), pages 1-27, August.
    13. Ahmed Gassar, Abdo Abdullah & Yun, Geun Young & Kim, Sumin, 2019. "Data-driven approach to prediction of residential energy consumption at urban scales in London," Energy, Elsevier, vol. 187(C).
    14. Fan, Cheng & Sun, Yongjun & Xiao, Fu & Ma, Jie & Lee, Dasheng & Wang, Jiayuan & Tseng, Yen Chieh, 2020. "Statistical investigations of transfer learning-based methodology for short-term building energy predictions," Applied Energy, Elsevier, vol. 262(C).
    15. Chou, Jui-Sheng & Tran, Duc-Son, 2018. "Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders," Energy, Elsevier, vol. 165(PB), pages 709-726.
    16. Ahmad, Tanveer & Chen, Huanxin, 2018. "Potential of three variant machine-learning models for forecasting district level medium-term and long-term energy demand in smart grid environment," Energy, Elsevier, vol. 160(C), pages 1008-1020.
    17. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    18. Ye, Zhongnan & Cheng, Kuangly & Hsu, Shu-Chien & Wei, Hsi-Hsien & Cheung, Clara Man, 2021. "Identifying critical building-oriented features in city-block-level building energy consumption: A data-driven machine learning approach," Applied Energy, Elsevier, vol. 301(C).
    19. Wang, Ran & Lu, Shilei & Feng, Wei, 2020. "A novel improved model for building energy consumption prediction based on model integration," Applied Energy, Elsevier, vol. 262(C).
    20. Thomas Wu & Bo Wang & Dongdong Zhang & Ziwei Zhao & Hongyu Zhu, 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining," Sustainability, MDPI, vol. 15(6), pages 1-16, March.
