IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v239y2022ipds0360544221025214.html
   My bibliography  Save this article

Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods

Author

Listed:
  • Sun, Ying
  • Haghighat, Fariborz
  • Fung, Benjamin C.M.

Abstract

Data-driven models have drawn extensive attention in the building domain in recent years, and their predictive accuracy depends on features or data distribution. Accuracy variation among users or periods creates a certain unfairness to some users. This paper addresses a new research problem called fairness-aware prediction of data-driven building and indoor environment models. First, three types of fairness definitions are introduced in building engineering. Next, Type I and Type II fairness are investigated. To achieve fairness Type I, we study the effect of suppressing the protected attribute (i.e., attribute whose value cannot be disclosed or be discriminated against) from inputs. To improve fairness Type II while preserving the predictive accuracy of data-driven building and indoor environment models, we propose three pre-processing methods for training dataset—sequential sampling, reversed preferential sampling, and sequential preferential sampling. The proposed methods are compared to two existing pre-processing methods in a case study for lighting status prediction in an apartment building. Overall, 576 study cases were used to study the effect of these pre-processing methods on the accuracy and fairness of 12 series of lighting status prediction based on 2 types of feature combinations and 4 types of classifiers. Predictive results show that suppressing the protected attribute slightly influences overall predictive accuracy, while all pre-processing methods decrease it. However, in general, sequential sampling would be a good option for improving fairness Type II with an acceptable accuracy decrease. Fairness improvement performance of other pre-processing methods varies depending on applied features and classifiers.

Suggested Citation

  • Sun, Ying & Haghighat, Fariborz & Fung, Benjamin C.M., 2022. "Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods," Energy, Elsevier, vol. 239(PD).
  • Handle: RePEc:eee:energy:v:239:y:2022:i:pd:s0360544221025214
    DOI: 10.1016/j.energy.2021.122273
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544221025214
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2021.122273?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Chaobo & Li, Junyang & Zhao, Yang & Li, Tingting & Chen, Qi & Zhang, Xuejun & Qiu, Weikang, 2021. "Problem of data imbalance in building energy load prediction: Concept, influence, and solution," Applied Energy, Elsevier, vol. 297(C).
    2. Reynolds, Jonathan & Rezgui, Yacine & Kwan, Alan & Piriou, Solène, 2018. "A zone-level, building energy optimisation combining an artificial neural network, a genetic algorithm, and model predictive control," Energy, Elsevier, vol. 151(C), pages 729-739.
    3. Ascione, Fabrizio & Bianco, Nicola & De Stasio, Claudio & Mauro, Gerardo Maria & Vanoli, Giuseppe Peter, 2017. "Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: A novel approach," Energy, Elsevier, vol. 118(C), pages 999-1017.
    4. Peng, Yuzhen & Rysanek, Adam & Nagy, Zoltán & Schlüter, Arno, 2018. "Using machine learning techniques for occupancy-prediction-based cooling control in office buildings," Applied Energy, Elsevier, vol. 211(C), pages 1343-1358.
    5. Amasyali, Kadir & El-Gohary, Nora M., 2018. "A review of data-driven building energy consumption prediction studies," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1192-1205.
    6. Tabares-Velasco, Paulo Cesar & Speake, Andrew & Harris, Maxwell & Newman, Alexandra & Vincent, Tyrone & Lanahan, Michael, 2019. "A modeling framework for optimization-based control of a residential building thermostat for time-of-use pricing," Applied Energy, Elsevier, vol. 242(C), pages 1346-1357.
    7. Naji, Sareh & Keivani, Afram & Shamshirband, Shahaboddin & Alengaram, U. Johnson & Jumaat, Mohd Zamin & Mansor, Zulkefli & Lee, Malrey, 2016. "Estimating building energy consumption using extreme learning machine method," Energy, Elsevier, vol. 97(C), pages 506-516.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Zhengguang & Guo, Zhiling & Chen, Qi & Song, Chenchen & Shang, Wenlong & Yuan, Meng & Zhang, Haoran, 2023. "A review of data-driven smart building-integrated photovoltaic systems: Challenges and objectives," Energy, Elsevier, vol. 263(PE).
    2. Li, Guannan & Li, Fan & Ahmad, Tanveer & Liu, Jiangyan & Li, Tao & Fang, Xi & Wu, Yubei, 2022. "Performance evaluation of sequence-to-sequence-Attention model for short-term multi-step ahead building energy predictions," Energy, Elsevier, vol. 259(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Zeyu & Liu, Jian & Zhang, Yuanxin & Yuan, Hongping & Zhang, Ruixue & Srinivasan, Ravi S., 2021. "Practical issues in implementing machine-learning models for building energy efficiency: Moving beyond obstacles," Renewable and Sustainable Energy Reviews, Elsevier, vol. 143(C).
    2. Abhinandana Boodi & Karim Beddiar & Malek Benamour & Yassine Amirat & Mohamed Benbouzid, 2018. "Intelligent Systems for Building Energy and Occupant Comfort Optimization: A State of the Art Review and Recommendations," Energies, MDPI, vol. 11(10), pages 1-26, September.
    3. Zhang, Liang & Wen, Jin & Li, Yanfei & Chen, Jianli & Ye, Yunyang & Fu, Yangyang & Livingood, William, 2021. "A review of machine learning in building load prediction," Applied Energy, Elsevier, vol. 285(C).
    4. Kathirgamanathan, Anjukan & De Rosa, Mattia & Mangina, Eleni & Finn, Donal P., 2021. "Data-driven predictive control for unlocking building energy flexibility: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 135(C).
    5. Meng Wang & Junqi Yu & Meng Zhou & Wei Quan & Renyin Cheng, 2023. "Joint Forecasting Model for the Hourly Cooling Load and Fluctuation Range of a Large Public Building Based on GA-SVM and IG-SVM," Sustainability, MDPI, vol. 15(24), pages 1-23, December.
    6. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    7. Tsoumalis, Georgios I. & Bampos, Zafeirios N. & Chatzis, Georgios V. & Biskas, Pandelis N. & Keranidis, Stratos D., 2021. "Minimization of natural gas consumption of domestic boilers with convolutional, long-short term memory neural networks and genetic algorithm," Applied Energy, Elsevier, vol. 299(C).
    8. Wei, Yixuan & Xia, Liang & Pan, Song & Wu, Jinshun & Zhang, Xingxing & Han, Mengjie & Zhang, Weiya & Xie, Jingchao & Li, Qingping, 2019. "Prediction of occupancy level and energy consumption in office building using blind system identification and neural networks," Applied Energy, Elsevier, vol. 240(C), pages 276-294.
    9. Tien, Paige Wenbin & Wei, Shuangyu & Calautit, John Kaiser & Darkwa, Jo & Wood, Christopher, 2022. "Real-time monitoring of occupancy activities and window opening within buildings using an integrated deep learning-based approach for reducing energy demand," Applied Energy, Elsevier, vol. 308(C).
    10. Xu, Bin & Cheng, Yuan-xia & Chen, Xing-ni & Xie, Xing & Ji, Jie & Jiao, Dong-sheng, 2023. "Error correction method for heat flux and a new algorithm employed in inverting wall thermal resistance using an artificial neural network: Based on IN-SITU heat flux measurements," Energy, Elsevier, vol. 282(C).
    11. Amasyali, Kadir & El-Gohary, Nora M., 2021. "Real data-driven occupant-behavior optimization for reduced energy consumption and improved comfort," Applied Energy, Elsevier, vol. 302(C).
    12. Thomas Wu & Bo Wang & Dongdong Zhang & Ziwei Zhao & Hongyu Zhu, 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining," Sustainability, MDPI, vol. 15(6), pages 1-16, March.
    13. Hany Habbak & Mohamed Mahmoud & Khaled Metwally & Mostafa M. Fouda & Mohamed I. Ibrahem, 2023. "Load Forecasting Techniques and Their Applications in Smart Grids," Energies, MDPI, vol. 16(3), pages 1-33, February.
    14. Fateme Dinmohammadi & Yuxuan Han & Mahmood Shafiee, 2023. "Predicting Energy Consumption in Residential Buildings Using Advanced Machine Learning Algorithms," Energies, MDPI, vol. 16(9), pages 1-23, April.
    15. Md Mijanur Rahman & Mohammad Shakeri & Sieh Kiong Tiong & Fatema Khatun & Nowshad Amin & Jagadeesh Pasupuleti & Mohammad Kamrul Hasan, 2021. "Prospective Methodologies in Hybrid Renewable Energy Systems for Energy Prediction Using Artificial Neural Networks," Sustainability, MDPI, vol. 13(4), pages 1-28, February.
    16. Liang, Xinbin & Chen, Siliang & Zhu, Xu & Jin, Xinqiao & Du, Zhimin, 2023. "Domain knowledge decomposition of building energy consumption and a hybrid data-driven model for 24-h ahead predictions," Applied Energy, Elsevier, vol. 344(C).
    17. Zhang, Yan & Teoh, Bak Koon & Wu, Maozhi & Chen, Jiayu & Zhang, Limao, 2023. "Data-driven estimation of building energy consumption and GHG emissions using explainable artificial intelligence," Energy, Elsevier, vol. 262(PA).
    18. Kapp, Sean & Choi, Jun-Ki & Hong, Taehoon, 2023. "Predicting industrial building energy consumption with statistical and machine-learning models informed by physical system parameters," Renewable and Sustainable Energy Reviews, Elsevier, vol. 172(C).
    19. Venkatraj, V. & Dixit, M.K., 2022. "Challenges in implementing data-driven approaches for building life cycle energy assessment: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 160(C).
    20. Niemierko, Rochus & Töppel, Jannick & Tränkler, Timm, 2019. "A D-vine copula quantile regression approach for the prediction of residential heating energy consumption based on historical data," Applied Energy, Elsevier, vol. 233, pages 691-708.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:239:y:2022:i:pd:s0360544221025214. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.