
Identifying key variables and interactions in statistical models of building energy consumption using regularization

Author

Listed:
  • Hsu, David

Abstract

Statistical models can only be as good as the data put into them. Data about energy consumption continues to grow, particularly its non-technical aspects, but these variables are often interpreted differently among disciplines, datasets, and contexts. Selecting key variables and interactions is therefore an important step in achieving more accurate predictions, better interpretation, and identification of key subgroups for further analysis.

Suggested Citation

  • Hsu, David, 2015. "Identifying key variables and interactions in statistical models of building energy consumption using regularization," Energy, Elsevier, vol. 83(C), pages 144-155.
  • Handle: RePEc:eee:energy:v:83:y:2015:i:c:p:144-155
    DOI: 10.1016/j.energy.2015.02.008

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544215001590
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Zhao, Hai-xiang & Magoulès, Frédéric, 2012. "A review on the prediction of building energy consumption," Renewable and Sustainable Energy Reviews, Elsevier, vol. 16(6), pages 3586-3592.
    2. Nicolai Meinshausen & Peter Bühlmann, 2010. "Stability selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(4), pages 417-473, September.
    3. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    4. Jain, Rishee K. & Smith, Kevin M. & Culligan, Patricia J. & Taylor, John E., 2014. "Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy," Applied Energy, Elsevier, vol. 123(C), pages 168-178.
    5. Foucquier, Aurélie & Robert, Sylvain & Suard, Frédéric & Stéphan, Louis & Jay, Arnaud, 2013. "State of the art in building modelling and energy performances prediction: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 23(C), pages 272-288.
    6. Robert Tibshirani, 2011. "Regression shrinkage and selection via the lasso: a retrospective," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 273-282, June.
    7. Swan, Lukas G. & Ugursal, V. Ismet, 2009. "Modeling of end-use energy consumption in the residential sector: A review of modeling techniques," Renewable and Sustainable Energy Reviews, Elsevier, vol. 13(8), pages 1819-1835, October.
    8. Yang, Liu & Yan, Haiyan & Lam, Joseph C., 2014. "Thermal comfort and building energy consumption implications – A review," Applied Energy, Elsevier, vol. 115(C), pages 164-173.
    9. Manfren, Massimiliano & Aste, Niccolò & Moshksar, Reza, 2013. "Calibration and uncertainty analysis for computer models – A meta-model based approach for integrated building energy simulation," Applied Energy, Elsevier, vol. 103(C), pages 627-641.
    10. Chung, William, 2011. "Review of building energy-use performance benchmarking methodologies," Applied Energy, Elsevier, vol. 88(5), pages 1470-1479, May.
    11. Fan, Cheng & Xiao, Fu & Wang, Shengwei, 2014. "Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques," Applied Energy, Elsevier, vol. 127(C), pages 1-10.
    12. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    13. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    14. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.

    Citations

    Cited by:

    1. Namazkhan, Maliheh & Albers, Casper & Steg, Linda, 2020. "A decision tree method for explaining household gas consumption: The role of building characteristics, socio-demographic variables, psychological factors and household behaviour," Renewable and Sustainable Energy Reviews, Elsevier, vol. 119(C).
    2. Satre-Meloy, Aven, 2019. "Investigating structural and occupant drivers of annual residential electricity consumption using regularization in regression models," Energy, Elsevier, vol. 174(C), pages 148-168.
    3. Milan Straka & Rui Carvalho & Gijs van der Poel & Ľuboš Buzna, 2020. "Explaining the distribution of energy consumption at slow charging infrastructure for electric vehicles from socio-economic data," Papers 2006.01672, arXiv.org, revised Jun 2020.
    4. Shi, Xunpeng & Wang, Keying & Cheong, Tsun Se & Zhang, Hongwu, 2020. "Prioritizing driving factors of household carbon emissions: An application of the LASSO model with survey data," Energy Economics, Elsevier, vol. 92(C).
    5. Ali Movahedi & Sybil Derrible, 2021. "Interrelationships between electricity, gas, and water consumption in large‐scale buildings," Journal of Industrial Ecology, Yale University, vol. 25(4), pages 932-947, August.
    6. Ma, Jun & Cheng, Jack C.P., 2016. "Identifying the influential features on the regional energy use intensity of residential buildings based on Random Forests," Applied Energy, Elsevier, vol. 183(C), pages 193-201.
    7. Walter, Travis & Sohn, Michael D., 2016. "A regression-based approach to estimating retrofit savings using the Building Performance Database," Applied Energy, Elsevier, vol. 179(C), pages 996-1005.
    8. Sen, Parag & Roy, Mousumi & Pal, Parimal, 2016. "Application of ARIMA for forecasting energy consumption and GHG emission: A case study of an Indian pig iron manufacturing organization," Energy, Elsevier, vol. 116(P1), pages 1031-1038.
    9. Hsu, David, 2015. "Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data," Applied Energy, Elsevier, vol. 160(C), pages 153-163.
    10. Wang, Endong & Alp, Neslihan & Shi, Jonathan & Wang, Chao & Zhang, Xiaodong & Chen, Hong, 2017. "Multi-criteria building energy performance benchmarking through variable clustering based compromise TOPSIS with objective entropy weighting," Energy, Elsevier, vol. 125(C), pages 197-210.
    11. Roth, Jonathan & Lim, Benjamin & Jain, Rishee K. & Grueneich, Dian, 2020. "Examining the feasibility of using open data to benchmark building energy usage in cities: A data science and policy perspective," Energy Policy, Elsevier, vol. 139(C).
    12. Liu, Xue & Ding, Yong & Tang, Hao & Fan, Lingxiao & Lv, Jie, 2022. "Investigating the effects of key drivers on energy consumption of nonresidential buildings: A data-driven approach integrating regularization and quantile regression," Energy, Elsevier, vol. 244(PA).
    13. Anca Mehedintu & Mihaela Sterpu & Georgeta Soava, 2018. "Estimation and Forecasts for the Share of Renewable Energy Consumption in Final Energy Consumption by 2020 in the European Union," Sustainability, MDPI, vol. 10(5), pages 1-22, May.
    14. Ma, Jun & Cheng, Jack C.P., 2016. "Estimation of the building energy use intensity in the urban scale by integrating GIS and big data technology," Applied Energy, Elsevier, vol. 183(C), pages 182-192.
    15. Verstraete, Gylian & Aghezzaf, El-Houssaine & Desmet, Bram, 2019. "A data-driven framework for predicting weather impact on high-volume low-margin retail products," Journal of Retailing and Consumer Services, Elsevier, vol. 48(C), pages 169-177.
    16. Petri Hietaharju & Mika Ruusunen & Kauko Leiviskä, 2018. "A Dynamic Model for Indoor Temperature Prediction in Buildings," Energies, MDPI, vol. 11(6), pages 1-20, June.
    17. Wang, Endong, 2017. "Decomposing core energy factor structure of U.S. residential buildings through principal component analysis with variable clustering on high-dimensional mixed data," Applied Energy, Elsevier, vol. 203(C), pages 858-873.
    18. Jufri, Fauzan Hanif & Oh, Seongmun & Jung, Jaesung, 2019. "Development of Photovoltaic abnormal condition detection system using combined regression and Support Vector Machine," Energy, Elsevier, vol. 176(C), pages 457-467.
    19. Toroghi, Shahaboddin H. & Oliver, Matthew E., 2019. "Framework for estimation of the direct rebound effect for residential photovoltaic systems," Applied Energy, Elsevier, vol. 251(C), pages 1-1.
    20. Thomas Wu & Bo Wang & Dongdong Zhang & Ziwei Zhao & Hongyu Zhu, 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining," Sustainability, MDPI, vol. 15(6), pages 1-16, March.
    21. Abbasabadi, Narjes & Ashayeri, Mehdi & Azari, Rahman & Stephens, Brent & Heidarinejad, Mohammad, 2019. "An integrated data-driven framework for urban energy use modeling (UEUM)," Applied Energy, Elsevier, vol. 253(C), pages 1-1.
    22. Bordbari, Mohammad Javad & Seifi, Ali Reza & Rastegar, Mohammad, 2018. "Probabilistic energy consumption analysis in buildings using point estimate method," Energy, Elsevier, vol. 142(C), pages 716-722.
    23. Papadopoulos, Sokratis & Kontokosta, Constantine E., 2019. "Grading buildings on energy performance using city benchmarking data," Applied Energy, Elsevier, vol. 233, pages 244-253.
    24. Lawal, Abiola S. & Servadio, Joseph L. & Davis, Tate & Ramaswami, Anu & Botchwey, Nisha & Russell, Armistead G., 2021. "Orthogonalization and machine learning methods for residential energy estimation with social and economic indicators," Applied Energy, Elsevier, vol. 283(C).
    25. Silva, Mafalda C. & Horta, Isabel M. & Leal, Vítor & Oliveira, Vítor, 2017. "A spatially-explicit methodological framework based on neural networks to assess the effect of urban form on energy demand," Applied Energy, Elsevier, vol. 202(C), pages 386-398.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Satre-Meloy, Aven, 2019. "Investigating structural and occupant drivers of annual residential electricity consumption using regularization in regression models," Energy, Elsevier, vol. 174(C), pages 148-168.
    2. Capanu, Marinela & Giurcanu, Mihai & Begg, Colin B. & Gönen, Mithat, 2023. "Subsampling based variable selection for generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 184(C).
    3. Tanin Sirimongkolkasem & Reza Drikvandi, 2019. "On Regularisation Methods for Analysis of High Dimensional Data," Annals of Data Science, Springer, vol. 6(4), pages 737-763, December.
    4. Satre-Meloy, Aven & Diakonova, Marina & Grünewald, Philipp, 2020. "Cluster analysis and prediction of residential peak demand profiles using occupant activity data," Applied Energy, Elsevier, vol. 260(C).
    5. van Erp, Sara & Oberski, Daniel L. & Mulder, Joris, 2018. "Shrinkage priors for Bayesian penalized regression," OSF Preprints cg8fq, Center for Open Science.
    6. Laura Freijeiro‐González & Manuel Febrero‐Bande & Wenceslao González‐Manteiga, 2022. "A Critical Review of LASSO and Its Derivatives for Variable Selection Under Dependence Among Covariates," International Statistical Review, International Statistical Institute, vol. 90(1), pages 118-145, April.
    7. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    8. Camila Epprecht & Dominique Guegan & Álvaro Veiga & Joel Correa da Rosa, 2017. "Variable selection and forecasting via automated methods for linear models: LASSO/adaLASSO and Autometrics," Post-Print halshs-00917797, HAL.
    9. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Risks, MDPI, vol. 6(2), pages 1-20, April.
    10. Tomáš Plíhal, 2021. "Scheduled macroeconomic news announcements and Forex volatility forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(8), pages 1379-1397, December.
    11. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    12. Murat Genç & M. Revan Özkale, 2021. "Usage of the GO estimator in high dimensional linear models," Computational Statistics, Springer, vol. 36(1), pages 217-239, March.
    13. Zeng, Yaohui & Yang, Tianbao & Breheny, Patrick, 2021. "Hybrid safe–strong rules for efficient optimization in lasso-type problems," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    14. Yoshiki Nakajima & Naoya Sueishi, 2022. "Forecasting the Japanese macroeconomy using high-dimensional data," The Japanese Economic Review, Springer, vol. 73(2), pages 299-324, April.
    15. Zanhua Yin, 2020. "Variable selection for sparse logistic regression," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 83(7), pages 821-836, October.
    16. Wei Tang & Steven L Bressler & Chad M Sylvester & Gordon L Shulman & Maurizio Corbetta, 2012. "Measuring Granger Causality between Cortical Regions from Voxelwise fMRI BOLD Signals with LASSO," PLOS Computational Biology, Public Library of Science, vol. 8(5), pages 1-14, May.
    17. Nicholson, William B. & Matteson, David S. & Bien, Jacob, 2017. "VARX-L: Structured regularization for large vector autoregressions with exogenous variables," International Journal of Forecasting, Elsevier, vol. 33(3), pages 627-651.
    18. Dmitry Kobak & Yves Bernaerts & Marissa A. Weis & Federico Scala & Andreas S. Tolias & Philipp Berens, 2021. "Sparse reduced‐rank regression for exploratory visualisation of paired multivariate data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 980-1000, August.
    19. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    20. Soyeon Kim & Veerabhadran Baladandayuthapani & J. Jack Lee, 2017. "Prediction-Oriented Marker Selection (PROMISE): With Application to High-Dimensional Regression," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 217-245, June.
