IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v64y2015i1p139-155.html
   My bibliography  Save this article

Generalized additive models for large data sets

Author

Listed:
  • Simon N. Wood
  • Yannig Goude
  • Simon Shaw

Abstract

type="main" xml:id="rssc12068-abs-0001"> We consider an application in electricity grid load prediction, where generalized additive models are appropriate, but where the data set's size can make their use practically intractable with existing methods. We therefore develop practical generalized additive model fitting methods for large data sets in the case in which the smooth terms in the model are represented by using penalized regression splines. The methods use iterative update schemes to obtain factors of the model matrix while requiring only subblocks of the model matrix to be computed at any one time. We show that efficient smoothing parameter estimation can be carried out in a well-justified manner. The grid load prediction problem requires updates of the model fit, as new data become available, and some means for dealing with residual auto-correlation in grid load. Methods are provided for these problems and parallel implementation is covered. The methods allow estimation of generalized additive models for large data sets by using modest computer hardware, and the grid load prediction problem illustrates the utility of reduced rank spline smoothing methods for dealing with complex modelling problems.

Suggested Citation

  • Simon N. Wood & Yannig Goude & Simon Shaw, 2015. "Generalized additive models for large data sets," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 64(1), pages 139-155, January.
  • Handle: RePEc:bla:jorssc:v:64:y:2015:i:1:p:139-155
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1111/rssc.2014.64.issue-1
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shuangzhou Chen & Zhikang Bao & Junjie Chen & Linchuan Yang & Vivian Lou, 2022. "Sustainable built environment for facilitating public health of older adults: Evidence from Hong Kong," Sustainable Development, John Wiley & Sons, Ltd., vol. 30(5), pages 1086-1098, October.
    2. Ali M. Mosammam & Jorge Mateu, 2018. "A penalized likelihood method for nonseparable space–time generalized additive models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(3), pages 333-357, July.
    3. Caston Sigauke & Murendeni Maurel Nemukula & Daniel Maposa, 2018. "Probabilistic Hourly Load Forecasting Using Additive Quantile Regression Models," Energies, MDPI, vol. 11(9), pages 1-21, August.
    4. Michael Lebacher & Paul W. Thurner & Göran Kauermann, 2021. "A dynamic separable network model with actor heterogeneity: An application to global weapons transfers," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 201-226, January.
    5. Mills, Brian M. & Salaga, Steven, 2018. "A natural experiment for efficient markets: Information quality and influential agents," Journal of Financial Markets, Elsevier, vol. 40(C), pages 23-39.
    6. Anne-Sophie Krah & Zoran Nikolić & Ralf Korn, 2020. "Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies," Risks, MDPI, vol. 8(1), pages 1-79, February.
    7. Zanin, Luca, 2020. "Combining multiple probability predictions in the presence of class imbalance to discriminate between potential bad and good borrowers in the peer-to-peer lending market," Journal of Behavioral and Experimental Finance, Elsevier, vol. 25(C).
    8. Salahuddin Khan, 2023. "Short-Term Electricity Load Forecasting Using a New Intelligence-Based Application," Sustainability, MDPI, vol. 15(16), pages 1-12, August.
    9. Alla A. Petukhina & Raphael C. G. Reule & Wolfgang Karl Härdle, 2021. "Rise of the machines? Intraday high-frequency trading patterns of cryptocurrencies," The European Journal of Finance, Taylor & Francis Journals, vol. 27(1-2), pages 8-30, January.
    10. Shao, Zhen & Chao, Fu & Yang, Shan-Lin & Zhou, Kai-Le, 2017. "A review of the decomposition methodology for extracting and identifying the fluctuation characteristics in electricity demand forecasting," Renewable and Sustainable Energy Reviews, Elsevier, vol. 75(C), pages 123-136.
    11. Veronica Kostenko & Eduard Ponarin & Musa Shteiwi & Olga Strebkova, 2017. "Historical Legacies and Gender Attitudes in the Middle East," Working Papers 1105, Economic Research Forum, revised 05 2017.
    12. Cui, Wenquan & Cheng, Haoyang & Sun, Jiajing, 2018. "An RKHS-based approach to double-penalized regression in high-dimensional partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 201-210.
    13. Schmidt, Paul & Mühlau, Mark & Schmid, Volker, 2017. "Fitting large-scale structured additive regression models using Krylov subspace methods," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 59-75.
    14. Lingqi Li & Kai Wu & Enhui Jiang & Huijuan Yin & Yuanjian Wang & Shimin Tian & Suzhen Dang, 2021. "Evaluating Runoff-Sediment Relationship Variations Using Generalized Additive Models That Incorporate Reservoir Indices for Check Dams," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(11), pages 3845-3860, September.
    15. Sviták, Jan & Tichem, Jan & Haasbeek, Stefan, 2021. "Price effects of search advertising restrictions," International Journal of Industrial Organization, Elsevier, vol. 77(C).
    16. Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Identifying hidden patterns in credit risk survival data using Generalised Additive Models," European Journal of Operational Research, Elsevier, vol. 277(1), pages 366-376.
    17. Anne-Sophie Krah & Zoran Nikoli'c & Ralf Korn, 2019. "Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies," Papers 1909.02182, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:64:y:2015:i:1:p:139-155. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.