IDEAS home Printed from https://ideas.repec.org/a/wly/envmet/v35y2024i6ne2871.html

EM algorithm for generalized Ridge regression with spatial covariates

Author

Listed:
  • Said Obakrim
  • Pierre Ailliot
  • Valérie Monbet
  • Nicolas Raillard

Abstract

The generalized Ridge penalty is a powerful tool for dealing with multicollinearity and high‐dimensionality in regression problems. The generalized Ridge regression can be derived as the mean of a posterior distribution with a Normal prior and a given covariance matrix. The covariance matrix controls the structure of the coefficients, which depends on the particular application. For example, it is appropriate to assume that the coefficients have a spatial structure when the covariates are spatially correlated. This study proposes an Expectation‐Maximization algorithm for estimating generalized Ridge parameters whose covariance structure depends on specific parameters. We focus on three cases: diagonal (when the covariance matrix is diagonal with constant elements), Matérn, and conditional autoregressive covariances. A simulation study is conducted to evaluate the performance of the proposed method, and then the method is applied to predict ocean wave heights using wind conditions.

Suggested Citation

  • Said Obakrim & Pierre Ailliot & Valérie Monbet & Nicolas Raillard, 2024. "EM algorithm for generalized Ridge regression with spatial covariates," Environmetrics, John Wiley & Sons, Ltd., vol. 35(6), September.
  • Handle: RePEc:wly:envmet:v:35:y:2024:i:6:n:e2871
    DOI: 10.1002/env.2871
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/env.2871
    Download Restriction: no

    File URL: https://libkey.io/10.1002/env.2871?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robert Tibshirani & Michael Saunders & Saharon Rosset & Ji Zhu & Keith Knight, 2005. "Sparsity and smoothness via the fused lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(1), pages 91-108, February.
    2. C. G. Kaufman & B. A. Shaby, 2013. "The role of the range parameter for estimation and prediction in geostatistics," Biometrika, Biometrika Trust, vol. 100(2), pages 473-484.
    3. Matthew J. Heaton & Abhirup Datta & Andrew O. Finley & Reinhard Furrer & Joseph Guinness & Rajarshi Guhaniyogi & Florian Gerber & Robert B. Gramacy & Dorit Hammerling & Matthias Katzfuss & Finn Lindgr, 2019. "A Case Study Competition Among Methods for Analyzing Large Spatial Data," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 398-425, September.
    4. Noel Cressie & Gardar Johannesson, 2008. "Fixed rank kriging for very large spatial data sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 209-226, February.
    5. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    6. Julian Besag & Jeremy York & Annie Mollié, 1991. "Bayesian image restoration, with two applications in spatial statistics," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 43(1), pages 1-20, March.
    7. Hååvard Rue & Hååkon Tjelmeland, 2002. "Fitting Gaussian Markov Random Fields to Gaussian Fields," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 29(1), pages 31-49, March.
    8. Jiaxuan Liang & Yi Cheng & Yuqi Su & Shuyue Xiao & Yunquan Song, 2022. "Variable Selection for Spatial Logistic Autoregressive Models," Mathematics, MDPI, vol. 10(17), pages 1-16, August.
    9. Meyer, Hanna & Reudenbach, Christoph & Wöllauer, Stephan & Nauss, Thomas, 2019. "Importance of spatial predictor variable selection in machine learning applications – Moving from data reproduction to spatial prediction," Ecological Modelling, Elsevier, vol. 411(C).
    10. Andrews, Jeffrey L., 2018. "Addressing overfitting and underfitting in Gaussian model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 160-171.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Morales-Oñate, Víctor & Crudu, Federico & Bevilacqua, Moreno, 2021. "Blockwise Euclidean likelihood for spatio-temporal covariance models," Econometrics and Statistics, Elsevier, vol. 20(C), pages 176-201.
    2. Eric W Fox & Jay M Ver Hoef & Anthony R Olsen, 2020. "Comparing spatial regression to random forests for large environmental data sets," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-22, March.
    3. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    4. Sandy Burden & Noel Cressie & David G. Steel, 2015. "The SAR Model for Very Large Datasets: A Reduced Rank Approach," Econometrics, MDPI, vol. 3(2), pages 1-22, May.
    5. Ryan J. Parker & Brian J. Reich & Jo Eidsvik, 2016. "A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(3), pages 569-587, September.
    6. Hosik Choi & Eunjung Song & Seung-sik Hwang & Woojoo Lee, 2018. "A modified generalized lasso algorithm to detect local spatial clusters for count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(4), pages 537-563, October.
    7. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    8. Mkhadri, Abdallah & Ouhourane, Mohamed, 2013. "An extended variable inclusion and shrinkage algorithm for correlated variables," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 631-644.
    9. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    10. Lu Tang & Ling Zhou & Peter X. K. Song, 2019. "Fusion learning algorithm to combine partially heterogeneous Cox models," Computational Statistics, Springer, vol. 34(1), pages 395-414, March.
    11. Young‐Geun Choi & Lawrence P. Hanrahan & Derek Norton & Ying‐Qi Zhao, 2022. "Simultaneous spatial smoothing and outlier detection using penalized regression, with application to childhood obesity surveillance from electronic health records," Biometrics, The International Biometric Society, vol. 78(1), pages 324-336, March.
    12. Tomáš Plíhal, 2021. "Scheduled macroeconomic news announcements and Forex volatility forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(8), pages 1379-1397, December.
    13. Caamaño-Carrillo, Christian & Bevilacqua, Moreno & López, Cristian & Morales-Oñate, Víctor, 2024. "Nearest neighbors weighted composite likelihood based on pairs for (non-)Gaussian massive spatial data with an application to Tukey-hh random fields estimation," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    14. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    15. Murat Genç & M. Revan Özkale, 2021. "Usage of the GO estimator in high dimensional linear models," Computational Statistics, Springer, vol. 36(1), pages 217-239, March.
    16. Isabelle Grenier & Bruno Sansó & Jessica L. Matthews, 2024. "Multivariate nearest‐neighbors Gaussian processes with random covariance matrices," Environmetrics, John Wiley & Sons, Ltd., vol. 35(3), May.
    17. Sierra Pugh & Matthew J. Heaton & Jeff Svedin & Neil Hansen, 2019. "Spatiotemporal Lagged Models for Variable Rate Irrigation in Agriculture," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(4), pages 634-650, December.
    18. Zammit-Mangion, Andrew & Rougier, Jonathan, 2018. "A sparse linear algebra algorithm for fast computation of prediction variances with Gaussian Markov random fields," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 116-130.
    19. Camehl, Annika, 2023. "Penalized estimation of panel vector autoregressive models: A panel LASSO approach," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1185-1204.
    20. Jingning Zhang & Jianan Zhan & Jin Jin & Cheng Ma & Ruzhang Zhao & Jared O’Connell & Yunxuan Jiang & Bertram L. Koelsch & Haoyu Zhang & Nilanjan Chatterjee, 2024. "An ensemble penalized regression method for multi-ancestry polygenic risk prediction," Nature Communications, Nature, vol. 15(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:envmet:v:35:y:2024:i:6:n:e2871. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1180-4009/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.