IDEAS home Printed from https://ideas.repec.org/a/wly/envmet/v33y2022i6ne2742.html
   My bibliography  Save this article

Practical strategies for generalized extreme value‐based regression models for extremes

Author

Listed:
  • Daniela Castro‐Camilo
  • Raphaël Huser
  • Håvard Rue

Abstract

The generalized extreme value (GEV) distribution is the only possible limiting distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. As such, it has been widely applied to approximate the distribution of maxima over blocks. In these applications, GEV properties such as finite lower endpoint when the shape parameter ξ$$ \xi $$ is positive or the loss of moments due to the magnitude of ξ$$ \xi $$ are inherited by the finite‐sample maxima distribution. The extent to which these properties are realistic for the data at hand has been widely ignored. Motivated by these overlooked consequences in a regression setting, we here make three contributions. First, we propose a blended GEV (bGEV) distribution, which smoothly combines the left tail of a Gumbel distribution (GEV with ξ=0$$ \xi =0 $$) with the right tail of a Fréchet distribution (GEV with ξ>0$$ \xi >0 $$). Our resulting distribution has, therefore, unbounded support. Second, we proposed a principled method called property‐preserving penalized complexity (P3$$ {}^3 $$C) prior to decide on the existence of the GEV distribution first and second moments a priori. Third, we propose a reparametrization of the GEV distribution that provides a more natural interpretation of the (possibly covariate‐dependent) model parameters, which in turn helps define meaningful priors. We implement the bGEV distribution with the new parameterization and the P3$$ {}^3 $$C prior approach in the R‐INLA package to make it readily available to users. We illustrate our methods with a simulation study that reveals that the GEV and bGEV distributions are comparable when estimating the right tail under large‐sample settings. Moreover, some small‐sample settings show that the bGEV fit slightly outperforms the GEV fit. Finally, we conclude with an application to NO2$$ {}_2 $$ pollution levels in California that illustrates the suitability of the new parameterization and the P3$$ {}^3 $$C prior approach in the Bayesian framework.

Suggested Citation

  • Daniela Castro‐Camilo & Raphaël Huser & Håvard Rue, 2022. "Practical strategies for generalized extreme value‐based regression models for extremes," Environmetrics, John Wiley & Sons, Ltd., vol. 33(6), September.
  • Handle: RePEc:wly:envmet:v:33:y:2022:i:6:n:e2742
    DOI: 10.1002/env.2742
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/env.2742
    Download Restriction: no

    File URL: https://libkey.io/10.1002/env.2742?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    2. Sabrina Vettori & Raphaël Huser & Marc G. Genton, 2019. "Bayesian modeling of air pollution extremes using nested multivariate max‐stable processes," Biometrics, The International Biometric Society, vol. 75(3), pages 831-841, September.
    3. Stuart G. Coles & Jonathan A. Tawn, 1996. "A Bayesian Analysis of Extreme Rainfall Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 45(4), pages 463-478, December.
    4. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    5. M. L. Stein, 2017. "Should annual maximum temperatures follow a generalized extreme value distribution?," Biometrika, Biometrika Trust, vol. 104(1), pages 1-16.
    6. Tilmann Gneiting & Fadoua Balabdaoui & Adrian E. Raftery, 2007. "Probabilistic forecasts, calibration and sharpness," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(2), pages 243-268, April.
    7. Lindgren, Finn & Rue, Håvard, 2015. "Bayesian Spatial Modelling with R-INLA," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i19).
    8. Anja B. Schmiedt, 2016. "Domains of attraction of asymptotic distributions of extreme generalized order statistics," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 45(7), pages 2089-2104, April.
    9. Broussard, John Paul & Booth, G. Geoffrey, 1998. "The behavior of extreme values in Germany's stock index futures: An application to intradaily margin setting," European Journal of Operational Research, Elsevier, vol. 104(3), pages 393-402, February.
    10. Mendes, Beatriz Vaz de Melo & Lopes, Hedibert Freitas, 2004. "Data driven estimates for mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 47(3), pages 583-598, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jacqueline D. Seufert & Andre Python & Christoph Weisser & Elías Cisneros & Krisztina Kis‐Katos & Thomas Kneib, 2022. "Mapping ex ante risks of COVID‐19 in Indonesia using a Bayesian geostatistical model on airport network data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2121-2155, October.
    2. Cho, Daegon & Hwang, Youngdeok & Park, Jongwon, 2018. "More buzz, more vibes: Impact of social media on concert distribution," Journal of Economic Behavior & Organization, Elsevier, vol. 156(C), pages 103-113.
    3. Andre Python & Andreas Bender & Marta Blangiardo & Janine B. Illian & Ying Lin & Baoli Liu & Tim C.D. Lucas & Siwei Tan & Yingying Wen & Davit Svanidze & Jianwei Yin, 2022. "A downscaling approach to compare COVID‐19 count data from databases aggregated at different spatial scales," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 202-218, January.
    4. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    5. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    6. Daniela Castro-Camilo & Raphaël Huser & Håvard Rue, 2019. "A Spliced Gamma-Generalized Pareto Model for Short-Term Extreme Wind Speed Probabilistic Forecasting," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 517-534, September.
    7. Johnson, Blair T. & Sisti, Anthony & Bernstein, Mary & Chen, Kun & Hennessy, Emily A. & Acabchuk, Rebecca L. & Matos, Michaela, 2021. "Community-level factors and incidence of gun violence in the United States, 2014–2017," Social Science & Medicine, Elsevier, vol. 280(C).
    8. Zhang, Shen & Liu, Xin & Tang, Jinjun & Cheng, Shaowu & Qi, Yong & Wang, Yinhai, 2018. "Spatio-temporal modeling of destination choice behavior through the Bayesian hierarchical approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 537-551.
    9. Paige, John & Fuglstad, Geir-Arne & Riebler, Andrea & Wakefield, Jon, 2022. "Bayesian multiresolution modeling of georeferenced data: An extension of ‘LatticeKrig’," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    10. Aaron Osgood‐Zimmerman & Jon Wakefield, 2023. "A Statistical Review of Template Model Builder: A Flexible Tool for Spatial Modelling," International Statistical Review, International Statistical Institute, vol. 91(2), pages 318-342, August.
    11. William Gonzalez Daza & Renata L. Muylaert & Thadeu Sobral-Souza & Victor Lemes Landeiro, 2023. "Malaria Risk Drivers in the Brazilian Amazon: Land Use—Land Cover Interactions and Biological Diversity," IJERPH, MDPI, vol. 20(15), pages 1-16, August.
    12. Sameh Abdulah & Yuxiao Li & Jian Cao & Hatem Ltaief & David E. Keyes & Marc G. Genton & Ying Sun, 2023. "Large‐scale environmental data science with ExaGeoStatR," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    13. John M. Humphreys & Robert B. Srygley & David H. Branson, 2022. "Geographic Variation in Migratory Grasshopper Recruitment under Projected Climate Change," Geographies, MDPI, vol. 2(1), pages 1-19, January.
    14. John M. Humphreys, 2022. "Amplification in Time and Dilution in Space: Partitioning Spatiotemporal Processes to Assess the Role of Avian-Host Phylodiversity in Shaping Eastern Equine Encephalitis Virus Distribution," Geographies, MDPI, vol. 2(3), pages 1-16, July.
    15. Álvaro Briz-Redón, 2021. "Respondent Burden Effects on Item Non-Response and Careless Response Rates: An Analysis of Two Types of Surveys," Mathematics, MDPI, vol. 9(17), pages 1-16, August.
    16. Dong Liang & Genevieve Nesslage & Michael Wilberg & Thomas Miller, 2017. "Bayesian Calibration of Blue Crab (Callinectes sapidus) Abundance Indices Based on Probability Surveys," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(4), pages 481-497, December.
    17. Silius M. Vandeskog & Sara Martino & Daniela Castro-Camilo & Håvard Rue, 2022. "Modelling Sub-daily Precipitation Extremes with the Blended Generalised Extreme Value Distribution," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(4), pages 598-621, December.
    18. Humphreys, John M. & Srygley, Robert B. & Lawton, Douglas & Hudson, Amy R. & Branson, David H., 2022. "Grasshoppers exhibit asynchrony and spatial non-stationarity in response to the El Niño/Southern and Pacific Decadal Oscillations," Ecological Modelling, Elsevier, vol. 471(C).
    19. Carlos Díaz-Avalos & Pablo Juan & Somnath Chaudhuri & Marc Sáez & Laura Serra, 2020. "Association between the New COVID-19 Cases and Air Pollution with Meteorological Elements in Nine Counties of New York State," IJERPH, MDPI, vol. 17(23), pages 1-18, December.
    20. Beręsewicz Maciej, 2019. "Correlates of Representation Errors in Internet Data Sources for Real Estate Market," Journal of Official Statistics, Sciendo, vol. 35(3), pages 509-529, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:envmet:v:33:y:2022:i:6:n:e2742. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1180-4009/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.