IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v178y2023ics0167947322001773.html
   My bibliography  Save this article

Robust prediction interval estimation for Gaussian processes by cross-validation method

Author

Listed:
  • Acharki, Naoufal
  • Bertoncello, Antoine
  • Garnier, Josselin

Abstract

Probabilistic regression models typically use the Maximum Likelihood Estimation or Cross-Validation to fit parameters. These methods can give an advantage to the solutions that fit observations on average, but they do not pay attention to the coverage and the width of Prediction Intervals. A robust two-step approach is used to address the problem of adjusting and calibrating Prediction Intervals for Gaussian Processes Regression. First, the covariance hyperparameters are determined by a standard Cross-Validation or Maximum Likelihood Estimation method. A Leave-One-Out Coverage Probability is introduced as a metric to adjust the covariance hyperparameters and assess the optimal type II Coverage Probability to a nominal level. Then a relaxation method is applied to choose the hyperparameters that minimize the Wasserstein distance between the Gaussian distribution with the initial hyperparameters (obtained by Cross-Validation or Maximum Likelihood Estimation) and the proposed Gaussian distribution with the hyperparameters that achieve the desired Coverage Probability. The method gives Prediction Intervals with appropriate coverage probabilities and small widths.

Suggested Citation

  • Acharki, Naoufal & Bertoncello, Antoine & Garnier, Josselin, 2023. "Robust prediction interval estimation for Gaussian processes by cross-validation method," Computational Statistics & Data Analysis, Elsevier, vol. 178(C).
  • Handle: RePEc:eee:csdana:v:178:y:2023:i:c:s0167947322001773
    DOI: 10.1016/j.csda.2022.107597
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322001773
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107597?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. J. F. Lawless & Marc Fredette, 2005. "Frequentist prediction intervals and predictive distributions," Biometrika, Biometrika Trust, vol. 92(3), pages 529-542, September.
    2. Jing Lei & Max G’Sell & Alessandro Rinaldo & Ryan J. Tibshirani & Larry Wasserman, 2018. "Distribution-Free Predictive Inference for Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1094-1111, July.
    3. Berger J.O. & De Oliveira V. & Sanso B., 2001. "Objective Bayesian Analysis of Spatially Correlated Data," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1361-1374, December.
    4. Haozhe Zhang & Joshua Zimmerman & Dan Nettleton & Daniel J. Nordman, 2020. "Random Forest Prediction Intervals," The American Statistician, Taylor & Francis Journals, vol. 74(4), pages 392-406, October.
    5. Kleijnen, Jack P. C. & Sargent, Robert G., 2000. "A methodology for fitting and validating metamodels in simulation," European Journal of Operational Research, Elsevier, vol. 120(1), pages 14-29, January.
    6. Bachoc, François, 2013. "Cross Validation and Maximum Likelihood estimations of hyper-parameters of Gaussian processes with model misspecification," Computational Statistics & Data Analysis, Elsevier, vol. 66(C), pages 55-69.
    7. Jeremy E. Oakley & Anthony O'Hagan, 2004. "Probabilistic sensitivity analysis of complex models: a Bayesian approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(3), pages 751-769, August.
    8. Landon, Joshua & Singpurwalla, Nozer D., 2008. "Choosing a Coverage Probability for Prediction Intervals," The American Statistician, American Statistical Association, vol. 62, pages 120-124, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lu, Cheng & Teng, Da & Chen, Jun-Yu & Fei, Cheng-Wei & Keshtegar, Behrooz, 2023. "Adaptive vectorial surrogate modeling framework for multi-objective reliability estimation," Reliability Engineering and System Safety, Elsevier, vol. 234(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrianakis, Ioannis & Challenor, Peter G., 2012. "The effect of the nugget on Gaussian process emulators of computer models," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4215-4228.
    2. Marrel, Amandine & Iooss, Bertrand & Laurent, Béatrice & Roustant, Olivier, 2009. "Calculations of Sobol indices for the Gaussian process metamodel," Reliability Engineering and System Safety, Elsevier, vol. 94(3), pages 742-751.
    3. Katarzyna Growiec & Jakub Growiec & Bogumil Kaminski, 2017. "Social Network Structure and The Trade-Off Between Social Utility and Economic Performance," KAE Working Papers 2017-026, Warsaw School of Economics, Collegium of Economic Analysis.
    4. Kleijnen, Jack P. C., 2005. "An overview of the design and analysis of simulation experiments for sensitivity analysis," European Journal of Operational Research, Elsevier, vol. 164(2), pages 287-300, July.
    5. S. Cucurachi & E. Borgonovo & R. Heijungs, 2016. "A Protocol for the Global Sensitivity Analysis of Impact Assessment Models in Life Cycle Assessment," Risk Analysis, John Wiley & Sons, vol. 36(2), pages 357-377, February.
    6. Jakub Bijak & Jason D. Hilton & Eric Silverman & Viet Dung Cao, 2013. "Reforging the Wedding Ring," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 29(27), pages 729-766.
    7. Xueping Chen & Yujie Gai & Xiaodi Wang, 2023. "A-optimal designs for non-parametric symmetrical global sensitivity analysis," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 86(2), pages 219-237, February.
    8. Matieyendou Lamboni, 2020. "Uncertainty quantification: a minimum variance unbiased (joint) estimator of the non-normalized Sobol’ indices," Statistical Papers, Springer, vol. 61(5), pages 1939-1970, October.
    9. Isaac Corro Ramos & Maureen P. M. H. Rutten-van Mölken & Maiwenn J. Al, 2013. "The Role of Value-of-Information Analysis in a Health Care Research Priority Setting," Medical Decision Making, , vol. 33(4), pages 472-489, May.
    10. Veiga, Sébastien Da & Marrel, Amandine, 2020. "Gaussian process regression with linear inequality constraints," Reliability Engineering and System Safety, Elsevier, vol. 195(C).
    11. Petropoulos, G. & Wooster, M.J. & Carlson, T.N. & Kennedy, M.C. & Scholze, M., 2009. "A global Bayesian sensitivity analysis of the 1d SimSphere soil–vegetation–atmospheric transfer (SVAT) model using Gaussian model emulation," Ecological Modelling, Elsevier, vol. 220(19), pages 2427-2440.
    12. Lu, Xuefei & Borgonovo, Emanuele, 2023. "Global sensitivity analysis in epidemiological modeling," European Journal of Operational Research, Elsevier, vol. 304(1), pages 9-24.
    13. Giorgio Fagiolo & Mattia Guerini & Francesco Lamperti & Alessio Moneta & Andrea Roventini, 2017. "Validation of Agent-Based Models in Economics and Finance," LEM Papers Series 2017/23, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    14. Park, Eunchun & Brorsen, B. Wade & Harri, Ardian, 2016. "Using Bayesian Spatial Smoothing and Extreme Value Theory to Develop Area-Yield Crop Insurance Rating," 2016 Annual Meeting, July 31-August 2, Boston, Massachusetts 235754, Agricultural and Applied Economics Association.
    15. H. Christopher Frey & Sumeet R. Patil, 2002. "Identification and Review of Sensitivity Analysis Methods," Risk Analysis, John Wiley & Sons, vol. 22(3), pages 553-578, June.
    16. Tianyang Wang & James S. Dyer & Warren J. Hahn, 2017. "Sensitivity analysis of decision making under dependent uncertainties using copulas," EURO Journal on Decision Processes, Springer;EURO - The Association of European Operational Research Societies, vol. 5(1), pages 117-139, November.
    17. Brian D. Williamson & Peter B. Gilbert & Marco Carone & Noah Simon, 2021. "Nonparametric variable importance assessment using machine learning techniques," Biometrics, The International Biometric Society, vol. 77(1), pages 9-22, March.
    18. Xuefei Lu & Alessandro Rudi & Emanuele Borgonovo & Lorenzo Rosasco, 2020. "Faster Kriging: Facing High-Dimensional Simulators," Operations Research, INFORMS, vol. 68(1), pages 233-249, January.
    19. Hemez, François M. & Atamturktur, Sezer, 2011. "The dangers of sparse sampling for the quantification of margin and uncertainty," Reliability Engineering and System Safety, Elsevier, vol. 96(9), pages 1220-1231.
    20. Coolen-Maturi, Tahani & Elkhafifi, Faiza F. & Coolen, Frank P.A., 2014. "Three-group ROC analysis: A nonparametric predictive approach," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 69-81.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:178:y:2023:i:c:s0167947322001773. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.