IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v064i12.html
   My bibliography  Save this article

GPfit: An R Package for Fitting a Gaussian Process Model to Deterministic Simulator Outputs

Author

Listed:
  • MacDonald, Blake
  • Ranjan, Pritam
  • Chipman, Hugh

Abstract

Gaussian process (GP) models are commonly used statistical metamodels for emulating expensive computer simulators. Fitting a GP model can be numerically unstable if any pair of design points in the input space are close together. Ranjan, Haynes, and Karsten (2011) proposed a computationally stable approach for fitting GP models to deterministic computer simulators. They used a genetic algorithm based approach that is robust but computationally intensive for maximizing the likelihood. This paper implements a slightly modified version ofthe model proposed by Ranjan et al. (2011 ) in the R package GPfit. A novel parameterization of the spatial correlation function and a clustering based multi-start gradient based optimization algorithm yield robust optimization that is typically faster than the genetic algorithm based approach. We present two examples with R codes to illustrate the usage of the main functions in GPfit . Several test functions are used for performance comparison with the popular R package mlegp . We also use GPfit for a real application, i.e., for emulating the tidal kinetic energy model for the Bay of Fundy, Nova Scotia, Canada. GPfit is free software and distributed under the General Public License and available from the Comprehensive R Archive Network.

Suggested Citation

  • MacDonald, Blake & Ranjan, Pritam & Chipman, Hugh, 2015. "GPfit: An R Package for Fitting a Gaussian Process Model to Deterministic Simulator Outputs," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i12).
  • Handle: RePEc:jss:jstsof:v:064:i12
    DOI: http://hdl.handle.net/10.18637/jss.v064.i12
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v064i12/v64i12.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v064i12/GPfit_0.2-1.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v064i12/v64i12.R
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v064i12/tidalpower_output.txt
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v064i12/v64i12-tables.zip
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v064.i12?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zeileis, Achim & Hornik, Kurt & Murrell, Paul, 2009. "Escaping RGBland: Selecting colors for statistical graphics," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3259-3270, July.
    2. Gramacy, Robert B & Lee, Herbert K. H, 2008. "Bayesian Treed Gaussian Process Models With an Application to Computer Modeling," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 1119-1130.
    3. Gramacy, Robert B., 2007. "tgp: An R Package for Bayesian Nonstationary, Semiparametric Nonlinear Regression and Design by Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 19(i09).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jackson Samuel E. & Vernon Ian & Liu Junli & Lindsey Keith, 2020. "Understanding hormonal crosstalk in Arabidopsis root development via emulation and history matching," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 19(2), pages 1-33, April.
    2. Palomo, Jesús & Paulo, Rui & García-Donato, Gonzalo, 2015. "SAVE: An R Package for the Statistical Analysis of Computer Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i13).
    3. Oskar Gustafsson & Mattias Villani & Pär Stockhammar, 2023. "Bayesian optimization of hyperparameters from noisy marginal likelihood estimates," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(4), pages 577-595, June.
    4. Sameh Abdulah & Yuxiao Li & Jian Cao & Hatem Ltaief & David E. Keyes & Marc G. Genton & Ying Sun, 2023. "Large‐scale environmental data science with ExaGeoStatR," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    5. Erickson, Collin B. & Ankenman, Bruce E. & Sanchez, Susan M., 2018. "Comparison of Gaussian process modeling software," European Journal of Operational Research, Elsevier, vol. 266(1), pages 179-192.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew W. Wheeler, 2019. "Bayesian additive adaptive basis tensor product models for modeling high dimensional surfaces: an application to high‐throughput toxicity testing," Biometrics, The International Biometric Society, vol. 75(1), pages 193-201, March.
    2. Erickson, Collin B. & Ankenman, Bruce E. & Sanchez, Susan M., 2018. "Comparison of Gaussian process modeling software," European Journal of Operational Research, Elsevier, vol. 266(1), pages 179-192.
    3. Monterrubio-Gómez, Karla & Roininen, Lassi & Wade, Sara & Damoulas, Theodoros & Girolami, Mark, 2020. "Posterior inference for sparse hierarchical non-stationary models," Computational Statistics & Data Analysis, Elsevier, vol. 148(C).
    4. Horiguchi, Akira & Pratola, Matthew T. & Santner, Thomas J., 2021. "Assessing variable activity for Bayesian regression trees," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    5. Daniel W. Gladish & Daniel E. Pagendam & Luk J. M. Peeters & Petra M. Kuhnert & Jai Vaze, 2018. "Emulation Engines: Choice and Quantification of Uncertainty for Complex Hydrological Models," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(1), pages 39-62, March.
    6. Davis, Casey B. & Hans, Christopher M. & Santner, Thomas J., 2021. "Prediction of non-stationary response functions using a Bayesian composite Gaussian process," Computational Statistics & Data Analysis, Elsevier, vol. 154(C).
    7. Waley W. J. Liang & Herbert K. H. Lee, 2019. "Bayesian nonstationary Gaussian process models via treed process convolutions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 797-818, September.
    8. Samuel W. Malone & Robert B. Gramacy & Enrique Ter Horst, 2016. "Timing Foreign Exchange Markets," Econometrics, MDPI, vol. 4(1), pages 1-23, March.
    9. Florian Huber & Luca Rossini, 2020. "Inference in Bayesian Additive Vector Autoregressive Tree Models," Papers 2006.16333, arXiv.org, revised Mar 2021.
    10. Al Ali, Hannah & Daneshkhah, Alireza & Boutayeb, Abdesslam & Malunguza, Noble Jahalamajaha & Mukandavire, Zindoga, 2022. "Exploring dynamical properties of a Type 1 diabetes model using sensitivity approaches," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 201(C), pages 324-342.
    11. Stanislav Katina & Liberty Vittert & Adrian W. Bowman, 2021. "Functional data analysis and visualisation of three‐dimensional surface shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 691-713, June.
    12. repec:jss:jstsof:32:i01 is not listed on IDEAS
    13. Andrew Hoegh & Marco A. R. Ferreira & Scotland Leman, 2016. "Spatiotemporal model fusion: multiscale modelling of civil unrest," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(4), pages 529-545, August.
    14. Jonas Schöley, 2021. "The centered ternary balance scheme: A technique to visualize surfaces of unbalanced three-part compositions," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 44(19), pages 443-458.
    15. Touzani, Samir & Busby, Daniel, 2013. "Smoothing spline analysis of variance approach for global sensitivity analysis of computer codes," Reliability Engineering and System Safety, Elsevier, vol. 112(C), pages 67-81.
    16. Marco H. Benedetti & Veronica J. Berrocal & Naveen N. Narisetty, 2022. "Identifying regions of inhomogeneities in spatial processes via an M‐RA and mixture priors," Biometrics, The International Biometric Society, vol. 78(2), pages 798-811, June.
    17. Kelly R. Moran & Matthew W. Wheeler, 2022. "Fast increased fidelity samplers for approximate Bayesian Gaussian process regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1198-1228, September.
    18. Jing Chang & Herbert K.H. Lee, 2015. "Variable selection via a multi-stage strategy," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(4), pages 762-774, April.
    19. Gramacy, Robert B. & Taddy, Matthew Alan, 2010. "Categorical Inputs, Sensitivity Analysis, Optimization and Importance Tempering with tgp Version 2, an R Package for Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i06).
    20. Bozağaç, Doruk & Batmaz, İnci & Oğuztüzün, Halit, 2016. "Dynamic simulation metamodeling using MARS: A case of radar simulation," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 124(C), pages 69-86.
    21. Lian, Heng & Li, Gaorong, 2014. "Series expansion for functional sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 150-165.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:064:i12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.