IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v072i02.html
   My bibliography  Save this article

Bayesian Nonparametric Mixture Estimation for Time-Indexed Functional Data in R

Author

Listed:
  • Savitsky, Terrance D.

Abstract

We present growfunctions for R that offers Bayesian nonparametric estimation models for analysis of dependent, noisy time series data indexed by a collection of domains. This data structure arises from combining periodically published government survey statistics, such as are reported in the Current Population Study (CPS). The CPS publishes monthly, by-state estimates of employment levels, where each state expresses a noisy time series. Published state-level estimates from the CPS are composed from household survey responses in a model-free manner and express high levels of volatility due to insufficient sample sizes. Existing software solutions borrow information over a modeled time-based dependence to extract a de-noised time series for each domain. These solutions, however, ignore the dependence among the domains that may be additionally leveraged to improve estimation efficiency. The growfunctions package offers two fully nonparametric mixture models that simultaneously estimate both a time and domain-indexed dependence structure for a collection of time series: (1) A Gaussian process (GP) construction, which is parameterized through the covariance matrix, estimates a latent function for each domain. The covariance parameters of the latent functions are indexed by domain under a Dirichlet process prior that permits estimation of the dependence among functions across the domains: (2) An intrinsic Gaussian Markov random field prior construction provides an alternative to the GP that expresses different computation and estimation properties. In addition to performing denoised estimation of latent functions from published domain estimates, growfunctions allows estimation of collections of functions for observation units (e.g., households), rather than aggregated domains, by accounting for an informative sampling design under which the probabilities for inclusion of observation units are related to the response variable. growfunctions includes plot functions that allow visual assessments of the fit performance and dependence structure of the estimated functions. Computational efficiency is achieved by performing the sampling for estimation functions using compiled C++.

Suggested Citation

  • Savitsky, Terrance D., 2016. "Bayesian Nonparametric Mixture Estimation for Time-Indexed Functional Data in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 72(i02).
  • Handle: RePEc:jss:jstsof:v:072:i02
    DOI: http://hdl.handle.net/10.18637/jss.v072.i02
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v072i02/v72i02.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v072i02/growfunctions_0.13.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v072i02/v72i02.R
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v072.i02?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lee, Duncan, 2013. "CARBayes: An R Package for Bayesian Spatial Modeling with Conditional Autoregressive Priors," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 55(i13).
    2. Gramacy, Robert B. & Taddy, Matthew Alan, 2010. "Categorical Inputs, Sensitivity Analysis, Optimization and Importance Tempering with tgp Version 2, an R Package for Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i06).
    3. Gramacy, Robert B., 2007. "tgp: An R Package for Bayesian Nonstationary, Semiparametric Nonlinear Regression and Design by Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 19(i09).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Horiguchi, Akira & Pratola, Matthew T. & Santner, Thomas J., 2021. "Assessing variable activity for Bayesian regression trees," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    2. Roustant, Olivier & Ginsbourger, David & Deville, Yves, 2012. "DiceKriging, DiceOptim: Two R Packages for the Analysis of Computer Experiments by Kriging-Based Metamodeling and Optimization," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 51(i01).
    3. Davis, Casey B. & Hans, Christopher M. & Santner, Thomas J., 2021. "Prediction of non-stationary response functions using a Bayesian composite Gaussian process," Computational Statistics & Data Analysis, Elsevier, vol. 154(C).
    4. Liukkonen, Lauri & Ayllón, Daniel & Kunnasranta, Mervi & Niemi, Marja & Nabe-Nielsen, Jacob & Grimm, Volker & Nyman, Anna-Maija, 2018. "Modelling movements of Saimaa ringed seals using an individual-based approach," Ecological Modelling, Elsevier, vol. 368(C), pages 321-335.
    5. Nikoline N. Knudsen & Jörg Schullehner & Birgitte Hansen & Lisbeth F. Jørgensen & Søren M. Kristiansen & Denitza D. Voutchkova & Thomas A. Gerds & Per K. Andersen & Kristine Bihrmann & Morten Grønbæk , 2017. "Lithium in Drinking Water and Incidence of Suicide: A Nationwide Individual-Level Cohort Study with 22 Years of Follow-Up," IJERPH, MDPI, vol. 14(6), pages 1-13, June.
    6. Matthew W. Wheeler, 2019. "Bayesian additive adaptive basis tensor product models for modeling high dimensional surfaces: an application to high‐throughput toxicity testing," Biometrics, The International Biometric Society, vol. 75(1), pages 193-201, March.
    7. Al Ali, Hannah & Daneshkhah, Alireza & Boutayeb, Abdesslam & Malunguza, Noble Jahalamajaha & Mukandavire, Zindoga, 2022. "Exploring dynamical properties of a Type 1 diabetes model using sensitivity approaches," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 201(C), pages 324-342.
    8. Erickson, Collin B. & Ankenman, Bruce E. & Sanchez, Susan M., 2018. "Comparison of Gaussian process modeling software," European Journal of Operational Research, Elsevier, vol. 266(1), pages 179-192.
    9. Monterrubio-Gómez, Karla & Roininen, Lassi & Wade, Sara & Damoulas, Theodoros & Girolami, Mark, 2020. "Posterior inference for sparse hierarchical non-stationary models," Computational Statistics & Data Analysis, Elsevier, vol. 148(C).
    10. Xiaoqing Zhao & Junwei Pu & Xingyou Wang & Junxu Chen & Liang Emlyn Yang & Zexian Gu, 2018. "Land-Use Spatio-Temporal Change and Its Driving Factors in an Artificial Forest Area in Southwest China," Sustainability, MDPI, vol. 10(11), pages 1-19, November.
    11. Wiki, Jesse & Kingham, Simon & Campbell, Malcolm, 2021. "A geospatial analysis of Type 2 Diabetes Mellitus and the food environment in urban New Zealand," Social Science & Medicine, Elsevier, vol. 288(C).
    12. Tasuku Okui & Akie Hirata & Naoki Nakashima, 2022. "Association of Esophageal Cancer Mortality with Municipal Socioeconomic Deprivation Level in Japan, 2013–2017: An Ecological Study Using Nationwide Data," IJERPH, MDPI, vol. 19(9), pages 1-9, April.
    13. Gramacy, Robert B. & Taddy, Matthew Alan, 2010. "Categorical Inputs, Sensitivity Analysis, Optimization and Importance Tempering with tgp Version 2, an R Package for Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i06).
    14. Ferreira, Marco A.R. & Porter, Erica M. & Franck, Christopher T., 2021. "Fast and scalable computations for Gaussian hierarchical models with intrinsic conditional autoregressive spatial random effects," Computational Statistics & Data Analysis, Elsevier, vol. 162(C).
    15. Xiao Li & Michele Guindani & Chaan S. Ng & Brian P. Hobbs, 2021. "A Bayesian nonparametric model for textural pattern heterogeneity," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 459-480, March.
    16. Kevin P. Josey & Priyanka deSouza & Xiao Wu & Danielle Braun & Rachel Nethery, 2023. "Estimating a Causal Exposure Response Function with a Continuous Error-Prone Exposure: A Study of Fine Particulate Matter and All-Cause Mortality," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(1), pages 20-41, March.
    17. Sanjeeva Kumar Jha & Ningthoukhongjam Vikimchandra Singh, 2023. "A Skew-Normal Spatial Simultaneous Autoregressive Model and its Implementation," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 306-323, February.
    18. Gerber, Florian & Furrer, Reinhard, 2015. "Pitfalls in the Implementation of Bayesian Hierarchical Modeling of Areal Count Data: An Illustration Using BYM and Leroux Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(c01).
    19. Earl W Duncan & Kerrie L Mengersen, 2020. "Comparing Bayesian spatial models: Goodness-of-smoothing criteria for assessing under- and over-smoothing," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-28, May.
    20. MacDonald, Blake & Ranjan, Pritam & Chipman, Hugh, 2015. "GPfit: An R Package for Fitting a Gaussian Process Model to Deterministic Simulator Outputs," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i12).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:072:i02. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.