Printed from https://ideas.repec.org/a/spr/aistmt/v74y2022i3d10.1007_s10463-021-00810-6.html

Bayes factor asymptotics for variable selection in the Gaussian process framework

Author

Listed:
  • Minerva Mukhopadhyay

    (Indian Institute of Technology)

  • Sourabh Bhattacharya

    (Indian Statistical Institute)

Abstract

We investigate Bayesian variable selection in models driven by Gaussian processes, a framework that lets us treat linear, nonlinear and nonparametric models, even in dependent setups, in a unified way. We take the Bayes factor route to variable selection and develop a general asymptotic theory for the Gaussian process framework in “large p, large n” settings, including $p \gg n$, establishing almost sure exponential convergence of the Bayes factor under appropriately mild conditions. The fixed p setup is included as a special case. To illustrate, we apply our result to variable selection in linear regression, in a Gaussian process model with a squared exponential covariance function accommodating the covariates, and in a first-order autoregressive process with time-varying covariates. We also follow up our theoretical investigations with extensive simulation experiments in the above regression contexts and with variable selection in a real riboflavin data set consisting of 71 observations and 4088 covariates. To implement variable selection using Bayes factors, we develop a novel and effective general-purpose transdimensional, transformation-based Markov chain Monte Carlo algorithm, which has played a crucial role in the simulated and real data applications.
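
As a rough illustration of the quantity the abstract studies, the sketch below computes a log Bayes factor between two candidate covariate subsets under a zero-mean Gaussian process with a squared exponential covariance. It is not the authors' method: the length scale and noise variance are held fixed (an assumption, so each marginal likelihood is simply a multivariate normal density), no transdimensional MCMC is involved, and every function name is hypothetical.

```python
import math


def sq_exp_kernel(X, cols, length_scale=1.0, noise=0.1):
    """Squared exponential covariance over the selected covariate columns.

    Assumption: fixed length_scale and noise variance; noise is added on the
    diagonal so the matrix is positive definite.
    """
    n = len(X)
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d2 = sum((X[i][c] - X[j][c]) ** 2 for c in cols)
            K[i][j] = math.exp(-0.5 * d2 / length_scale ** 2)
        K[i][i] += noise
    return K


def cholesky(A):
    """Lower-triangular Cholesky factor of a positive definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L


def log_marginal(y, K):
    """Log density of y under N(0, K), via the Cholesky factor of K."""
    n = len(y)
    L = cholesky(K)
    # Forward substitution: solve L z = y, so that y' K^{-1} y = z' z.
    z = []
    for i in range(n):
        z.append((y[i] - sum(L[i][k] * z[k] for k in range(i))) / L[i][i])
    quad = sum(v * v for v in z)
    logdet = 2.0 * sum(math.log(L[i][i]) for i in range(n))
    return -0.5 * (quad + logdet + n * math.log(2.0 * math.pi))


def log_bayes_factor(y, X, cols_a, cols_b, **kernel_args):
    """Log Bayes factor of covariate subset cols_a against cols_b."""
    return (log_marginal(y, sq_exp_kernel(X, cols_a, **kernel_args))
            - log_marginal(y, sq_exp_kernel(X, cols_b, **kernel_args)))


if __name__ == "__main__":
    # Toy data (hypothetical): the response depends on column 0 only.
    X = [[0.1 * i, (i * 7 % 5) / 5.0] for i in range(20)]
    y = [math.sin(2.0 * row[0]) for row in X]
    print(log_bayes_factor(y, X, cols_a=[0], cols_b=[1]))
```

A positive value favours the first subset and a negative value the second. In a full analysis the kernel hyperparameters would carry priors and be integrated out, and the model space would be far too large to enumerate, which is where a sampler such as the one the abstract describes comes in.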

Suggested Citation

  • Minerva Mukhopadhyay & Sourabh Bhattacharya, 2022. "Bayes factor asymptotics for variable selection in the Gaussian process framework," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 581-613, June.
  • Handle: RePEc:spr:aistmt:v:74:y:2022:i:3:d:10.1007_s10463-021-00810-6
    DOI: 10.1007/s10463-021-00810-6

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-021-00810-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10463-021-00810-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Tiago M. Fragoso & Wesley Bertoli & Francisco Louzada, 2018. "Bayesian Model Averaging: A Systematic Review and Conceptual Classification," International Statistical Review, International Statistical Institute, vol. 86(1), pages 1-28, April.
    2. Meyer M.C. & Laud P.W., 2002. "Predictive Variable Selection in Generalized Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 859-871, September.
    3. Jan R. Magnus, 1978. "The moments of products of quadratic forms in normal variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 32(4), pages 201-210, December.
    4. Newey, Whitney K, 1991. "Uniform Convergence in Probability and Stochastic Equicontinuity," Econometrica, Econometric Society, vol. 59(4), pages 1161-1167, July.
    5. repec:dau:papers:123456789/7848 is not listed on IDEAS
    6. Valen E. Johnson & David Rossell, 2012. "Bayesian Model Selection in High-Dimensional Settings," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 649-660, June.
    7. Suprateek Kundu & David B. Dunson, 2014. "Bayes Variable Selection in Semiparametric Linear Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 437-447, March.
    8. M.‐H. Chen & J. G. Ibrahim & C. Yiannoutsos, 1999. "Prior elicitation, variable selection and Bayesian computation for logistic regression models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(1), pages 223-242.
    9. Hong, Han & Preston, Bruce, 2012. "Bayesian averaging, prediction and nonnested model selection," Journal of Econometrics, Elsevier, vol. 167(2), pages 358-369.
    10. Jean-Michel Marin & Natesh S. Pillai & Christian P. Robert & Judith Rousseau, 2014. "Relevant statistics for Bayesian model choice," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(5), pages 833-859, November.
    11. Elías Moreno & F. Girón, 2008. "Comparison of Bayesian objective procedures for variable selection in linear regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(3), pages 472-490, November.
    12. Elías Moreno & F. Girón, 2008. "Comparison of Bayesian objective procedures for variable selection in linear regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(3), pages 491-492, November.
    13. Marra, Giampiero & Wood, Simon N., 2011. "Practical variable selection for generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2372-2387, July.
    14. Liang, Feng & Paulo, Rui & Molina, German & Clyde, Merlise A. & Berger, Jim O., 2008. "Mixtures of g Priors for Bayesian Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 410-423, March.
    15. Minerva Mukhopadhyay & Tapas Samanta & Arijit Chakrabarti, 2015. "On consistency and optimality of Bayesian variable selection based on $g$-prior in normal linear regression models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(5), pages 963-997, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Cheng & Jiang, Wenxin, 2016. "On oracle property and asymptotic validity of Bayesian generalized method of moments," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 132-147.
    2. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    3. Dimitris Korobilis & Kenichi Shimizu, 2022. "Bayesian Approaches to Shrinkage and Sparse Estimation," Foundations and Trends(R) in Econometrics, now publishers, vol. 11(4), pages 230-354, June.
    4. Minerva Mukhopadhyay & Tapas Samanta, 2017. "A mixture of g-priors for variable selection when the number of regressors grows with the sample size," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(2), pages 377-404, June.
    5. Dimitris Fouskakis & Ioannis Ntzoufras, 2017. "Information consistency of the Jeffreys power-expected-posterior prior in Gaussian linear models," METRON, Springer;Sapienza Università di Roma, vol. 75(3), pages 371-380, December.
    6. Moreno, E. & Girón, F.J. & Martínez, M.L. & Vázquez-Polo, F.J. & Negrín, M.A., 2013. "Optimal treatments in cost-effectiveness analysis in the presence of covariates: Improving patient subgroup definition," European Journal of Operational Research, Elsevier, vol. 226(1), pages 173-182.
    7. Faming Liang & Momiao Xiong, 2013. "Bayesian Detection of Causal Rare Variants under Posterior Consistency," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-16, July.
    8. Moreno, Elías & Girón, F.J. & Vázquez-Polo, F.J. & Negrín, M.A., 2012. "Optimal healthcare decisions: The importance of the covariates in cost–effectiveness analysis," European Journal of Operational Research, Elsevier, vol. 218(2), pages 512-522.
    9. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    10. Nadja Klein & Michael Stanley Smith, 2021. "Bayesian variable selection for non‐Gaussian responses: a marginally calibrated copula approach," Biometrics, The International Biometric Society, vol. 77(3), pages 809-823, September.
    11. Fouskakis, Dimitris & Ntzoufras, Ioannis & Perrakis, Konstantinos, 2020. "Variations of power-expected-posterior priors in normal regression models," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
    12. Latouche, Pierre & Mattei, Pierre-Alexandre & Bouveyron, Charles & Chiquet, Julien, 2016. "Combining a relaxed EM algorithm with Occam’s razor for Bayesian variable selection in high-dimensional regression," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 177-190.
    13. Sang Gil Kang & Woo Dong Lee & Yongku Kim, 2022. "Objective Bayesian group variable selection for linear model," Computational Statistics, Springer, vol. 37(3), pages 1287-1310, July.
    14. Andrew J. Womack & Luis León-Novelo & George Casella, 2014. "Inference From Intrinsic Bayes' Procedures Under Model Selection and Uncertainty," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1040-1053, September.
    15. Ho-Hsiang Wu & Marco A. R. Ferreira & Mohamed Elkhouly & Tieming Ji, 2020. "Hyper Nonlocal Priors for Variable Selection in Generalized Linear Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(1), pages 147-185, February.
    16. Chen, Min & Wang, Xinlei, 2011. "Approximate predictive densities and their applications in generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1570-1580, April.
    17. Shi, Guiling & Lim, Chae Young & Maiti, Tapabrata, 2019. "Bayesian model selection for generalized linear models using non-local priors," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 285-296.
    18. Chang, Jinyuan & Chen, Song Xi & Chen, Xiaohong, 2015. "High dimensional generalized empirical likelihood for moment restrictions with dependent data," Journal of Econometrics, Elsevier, vol. 185(1), pages 283-304.
    19. Domenico Giannone & Michele Lenza & Lucrezia Reichlin, 2011. "Market Freedom and the Global Recession," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 59(1), pages 111-135, April.
    20. Isaiah Andrews & Anna Mikusheva, 2016. "Conditional Inference With a Functional Nuisance Parameter," Econometrica, Econometric Society, vol. 84, pages 1571-1612, July.
