IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v70y2021i4p886-908.html
   My bibliography  Save this article

A computationally efficient Bayesian seemingly unrelated regressions model for high‐dimensional quantitative trait loci discovery

Author

Listed:
  • Leonardo Bottolo
  • Marco Banterle
  • Sylvia Richardson
  • Mika Ala‐Korpela
  • Marjo‐Riitta Järvelin
  • Alex Lewin

Abstract

Our work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31‐year follow‐up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high‐throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high‐dimensional data, with cell‐sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype–phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r‐project.org/web/packages/BayesSUR/

Suggested Citation

  • Leonardo Bottolo & Marco Banterle & Sylvia Richardson & Mika Ala‐Korpela & Marjo‐Riitta Järvelin & Alex Lewin, 2021. "A computationally efficient Bayesian seemingly unrelated regressions model for high‐dimensional quantitative trait loci discovery," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 886-908, August.
  • Handle: RePEc:bla:jorssc:v:70:y:2021:i:4:p:886-908
    DOI: 10.1111/rssc.12490
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12490
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssc.12490?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. P. J. Brown & M. Vannucci & T. Fearn, 2002. "Bayes model averaging with selection of regressors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 519-536, August.
    2. Wang, Hao, 2010. "Sparse seemingly unrelated regression modelling: Applications in finance and econometrics," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2866-2877, November.
    3. Alberto Roverato, 2002. "Hyper Inverse Wishart Distribution for Non‐decomposable Graphs and its Application to Bayesian Inference for Gaussian Graphical Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 29(3), pages 391-411, September.
    4. Anindya Bhadra & Bani K. Mallick, 2013. "Joint High-Dimensional Bayesian Variable and Covariance Selection with an Application to eQTL Analysis," Biometrics, The International Biometric Society, vol. 69(2), pages 447-457, June.
    5. Johannes Kettunen & Ayşe Demirkan & Peter Würtz & Harmen H.M. Draisma & Toomas Haller & Rajesh Rawal & Anika Vaarhorst & Antti J. Kangas & Leo-Pekka Lyytikäinen & Matti Pirinen & René Pool & Antti-Pek, 2016. "Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA," Nature Communications, Nature, vol. 7(1), pages 1-9, September.
    6. Enrico Petretto & Leonardo Bottolo & Sarah R Langley & Matthias Heinig & Chris McDermott-Roe & Rizwan Sarwar & Michal Pravenec & Norbert Hübner & Timothy J Aitman & Stuart A Cook & Sylvia Richardson, 2010. "New Insights into the Genetic Control of Gene Expression using a Bayesian Multi-tissue Approach," PLOS Computational Biology, Public Library of Science, vol. 6(4), pages 1-13, April.
    7. P. J. Brown & M. Vannucci & T. Fearn, 1998. "Multivariate Bayesian variable selection and prediction," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(3), pages 627-641.
    8. Scott-Boyer Marie Pier & Imholte Gregory C. & Tayeb Arafat & Labbe Aurelie & Deschepper Christian F. & Gottardo Raphael, 2012. "An Integrated Hierarchical Bayesian Model for Multivariate eQTL Mapping," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-30, July.
    9. Nicoló Fusi & Oliver Stegle & Neil D Lawrence, 2012. "Joint Modelling of Confounding Factors and Prominent Genetic Regulators Provides Increased Accuracy in Genetical Genomics Studies," PLOS Computational Biology, Public Library of Science, vol. 8(1), pages 1-9, January.
    10. Peter J. Green & Alun Thomas, 2013. "Sampling decomposable graphs using a Markov chain on junction trees," Biometrika, Biometrika Trust, vol. 100(1), pages 91-110.
    11. Zellner, Arnold & Ando, Tomohiro, 2010. "A direct Monte Carlo approach for Bayesian analysis of the seemingly unrelated regression model," Journal of Econometrics, Elsevier, vol. 159(1), pages 33-45, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guido Consonni & Luca La Rocca & Stefano Peluso, 2017. "Objective Bayes Covariate-Adjusted Sparse Graphical Model Selection," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(3), pages 741-764, September.
    2. Codazzi, Laura & Colombi, Alessandro & Gianella, Matteo & Argiento, Raffaele & Paci, Lucia & Pini, Alessia, 2022. "Gaussian graphical modeling for spectrometric data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    3. Abdul Salam & Marco Grzegorczyk, 2023. "Model averaging for sparse seemingly unrelated regression using Bayesian networks among the errors," Computational Statistics, Springer, vol. 38(2), pages 779-808, June.
    4. Wang, Hao, 2010. "Sparse seemingly unrelated regression modelling: Applications in finance and econometrics," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2866-2877, November.
    5. Theo S. Eicher & Chris Papageorgiou & Adrian E. Raftery, 2011. "Default priors and predictive performance in Bayesian model averaging, with application to growth determinants," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 26(1), pages 30-55, January/F.
    6. Dimitris Korobilis, 2008. "Forecasting in vector autoregressions with many predictors," Advances in Econometrics, in: Bayesian Econometrics, pages 403-431, Emerald Group Publishing Limited.
    7. Anastasia Dimiski, 2020. "Factors that affect Students’ performance in Science: An application using Gini-BMA methodology in PISA 2015 dataset," Working Papers 2004, University of Guelph, Department of Economics and Finance.
    8. Bai, Ray & Ghosh, Malay, 2018. "High-dimensional multivariate posterior consistency under global–local shrinkage priors," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 157-170.
    9. Daniel Felix Ahelegbey & Monica Billio & Roberto Casarin, 2016. "Sparse Graphical Vector Autoregression: A Bayesian Approach," Annals of Economics and Statistics, GENES, issue 123-124, pages 333-361.
    10. Ram C. Kafle & Netra Khanal & Chris P. Tsokos, 2014. "Bayesian age-stratified joinpoint regression model: an application to lung and brain cancer mortality," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(12), pages 2727-2742, December.
    11. Ouysse, Rachida & Kohn, Robert, 2010. "Bayesian variable selection and model averaging in the arbitrage pricing theory model," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3249-3268, December.
    12. Federico Castelletti, 2020. "Bayesian Model Selection of Gaussian Directed Acyclic Graph Structures," International Statistical Review, International Statistical Institute, vol. 88(3), pages 752-775, December.
    13. Steven N. Durlauf & Andros Kourtellos & Chih Ming Tan, 2005. "How Robust Are the Linkages Between Religiosity and Economic Growth," Discussion Papers Series, Department of Economics, Tufts University 0510, Department of Economics, Tufts University.
    14. Beatrice Franzolini & Alexandros Beskos & Maria De Iorio & Warrick Poklewski Koziell & Karolina Grzeszkiewicz, 2022. "Change point detection in dynamic Gaussian graphical models: the impact of COVID-19 pandemic on the US stock market," Papers 2208.00952, arXiv.org, revised May 2023.
    15. Gruber, Lutz F. & West, Mike, 2017. "Bayesian online variable selection and scalable multivariate volatility forecasting in simultaneous graphical dynamic linear models," Econometrics and Statistics, Elsevier, vol. 3(C), pages 3-22.
    16. Riccardo (Jack) Lucchetti & Luca Pedini, 2020. "ParMA: Parallelised Bayesian Model Averaging for Generalised Linear Models," Working Papers 2020:28, Department of Economics, University of Venice "Ca' Foscari".
    17. Anindya Bhadra & Arvind Rao & Veerabhadran Baladandayuthapani, 2018. "Inferring network structure in non†normal and mixed discrete†continuous genomic data," Biometrics, The International Biometric Society, vol. 74(1), pages 185-195, March.
    18. Leonardo Bottolo & Marc Chadeau-Hyam & David I Hastie & Tanja Zeller & Benoit Liquet & Paul Newcombe & Loic Yengo & Philipp S Wild & Arne Schillert & Andreas Ziegler & Sune F Nielsen & Adam S Butterwo, 2013. "GUESS-ing Polygenic Associations with Multiple Phenotypes Using a GPU-Based Evolutionary Stochastic Search Algorithm," PLOS Genetics, Public Library of Science, vol. 9(8), pages 1-17, August.
    19. Kan Shao & Mitchell J. Small, 2011. "Potential Uncertainty Reduction in Model‐Averaged Benchmark Dose Estimates Informed by an Additional Dose Study," Risk Analysis, John Wiley & Sons, vol. 31(10), pages 1561-1575, October.
    20. Stephan Wachtel & Thomas Otter, 2013. "Successive Sample Selection and Its Relevance for Management Decisions," Marketing Science, INFORMS, vol. 32(1), pages 170-185, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:70:y:2021:i:4:p:886-908. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.