IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1010184.html
   My bibliography  Save this article

A Bayesian model selection approach to mediation analysis

Author

Listed:
  • Wesley L Crouse
  • Gregory R Keele
  • Madeleine S Gastonguay
  • Gary A Churchill
  • William Valdar

Abstract

Genetic studies often seek to establish a causal chain of events originating from genetic variation through to molecular and clinical phenotypes. When multiple phenotypes share a common genetic association, one phenotype may act as an intermediate for the genetic effects on the other. Alternatively, the phenotypes may be causally unrelated but share genetic loci. Mediation analysis represents a class of causal inference approaches used to determine which of these scenarios is most plausible. We have developed a general approach to mediation analysis based on Bayesian model selection and have implemented it in an R package, bmediatR. Bayesian model selection provides a flexible framework that can be tailored to different analyses. Our approach can incorporate prior information about the likelihood of models and the strength of causal effects. It can also accommodate multiple genetic variants or multi-state haplotypes. Our approach reports posterior probabilities that can be useful in interpreting uncertainty among competing models. We compared bmediatR with other popular methods, including the Sobel test, Mendelian randomization, and Bayesian network analysis using simulated data. We found that bmediatR performed as well or better than these alternatives in most scenarios. We applied bmediatR to proteome data from Diversity Outbred (DO) mice, a multi-parent population, and demonstrate the power of mediation with multi-state haplotypes. We also applied bmediatR to data from human cell lines to identify transcripts that are mediated through or are expressed independently from local chromatin accessibility. We demonstrate that Bayesian model selection provides a powerful and versatile approach to identify causal relationships in genetic studies using model organism or human data.Author summary: Mediation analysis is a powerful tool for the discovery of causal relationships. Causal inference is increasingly being applied to genetic and genomic data to understand the chain of intermediate steps that propagate the effects of genetic variation. We describe a Bayesian model selection approach to mediation analysis, implemented in our bmediatR software. Using simulations, we show that bmediatR performs as well or better than established causal inference methods including the Sobel test, Mendelian randomization, and Bayesian network analysis. Bayesian model selection can incorporate prior information, provides greater flexibility in model specification, and can be tailored to address different inference questions. The model selection perspective helps to delineate the assumptions of causal analyses using the Sobel test or Mendelian randomization. We applied bmediatR to data from Diversity Outbred mice and human cell lines to demonstrate its ability to derive biologically meaningful findings in a variety of contexts.

Suggested Citation

  • Wesley L Crouse & Gregory R Keele & Madeleine S Gastonguay & Gary A Churchill & William Valdar, 2022. "A Bayesian model selection approach to mediation analysis," PLOS Genetics, Public Library of Science, vol. 18(5), pages 1-33, May.
  • Handle: RePEc:plo:pgen00:1010184
    DOI: 10.1371/journal.pgen.1010184
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1010184
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1010184&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1010184?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Claudia Giambartolomei & Damjan Vukcevic & Eric E Schadt & Lude Franke & Aroon D Hingorani & Chris Wallace & Vincent Plagnol, 2014. "Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics," PLOS Genetics, Public Library of Science, vol. 10(5), pages 1-15, May.
    2. Christopher N. Foley & James R. Staley & Philip G. Breen & Benjamin B. Sun & Paul D. W. Kirk & Stephen Burgess & Joanna M. M. Howson, 2021. "A fast and efficient colocalization algorithm for identifying shared genetic risk factors across multiple traits," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    3. Jeffrey T Leek & John D Storey, 2007. "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis," PLOS Genetics, Public Library of Science, vol. 3(9), pages 1-12, September.
    4. Chris Wallace, 2021. "A more accurate method for colocalisation analysis allowing for multiple causal variants," PLOS Genetics, Public Library of Science, vol. 17(9), pages 1-11, September.
    5. Scutari, Marco, 2010. "Learning Bayesian Networks with the bnlearn R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 35(i03).
    6. Richard Howey & Alexander D Clark & Najib Naamane & Louise N Reynard & Arthur G Pratt & Heather J Cordell, 2021. "A Bayesian network approach incorporating imputation of missing data enables exploratory analysis of complex causal biological relationships," PLOS Genetics, Public Library of Science, vol. 17(9), pages 1-28, September.
    7. Richard Howey & So-Youn Shin & Caroline Relton & George Davey Smith & Heather J Cordell, 2020. "Bayesian network analysis incorporating genetic anchors complements conventional Mendelian randomization approaches for exploratory analysis of causal relationships in complex data," PLOS Genetics, Public Library of Science, vol. 16(3), pages 1-35, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Karl Smith-Byrne & Åsa Hedman & Marios Dimitriou & Trishna Desai & Alexandr V. Sokolov & Helgi B. Schioth & Mine Koprulu & Maik Pietzner & Claudia Langenberg & Joshua Atkins & Ricardo Cortez Penha & J, 2024. "Identifying therapeutic targets for cancer among 2074 circulating proteins and risk of nine cancers," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    2. Ichcha Manipur & Guillermo Reales & Jae Hoon Sul & Myung Kyun Shin & Simonne Longerich & Adrian Cortes & Chris Wallace, 2024. "CoPheScan: phenome-wide association studies accounting for linkage disequilibrium," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    3. Valur Emilsson & Elias F. Gudmundsson & Thorarinn Jonmundsson & Brynjolfur G. Jonsson & Michael Twarog & Valborg Gudmundsdottir & Zhiguang Li & Nancy Finkel & Stephen Poor & Xin Liu & Robert Esterberg, 2022. "A proteogenomic signature of age-related macular degeneration in blood," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. V. E. Jackson & Y. Wu & R. Bonelli & J. P. Owen & L. W. Scott & S. Farashi & Y. Kihara & M. L. Gantner & C. Egan & K. M. Williams & B. R. E. Ansell & A. Tufail & A. Y. Lee & M. Bahlo, 2025. "Multi-omic spatial effects on high-resolution AI-derived retinal thickness," Nature Communications, Nature, vol. 16(1), pages 1-19, December.
    5. David Stacey & Lingyan Chen & Paulina J. Stanczyk & Joanna M. M. Howson & Amy M. Mason & Stephen Burgess & Stephen MacDonald & Jonathan Langdown & Harriett McKinney & Kate Downes & Neda Farahi & James, 2022. "Elucidating mechanisms of genetic cross-disease associations at the PROCR vascular disease locus," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    6. Matthew T. Patrick & Qinmengge Li & Rachael Wasikowski & Nehal Mehta & Johann E. Gudjonsson & James T. Elder & Xiang Zhou & Lam C. Tsoi, 2022. "Shared genetic risk factors and causal association between psoriasis and coronary artery disease," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    7. Hanmin Guo & Alexander Eckehart Urban & Wing Hung Wong, 2024. "Prioritizing disease-related rare variants by integrating gene expression data," PLOS Genetics, Public Library of Science, vol. 20(9), pages 1-16, September.
    8. Andrew A. Brown & Juan J. Fernandez-Tajes & Mun-gwan Hong & Caroline A. Brorsson & Robert W. Koivula & David Davtian & Théo Dupuis & Ambra Sartori & Theodora-Dafni Michalettou & Ian M. Forgie & Jonath, 2023. "Genetic analysis of blood molecular phenotypes reveals common properties in the regulatory networks affecting complex traits," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    9. Amil M. Shah & Peder L. Myhre & Victoria Arthur & Pranav Dorbala & Humaira Rasheed & Leo F. Buckley & Brian Claggett & Guning Liu & Jianzhong Ma & Ngoc Quynh Nguyen & Kunihiro Matsushita & Chiadi Ndum, 2024. "Large scale plasma proteomics identifies novel proteins and protein networks associated with heart failure development," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    10. Lucas A. Mavromatis & Daniel B. Rosoff & Andrew S. Bell & Jeesun Jung & Josephin Wagner & Falk W. Lohoff, 2023. "Multi-omic underpinnings of epigenetic aging and human longevity," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    11. Sourya Bhattacharyya & Ferhat Ay, 2024. "Identifying genetic variants associated with chromatin looping and genome function," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    12. Paula Laccourreye & Concha Bielza & Pedro Larrañaga, 2022. "Explainable Machine Learning for Longitudinal Multi-Omic Microbiome," Mathematics, MDPI, vol. 10(12), pages 1-23, June.
    13. Lingyan Chen & James E. Peters & Bram Prins & Elodie Persyn & Matthew Traylor & Praveen Surendran & Savita Karthikeyan & Ekaterina Yonova-Doing & Emanuele Angelantonio & David J. Roberts & Nicholas A., 2022. "Systematic Mendelian randomization using the human plasma proteome to discover potential therapeutic targets for stroke," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    14. Maik Pietzner & Robert Lorenz Chua & Eleanor Wheeler & Katharina Jechow & Julian D. S. Willett & Helena Radbruch & Saskia Trump & Bettina Heidecker & Hugo Zeberg & Frank L. Heppner & Roland Eils & Mar, 2022. "ELF5 is a potential respiratory epithelial cell-specific risk gene for severe COVID-19," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    15. Bryan R. Gorman & Sun-Gou Ji & Michael Francis & Anoop K. Sendamarai & Yunling Shi & Poornima Devineni & Uma Saxena & Elizabeth Partan & Andrea K. DeVito & Jinyoung Byun & Younghun Han & Xiangjun Xiao, 2024. "Multi-ancestry GWAS meta-analyses of lung cancer reveal susceptibility loci and elucidate smoking-independent genetic risk," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    16. Benjamin B. Sun & Stephanie J. Loomis & Fabrizio Pizzagalli & Natalia Shatokhina & Jodie N. Painter & Christopher N. Foley & Megan E. Jensen & Donald G. McLaren & Sai Spandana Chintapalli & Alyssa H. , 2022. "Genetic map of regional sulcal morphology in the human brain from UK biobank data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    17. Prabal Das & D. A. Sachindra & Kironmala Chanda, 2022. "Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(15), pages 6043-6071, December.
    18. Jacob Joseph & Chang Liu & Qin Hui & Krishna Aragam & Zeyuan Wang & Brian Charest & Jennifer E. Huffman & Jacob M. Keaton & Todd L. Edwards & Serkalem Demissie & Luc Djousse & Juan P. Casas & J. Micha, 2022. "Genetic architecture of heart failure with preserved versus reduced ejection fraction," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    19. Vuong, Quan-Hoang & La, Viet-Phuong, 2019. "The bayesvl R package. User guide v0.8.1," OSF Preprints w5dx6, Center for Open Science.
    20. F. Cugnata & G. Perucca & S. Salini, 2017. "Bayesian networks and the assessment of universities' value added," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(10), pages 1785-1806, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1010184. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.