IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1011372.html
   My bibliography  Save this article

Finemap-MiXeR: A variational Bayesian approach for genetic finemapping

Author

Listed:
  • Bayram Cevdet Akdeniz
  • Oleksandr Frei
  • Alexey Shadrin
  • Dmitry Vetrov
  • Dmitry Kropotov
  • Eivind Hovig
  • Ole A Andreassen
  • Anders M Dale

Abstract

Genome-wide association studies (GWAS) implicate broad genomic loci containing clusters of highly correlated genetic variants. Finemapping techniques can select and prioritize variants within each GWAS locus which are more likely to have a functional influence on the trait. Here, we present a novel method, Finemap-MiXeR, for finemapping causal variants from GWAS summary statistics, controlling for correlation among variants due to linkage disequilibrium. Our method is based on a variational Bayesian approach and direct optimization of the Evidence Lower Bound (ELBO) of the likelihood function derived from the MiXeR model. After obtaining the analytical expression for ELBO’s gradient, we apply Adaptive Moment Estimation (ADAM) algorithm for optimization, allowing us to obtain the posterior causal probability of each variant. Using these posterior causal probabilities, we validated Finemap-MiXeR across a wide range of scenarios using both synthetic data, and real data on height from the UK Biobank. Comparison of Finemap-MiXeR with two existing methods, FINEMAP and SuSiE RSS, demonstrated similar or improved accuracy. Furthermore, our method is computationally efficient in several aspects. For example, unlike many other methods in the literature, its computational complexity does not increase with the number of true causal variants in a locus and it does not require any matrix inversion operation. The mathematical framework of Finemap-MiXeR is flexible and may also be applied to other problems including cross-trait and cross-ancestry finemapping.Author summary: Genome-Wide Association Studies report the effect size of each genomic variant as summary statistics. Due to the correlated structure of the genomic variants, it may not be straightforward to determine the actual causal genomic variants from these summary statistics. Finemapping studies aim to identify these causal SNPs using different approaches. Here, we presented a novel finemapping method, called Finemap-MiXeR, to determine the actual causal variants using summary statistics data and weighted linkage disequilibrium matrix as input. Our method is based on Variational Bayesian inference on MiXeR model and Evidence Lower Bound of the model is determined to obtain a tractable optimization function. Afterwards, we determined the first derivatives of this Evidence Lower Bound, and finally, Adaptive Moment Estimation is applied to perform optimization. Our method has been validated on synthetic and real data, and similar or better performance than the existing finemapping tools has been observed.

Suggested Citation

  • Bayram Cevdet Akdeniz & Oleksandr Frei & Alexey Shadrin & Dmitry Vetrov & Dmitry Kropotov & Eivind Hovig & Ole A Andreassen & Anders M Dale, 2024. "Finemap-MiXeR: A variational Bayesian approach for genetic finemapping," PLOS Genetics, Public Library of Science, vol. 20(8), pages 1-21, August.
  • Handle: RePEc:plo:pgen00:1011372
    DOI: 10.1371/journal.pgen.1011372
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1011372
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1011372&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1011372?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dominic Holland & Oleksandr Frei & Rahul Desikan & Chun-Chieh Fan & Alexey A Shadrin & Olav B Smeland & V S Sundar & Paul Thompson & Ole A Andreassen & Anders M Dale, 2020. "Beyond SNP heritability: Polygenicity and discoverability of phenotypes estimated with a univariate Gaussian mixture model," PLOS Genetics, Public Library of Science, vol. 16(5), pages 1-30, May.
    2. Gao Wang & Abhishek Sarkar & Peter Carbonetto & Matthew Stephens, 2020. "A simple new approach to variable selection in regression, with application to genetic fine mapping," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1273-1300, December.
    3. Oleksandr Frei & Dominic Holland & Olav B. Smeland & Alexey A. Shadrin & Chun Chieh Fan & Steffen Maeland & Kevin S. O’Connell & Yunpeng Wang & Srdjan Djurovic & Wesley K. Thompson & Ole A. Andreassen, 2019. "Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation," Nature Communications, Nature, vol. 10(1), pages 1-11, December.
    4. Yuxin Zou & Peter Carbonetto & Gao Wang & Matthew Stephens, 2022. "Fine-mapping from summary data with the “Sum of Single Effects” model," PLOS Genetics, Public Library of Science, vol. 18(7), pages 1-24, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mengge Liu & Lu Wang & Yujie Zhang & Haoyang Dong & Caihong Wang & Yayuan Chen & Qian Qian & Nannan Zhang & Shaoying Wang & Guoshu Zhao & Zhihui Zhang & Minghuan Lei & Sijia Wang & Qiyu Zhao & Feng Li, 2024. "Investigating the shared genetic architecture between depression and subcortical volumes," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Royce E. Clifford & Adam X. Maihofer & Chris Chatzinakos & Jonathan R. I. Coleman & Nikolaos P. Daskalakis & Marianna Gasperi & Kelleigh Hogan & Elizabeth A. Mikita & Murray B. Stein & Catherine Tchea, 2024. "Genetic architecture distinguishes tinnitus from hearing loss," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    3. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    4. Sylvia Hartmann & Summaira Yasmeen & Benjamin M. Jacobs & Spiros Denaxas & Munir Pirmohamed & Eric R. Gamazon & Mark J. Caulfield & Harry Hemingway & Maik Pietzner & Claudia Langenberg, 2023. "ADRA2A and IRX1 are putative risk genes for Raynaud’s phenomenon," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    5. Ichcha Manipur & Guillermo Reales & Jae Hoon Sul & Myung Kyun Shin & Simonne Longerich & Adrian Cortes & Chris Wallace, 2024. "CoPheScan: phenome-wide association studies accounting for linkage disequilibrium," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    6. Eugene Lin & Yu-Ting Yan & Mu-Hong Chen & Albert C. Yang & Po-Hsiu Kuo & Shih-Jen Tsai, 2025. "Gene clusters linked to insulin resistance identified in a genome-wide study of the Taiwan Biobank population," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    7. Isabelle Austin-Zimmerman & Daniel F. Levey & Olga Giannakopoulou & Joseph D. Deak & Marco Galimberti & Keyrun Adhikari & Hang Zhou & Spiros Denaxas & Haritz Irizar & Karoline Kuchenbaecker & Andrew M, 2023. "Genome-wide association studies and cross-population meta-analyses investigating short and long sleep duration," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Wenhan Chen & Yang Wu & Zhili Zheng & Ting Qi & Peter M. Visscher & Zhihong Zhu & Jian Yang, 2021. "Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    9. Nathan LaPierre & Kodi Taraszka & Helen Huang & Rosemary He & Farhad Hormozdiari & Eleazar Eskin, 2021. "Identifying causal variants by fine mapping across multiple studies," PLOS Genetics, Public Library of Science, vol. 17(9), pages 1-19, September.
    10. Cheng, Yuanyuan, 2023. "A method of 3R to evaluate the correlation and predictive value of variables," OSF Preprints c79tu, Center for Open Science.
    11. Morten Dybdahl Krebs & Gonçalo Espregueira Themudo & Michael Eriksen Benros & Ole Mors & Anders D. Børglum & David Hougaard & Preben Bo Mortensen & Merete Nordentoft & Michael J. Gandal & Chun Chieh F, 2021. "Associations between patterns in comorbid diagnostic trajectories of individuals with schizophrenia and etiological factors," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    12. Mary P. LaPierre & Katherine Lawler & Svenja Godbersen & I. Sadaf Farooqi & Markus Stoffel, 2022. "MicroRNA-7 regulates melanocortin circuits involved in mammalian energy homeostasis," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    13. Yunfeng Huang & Dora Bodnar & Chia-Yen Chen & Gabriela Sanchez-Andrade & Mark Sanderson & Jun Shi & Katherine G. Meilleur & Matthew E. Hurles & Sebastian S. Gerety & Ellen A. Tsai & Heiko Runz, 2023. "Rare genetic variants impact muscle strength," Nature Communications, Nature, vol. 14(1), pages 1-8, December.
    14. Huiying He & Yue Leng & Xinglan Cao & Yiwang Zhu & Xiaoxia Li & Qiaoling Yuan & Bin Zhang & Wenchuang He & Hua Wei & Xiangpei Liu & Qiang Xu & Mingliang Guo & Hong Zhang & Longbo Yang & Yang Lv & Xian, 2024. "The pan-tandem repeat map highlights multiallelic variants underlying gene expression and agronomic traits in rice," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    15. Joel T. Rämö & Tuomo Kiiskinen & Richard Seist & Kristi Krebs & Masahiro Kanai & Juha Karjalainen & Mitja Kurki & Eija Hämäläinen & Paavo Häppölä & Aki S. Havulinna & Heidi Hautakangas & Reedik Mägi &, 2023. "Genome-wide screen of otosclerosis in population biobanks: 27 loci and shared associations with skeletal structure," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    16. Elmo C. Saarentaus & Juha Karjalainen & Joel T. Rämö & Tuomo Kiiskinen & Aki S. Havulinna & Juha Mehtonen & Heidi Hautakangas & Sanni Ruotsalainen & Max Tamlander & Nina Mars & Sanna Toppila-Salmi & M, 2023. "Inflammatory and infectious upper respiratory diseases associate with 41 genomic loci and type 2 inflammation," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    17. Mingxuan Cai & Zhiwei Wang & Jiashun Xiao & Xianghong Hu & Gang Chen & Can Yang, 2023. "XMAP: Cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    18. Linda Ottensmann & Rubina Tabassum & Sanni E. Ruotsalainen & Mathias J. Gerl & Christian Klose & Elisabeth Widén & Kai Simons & Samuli Ripatti & Matti Pirinen, 2023. "Genome-wide association analysis of plasma lipidome identifies 495 genetic associations," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    19. N. Hernández & J. Soenksen & P. Newcombe & M. Sandhu & I. Barroso & C. Wallace & J. L. Asimit, 2021. "The flashfm approach for fine-mapping multiple quantitative traits," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    20. Priya Gupta & Marco Galimberti & Yue Liu & Sarah Beck & Aliza Wingo & Thomas Wingo & Keyrun Adhikari & Henry R. Kranzler & Murray B. Stein & Joel Gelernter & Daniel F. Levey, 2024. "A genome-wide investigation into the underlying genetic architecture of personality traits and overlap with psychopathology," Nature Human Behaviour, Nature, vol. 8(11), pages 2235-2249, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1011372. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.