IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1011372.html
   My bibliography  Save this article

Finemap-MiXeR: A variational Bayesian approach for genetic finemapping

Author

Listed:
  • Bayram Cevdet Akdeniz
  • Oleksandr Frei
  • Alexey Shadrin
  • Dmitry Vetrov
  • Dmitry Kropotov
  • Eivind Hovig
  • Ole A Andreassen
  • Anders M Dale

Abstract

Genome-wide association studies (GWAS) implicate broad genomic loci containing clusters of highly correlated genetic variants. Finemapping techniques can select and prioritize variants within each GWAS locus which are more likely to have a functional influence on the trait. Here, we present a novel method, Finemap-MiXeR, for finemapping causal variants from GWAS summary statistics, controlling for correlation among variants due to linkage disequilibrium. Our method is based on a variational Bayesian approach and direct optimization of the Evidence Lower Bound (ELBO) of the likelihood function derived from the MiXeR model. After obtaining the analytical expression for ELBO’s gradient, we apply Adaptive Moment Estimation (ADAM) algorithm for optimization, allowing us to obtain the posterior causal probability of each variant. Using these posterior causal probabilities, we validated Finemap-MiXeR across a wide range of scenarios using both synthetic data, and real data on height from the UK Biobank. Comparison of Finemap-MiXeR with two existing methods, FINEMAP and SuSiE RSS, demonstrated similar or improved accuracy. Furthermore, our method is computationally efficient in several aspects. For example, unlike many other methods in the literature, its computational complexity does not increase with the number of true causal variants in a locus and it does not require any matrix inversion operation. The mathematical framework of Finemap-MiXeR is flexible and may also be applied to other problems including cross-trait and cross-ancestry finemapping.Author summary: Genome-Wide Association Studies report the effect size of each genomic variant as summary statistics. Due to the correlated structure of the genomic variants, it may not be straightforward to determine the actual causal genomic variants from these summary statistics. Finemapping studies aim to identify these causal SNPs using different approaches. Here, we presented a novel finemapping method, called Finemap-MiXeR, to determine the actual causal variants using summary statistics data and weighted linkage disequilibrium matrix as input. Our method is based on Variational Bayesian inference on MiXeR model and Evidence Lower Bound of the model is determined to obtain a tractable optimization function. Afterwards, we determined the first derivatives of this Evidence Lower Bound, and finally, Adaptive Moment Estimation is applied to perform optimization. Our method has been validated on synthetic and real data, and similar or better performance than the existing finemapping tools has been observed.

Suggested Citation

  • Bayram Cevdet Akdeniz & Oleksandr Frei & Alexey Shadrin & Dmitry Vetrov & Dmitry Kropotov & Eivind Hovig & Ole A Andreassen & Anders M Dale, 2024. "Finemap-MiXeR: A variational Bayesian approach for genetic finemapping," PLOS Genetics, Public Library of Science, vol. 20(8), pages 1-21, August.
  • Handle: RePEc:plo:pgen00:1011372
    DOI: 10.1371/journal.pgen.1011372
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1011372
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1011372&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1011372?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dominic Holland & Oleksandr Frei & Rahul Desikan & Chun-Chieh Fan & Alexey A Shadrin & Olav B Smeland & V S Sundar & Paul Thompson & Ole A Andreassen & Anders M Dale, 2020. "Beyond SNP heritability: Polygenicity and discoverability of phenotypes estimated with a univariate Gaussian mixture model," PLOS Genetics, Public Library of Science, vol. 16(5), pages 1-30, May.
    2. Yuxin Zou & Peter Carbonetto & Gao Wang & Matthew Stephens, 2022. "Fine-mapping from summary data with the “Sum of Single Effects” model," PLOS Genetics, Public Library of Science, vol. 18(7), pages 1-24, July.
    3. Gao Wang & Abhishek Sarkar & Peter Carbonetto & Matthew Stephens, 2020. "A simple new approach to variable selection in regression, with application to genetic fine mapping," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1273-1300, December.
    4. Oleksandr Frei & Dominic Holland & Olav B. Smeland & Alexey A. Shadrin & Chun Chieh Fan & Steffen Maeland & Kevin S. O’Connell & Yunpeng Wang & Srdjan Djurovic & Wesley K. Thompson & Ole A. Andreassen, 2019. "Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation," Nature Communications, Nature, vol. 10(1), pages 1-11, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mengge Liu & Lu Wang & Yujie Zhang & Haoyang Dong & Caihong Wang & Yayuan Chen & Qian Qian & Nannan Zhang & Shaoying Wang & Guoshu Zhao & Zhihui Zhang & Minghuan Lei & Sijia Wang & Qiyu Zhao & Feng Li, 2024. "Investigating the shared genetic architecture between depression and subcortical volumes," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Ying Chen & Panpan Liu & Aniko Sabo & Dongyin Guan, 2025. "Human genetic variation determines 24-hour rhythmic gene expression and disease risk," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    3. Royce E. Clifford & Adam X. Maihofer & Chris Chatzinakos & Jonathan R. I. Coleman & Nikolaos P. Daskalakis & Marianna Gasperi & Kelleigh Hogan & Elizabeth A. Mikita & Murray B. Stein & Catherine Tchea, 2024. "Genetic architecture distinguishes tinnitus from hearing loss," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    4. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    5. Sylvia Hartmann & Summaira Yasmeen & Benjamin M. Jacobs & Spiros Denaxas & Munir Pirmohamed & Eric R. Gamazon & Mark J. Caulfield & Harry Hemingway & Maik Pietzner & Claudia Langenberg, 2023. "ADRA2A and IRX1 are putative risk genes for Raynaud’s phenomenon," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    6. Emma Hazelwood & Daffodil M. Canson & Benedita Deslandes & Xuemin Wang & Pik Fang Kho & Danny Legge & Andrei-Emil Constantinescu & Matthew A. Lee & D. Timothy Bishop & Andrew T. Chan & Stephen B. Grub, 2025. "Multi-tissue expression and splicing data prioritise anatomical subsite- and sex-specific colorectal cancer susceptibility genes," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    7. Eugene Lin & Yu-Ting Yan & Mu-Hong Chen & Albert C. Yang & Po-Hsiu Kuo & Shih-Jen Tsai, 2025. "Gene clusters linked to insulin resistance identified in a genome-wide study of the Taiwan Biobank population," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    8. Isabelle Austin-Zimmerman & Daniel F. Levey & Olga Giannakopoulou & Joseph D. Deak & Marco Galimberti & Keyrun Adhikari & Hang Zhou & Spiros Denaxas & Haritz Irizar & Karoline Kuchenbaecker & Andrew M, 2023. "Genome-wide association studies and cross-population meta-analyses investigating short and long sleep duration," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    9. Nathan LaPierre & Kodi Taraszka & Helen Huang & Rosemary He & Farhad Hormozdiari & Eleazar Eskin, 2021. "Identifying causal variants by fine mapping across multiple studies," PLOS Genetics, Public Library of Science, vol. 17(9), pages 1-19, September.
    10. Cheng, Yuanyuan, 2023. "A method of 3R to evaluate the correlation and predictive value of variables," OSF Preprints c79tu, Center for Open Science.
    11. Morten Dybdahl Krebs & Gonçalo Espregueira Themudo & Michael Eriksen Benros & Ole Mors & Anders D. Børglum & David Hougaard & Preben Bo Mortensen & Merete Nordentoft & Michael J. Gandal & Chun Chieh F, 2021. "Associations between patterns in comorbid diagnostic trajectories of individuals with schizophrenia and etiological factors," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    12. Yunfeng Huang & Dora Bodnar & Chia-Yen Chen & Gabriela Sanchez-Andrade & Mark Sanderson & Jun Shi & Katherine G. Meilleur & Matthew E. Hurles & Sebastian S. Gerety & Ellen A. Tsai & Heiko Runz, 2023. "Rare genetic variants impact muscle strength," Nature Communications, Nature, vol. 14(1), pages 1-8, December.
    13. Huiying He & Yue Leng & Xinglan Cao & Yiwang Zhu & Xiaoxia Li & Qiaoling Yuan & Bin Zhang & Wenchuang He & Hua Wei & Xiangpei Liu & Qiang Xu & Mingliang Guo & Hong Zhang & Longbo Yang & Yang Lv & Xian, 2024. "The pan-tandem repeat map highlights multiallelic variants underlying gene expression and agronomic traits in rice," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    14. Mingxuan Cai & Zhiwei Wang & Jiashun Xiao & Xianghong Hu & Gang Chen & Can Yang, 2023. "XMAP: Cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    15. Linda Ottensmann & Rubina Tabassum & Sanni E. Ruotsalainen & Mathias J. Gerl & Christian Klose & Elisabeth Widén & Kai Simons & Samuli Ripatti & Matti Pirinen, 2023. "Genome-wide association analysis of plasma lipidome identifies 495 genetic associations," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    16. N. Hernández & J. Soenksen & P. Newcombe & M. Sandhu & I. Barroso & C. Wallace & J. L. Asimit, 2021. "The flashfm approach for fine-mapping multiple quantitative traits," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    17. Priya Gupta & Marco Galimberti & Yue Liu & Sarah Beck & Aliza Wingo & Thomas Wingo & Keyrun Adhikari & Henry R. Kranzler & Murray B. Stein & Joel Gelernter & Daniel F. Levey, 2024. "A genome-wide investigation into the underlying genetic architecture of personality traits and overlap with psychopathology," Nature Human Behaviour, Nature, vol. 8(11), pages 2235-2249, November.
    18. Ian B. Stanaway & Pradeep Suri & Niloofar Afari & Daniel Dochtermann & Armand Gerstenberger & Saiju Pyarajan & Eric J. Roseen & Marianna Gasperi, 2025. "Multi-ancestry meta-analysis of genome-wide association studies discovers 67 new loci associated with chronic back pain," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    19. Tzu-Ting Chen & Jaeyoung Kim & Max Lam & Yi-Fang Chuang & Yen-Ling Chiu & Shu-Chin Lin & Sang-Hyuk Jung & Beomsu Kim & Soyeon Kim & Chamlee Cho & Injeong Shim & Sanghyeon Park & Yeeun Ahn & Aysu Okbay, 2024. "Shared genetic architectures of educational attainment in East Asian and European populations," Nature Human Behaviour, Nature, vol. 8(3), pages 562-575, March.
    20. Hongru Li & Jingyi Zhao & Jinglan Dai & Dongfang You & Yang Zhao & David C. Christiani & Feng Chen & Sipeng Shen, 2025. "Multi-ancestry sequencing-based genome-wide association study of C-reactive protein in 513,273 genomes," Nature Communications, Nature, vol. 16(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1011372. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.