IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1010299.html
   My bibliography  Save this article

Fine-mapping from summary data with the “Sum of Single Effects” model

Author

Listed:
  • Yuxin Zou
  • Peter Carbonetto
  • Gao Wang
  • Matthew Stephens

Abstract

In recent work, Wang et al introduced the “Sum of Single Effects” (SuSiE) model, and showed that it provides a simple and efficient approach to fine-mapping genetic variants from individual-level data. Here we present new methods for fitting the SuSiE model to summary data, for example to single-SNP z-scores from an association study and linkage disequilibrium (LD) values estimated from a suitable reference panel. To develop these new methods, we first describe a simple, generic strategy for extending any individual-level data method to deal with summary data. The key idea is to replace the usual regression likelihood with an analogous likelihood based on summary data. We show that existing fine-mapping methods such as FINEMAP and CAVIAR also (implicitly) use this strategy, but in different ways, and so this provides a common framework for understanding different methods for fine-mapping. We investigate other common practical issues in fine-mapping with summary data, including problems caused by inconsistencies between the z-scores and LD estimates, and we develop diagnostics to identify these inconsistencies. We also present a new refinement procedure that improves model fits in some data sets, and hence improves overall reliability of the SuSiE fine-mapping results. Detailed evaluations of fine-mapping methods in a range of simulated data sets show that SuSiE applied to summary data is competitive, in both speed and accuracy, with the best available fine-mapping methods for summary data.Author summary: The goal of fine-mapping is to identify the genetic variants that causally affect some trait of interest. Fine-mapping is challenging because the genetic variants can be highly correlated due to a phenomenon called linkage disequilibrium (LD). The most successful current approaches to fine-mapping frame the problem as a variable selection problem, and here we focus on one such approach based on the “Sum of Single Effects” (SuSiE) model. The main contribution of this paper is to extend SuSiE to work with summary data, which is often accessible when the full genotype and phenotype data are not. In the process of extending SuSiE, we developed a new mathematical framework that helps to explain existing fine-mapping methods for summary data, why they work well (or not), and under what circumstances. In simulations, we show that SuSiE applied to summary data is competitive with the best available fine-mapping methods for summary data. We also show how different factors such as accuracy of the LD estimates can affect the quality of the fine-mapping.

Suggested Citation

  • Yuxin Zou & Peter Carbonetto & Gao Wang & Matthew Stephens, 2022. "Fine-mapping from summary data with the “Sum of Single Effects” model," PLOS Genetics, Public Library of Science, vol. 18(7), pages 1-24, July.
  • Handle: RePEc:plo:pgen00:1010299
    DOI: 10.1371/journal.pgen.1010299
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1010299
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1010299&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1010299?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Claudia Giambartolomei & Damjan Vukcevic & Eric E Schadt & Lude Franke & Aroon D Hingorani & Chris Wallace & Vincent Plagnol, 2014. "Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics," PLOS Genetics, Public Library of Science, vol. 10(5), pages 1-15, May.
    2. Buhm Han & Hyun Min Kang & Eleazar Eskin, 2009. "Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers," PLOS Genetics, Public Library of Science, vol. 5(4), pages 1-13, April.
    3. Gao Wang & Abhishek Sarkar & Peter Carbonetto & Matthew Stephens, 2020. "A simple new approach to variable selection in regression, with application to genetic fine mapping," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1273-1300, December.
    4. repec:plo:pgen00:1005272 is not listed on IDEAS
    5. repec:plo:pgen00:1004722 is not listed on IDEAS
    6. Wenhan Chen & Yang Wu & Zhili Zheng & Ting Qi & Peter M. Visscher & Zhihong Zhu & Jian Yang, 2021. "Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bayram Cevdet Akdeniz & Oleksandr Frei & Alexey Shadrin & Dmitry Vetrov & Dmitry Kropotov & Eivind Hovig & Ole A Andreassen & Anders M Dale, 2024. "Finemap-MiXeR: A variational Bayesian approach for genetic finemapping," PLOS Genetics, Public Library of Science, vol. 20(8), pages 1-21, August.
    2. Ying Chen & Panpan Liu & Aniko Sabo & Dongyin Guan, 2025. "Human genetic variation determines 24-hour rhythmic gene expression and disease risk," Nature Communications, Nature, vol. 16(1), pages 1-14, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    2. Sylvia Hartmann & Summaira Yasmeen & Benjamin M. Jacobs & Spiros Denaxas & Munir Pirmohamed & Eric R. Gamazon & Mark J. Caulfield & Harry Hemingway & Maik Pietzner & Claudia Langenberg, 2023. "ADRA2A and IRX1 are putative risk genes for Raynaud’s phenomenon," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    3. Emma Hazelwood & Daffodil M. Canson & Benedita Deslandes & Xuemin Wang & Pik Fang Kho & Danny Legge & Andrei-Emil Constantinescu & Matthew A. Lee & D. Timothy Bishop & Andrew T. Chan & Stephen B. Grub, 2025. "Multi-tissue expression and splicing data prioritise anatomical subsite- and sex-specific colorectal cancer susceptibility genes," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    4. Mingxuan Cai & Zhiwei Wang & Jiashun Xiao & Xianghong Hu & Gang Chen & Can Yang, 2023. "XMAP: Cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    5. Priya Gupta & Marco Galimberti & Yue Liu & Sarah Beck & Aliza Wingo & Thomas Wingo & Keyrun Adhikari & Henry R. Kranzler & Murray B. Stein & Joel Gelernter & Daniel F. Levey, 2024. "A genome-wide investigation into the underlying genetic architecture of personality traits and overlap with psychopathology," Nature Human Behaviour, Nature, vol. 8(11), pages 2235-2249, November.
    6. Hongru Li & Jingyi Zhao & Jinglan Dai & Dongfang You & Yang Zhao & David C. Christiani & Feng Chen & Sipeng Shen, 2025. "Multi-ancestry sequencing-based genome-wide association study of C-reactive protein in 513,273 genomes," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    7. Jiapei Yuan & Yang Tong & Le Wang & Xiaoxiao Yang & Xiaochuan Liu & Meng Shu & Zekun Li & Wen Jin & Chenchen Guan & Yuting Wang & Qiang Zhang & Yang Yang, 2024. "A compendium of genetic variations associated with promoter usage across 49 human tissues," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    8. Javier Botey-Bataller & Hedwig D. Vrijmoeth & Jeanine Ursinus & Bart-Jan Kullberg & Cees C. Wijngaard & Hadewych Hofstede & Ahmed Alaswad & Manoj K. Gupta & Lennart M. Roesner & Jochen Huehn & Thomas , 2024. "A comprehensive genetic map of cytokine responses in Lyme borreliosis," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    9. Ying Chen & Panpan Liu & Aniko Sabo & Dongyin Guan, 2025. "Human genetic variation determines 24-hour rhythmic gene expression and disease risk," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    10. Katelyn E. Connelly & Katherine Hullin & Ehssan Abdolalizadeh & Jun Zhong & Daina Eiser & Aidan O’Brien & Irene Collins & Sudipto Das & Gerard Duncan & Stephen J. Chanock & Rachael Z. Stolzenberg-Solo, 2025. "Allelic effects on KLHL17 expression underlie a pancreatic cancer genome-wide association signal at chr1p36.33," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    11. Bryan R. Gorman & Sun-Gou Ji & Michael Francis & Anoop K. Sendamarai & Yunling Shi & Poornima Devineni & Uma Saxena & Elizabeth Partan & Andrea K. DeVito & Jinyoung Byun & Younghun Han & Xiangjun Xiao, 2024. "Multi-ancestry GWAS meta-analyses of lung cancer reveal susceptibility loci and elucidate smoking-independent genetic risk," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    12. Qingbo S. Wang & Ryuya Edahiro & Ho Namkoong & Takanori Hasegawa & Yuya Shirai & Kyuto Sonehara & Hiromu Tanaka & Ho Lee & Ryunosuke Saiki & Takayoshi Hyugaji & Eigo Shimizu & Kotoe Katayama & Masahir, 2022. "The whole blood transcriptional regulation landscape in 465 COVID-19 infected samples from Japan COVID-19 Task Force," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    13. Ya Cui & Frederick J. Arnold & Fanglue Peng & Dan Wang & Jason Sheng Li & Sebastian Michels & Eric J. Wagner & Albert R. Spada & Wei Li, 2023. "Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    14. Eeva Sliz & Jaakko S. Tyrmi & Nilufer Rahmioglu & Krina T. Zondervan & Christian M. Becker & Outi Uimari & Johannes Kettunen, 2023. "Evidence of a causal effect of genetic tendency to gain muscle mass on uterine leiomyomata," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    15. Ichcha Manipur & Guillermo Reales & Jae Hoon Sul & Myung Kyun Shin & Simonne Longerich & Adrian Cortes & Chris Wallace, 2024. "CoPheScan: phenome-wide association studies accounting for linkage disequilibrium," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    16. Wenhan Chen & Yang Wu & Zhili Zheng & Ting Qi & Peter M. Visscher & Zhihong Zhu & Jian Yang, 2021. "Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    17. Mary P. LaPierre & Katherine Lawler & Svenja Godbersen & I. Sadaf Farooqi & Markus Stoffel, 2022. "MicroRNA-7 regulates melanocortin circuits involved in mammalian energy homeostasis," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    18. Sourya Bhattacharyya & Ferhat Ay, 2024. "Identifying genetic variants associated with chromatin looping and genome function," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    19. Alan Selewa & Kaixuan Luo & Michael Wasney & Linsin Smith & Xiaotong Sun & Chenwei Tang & Heather Eckart & Ivan P. Moskowitz & Anindita Basu & Xin He & Sebastian Pott, 2023. "Single-cell genomics improves the discovery of risk variants and genes of atrial fibrillation," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    20. Yanyu Xiao & Jingjing Wang & Jiaqi Li & Peijing Zhang & Jingyu Li & Yincong Zhou & Qing Zhou & Ming Chen & Xin Sheng & Zhihong Liu & Xiaoping Han & Guoji Guo, 2023. "An analytical framework for decoding cell type-specific genetic variation of gene regulation," Nature Communications, Nature, vol. 14(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1010299. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.