IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1010172.html
   My bibliography  Save this article

sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics

Author

Listed:
  • Nadezhda M Belonogova
  • Gulnara R Svishcheva
  • Anatoly V Kirichenko
  • Irina V Zorkoltseva
  • Yakov A Tsepilov
  • Tatiana I Axenovich

Abstract

Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.Author summary: Gene-based association analysis is an effective gene mapping tool. Quite a few frameworks have been proposed recently for gene-based association analysis using a combination of different methods. However, all of these frameworks have at least one of the disadvantages: they use a fixed set of methods, they cannot use functional annotations, or they use individual phenotypes and genotypes as input data. To overcome these limitations, we propose sumSTAAR, a framework for gene-based association analysis using GWAS summary statistics. Our framework allows the user to arbitrarily define a set of the methods and functional annotations. Moreover, we adopted the methods for the analysis of genes with a large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes, which now allows to include ultra-rare variants (MAF

Suggested Citation

  • Nadezhda M Belonogova & Gulnara R Svishcheva & Anatoly V Kirichenko & Irina V Zorkoltseva & Yakov A Tsepilov & Tatiana I Axenovich, 2022. "sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics," PLOS Computational Biology, Public Library of Science, vol. 18(6), pages 1-12, June.
  • Handle: RePEc:plo:pcbi00:1010172
    DOI: 10.1371/journal.pcbi.1010172
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1010172
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010172&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1010172?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Corbin Quick & Xiaoquan Wen & Gonçalo Abecasis & Michael Boehnke & Hyun Min Kang, 2020. "Integrating comprehensive functional annotations to boost power and accuracy in gene-based association analysis," PLOS Genetics, Public Library of Science, vol. 16(12), pages 1-23, December.
    2. repec:plo:pmed00:1001779 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tiffany Eulalio & Min Woo Sun & Olivier Gevaert & Michael D. Greicius & Thomas J. Montine & Daniel Nachun & Stephen B. Montgomery, 2025. "regionalpcs improve discovery of DNA methylation associations with complex traits," Nature Communications, Nature, vol. 16(1), pages 1-19, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1010172. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.