Author
Listed:
- Aastha Khatiwada
- Ayse Selen Yilmaz
- Bethany J Wolf
- Maciej Pietrzak
- Dongjun Chung
Abstract
Genome-wide association studies (GWAS) have successfully identified over two hundred thousand genotype-trait associations. Yet some challenges remain. First, complex traits are often associated with many single nucleotide polymorphisms (SNPs), most with small or moderate effect sizes, making them difficult to detect. Second, many complex traits share a common genetic basis due to ‘pleiotropy’ and and though few methods consider it, leveraging pleiotropy can improve statistical power to detect genotype-trait associations with weaker effect sizes. Third, currently available statistical methods are limited in explaining the functional mechanisms through which genetic variants are associated with specific or multiple traits. We propose multi-GPA-Tree to address these challenges. The multi-GPA-Tree approach can identify risk SNPs associated with single as well as multiple traits while also identifying the combinations of functional annotations that can explain the mechanisms through which risk-associated SNPs are linked with the traits. First, we implemented simulation studies to evaluate the proposed multi-GPA-Tree method and compared its performance with existing statistical approaches. The results indicate that multi-GPA-Tree outperforms existing statistical approaches in detecting risk-associated SNPs for multiple traits. Second, we applied multi-GPA-Tree to a systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA), and to a Crohn’s disease (CD) and ulcertive colitis (UC) GWAS, and functional annotation data including GenoSkyline and GenoSkylinePlus. Our results demonstrate that multi-GPA-Tree can be a powerful tool that improves association mapping while facilitating understanding of the underlying genetic architecture of complex traits and potential mechanisms linking risk-associated SNPs with complex traits.Author summary: In spite of continued success in developing statistical methodologies that integrate GWAS summary statistics and functional annotation data, existing methods are unable to pinpoint the interactions between functional annotations that influence one or more traits. Hence, the underlying interactions between biological mechanisms linking risk-associated SNPs to traits remain unknown. We propose multi-GPA-Tree to identify risk-associated SNPs and the combinations of functional annotations related to one or more trait risk-associated SNPs. Notably, multi-GPA-Tree requires only GWAS p-value summary statistics, instead of individual level genotype-phenotype data, making it more viable to implement. Compared to the existing state-of-the-art methods, multi-GPA-Tree showed improved performance in simulation studies and validated results for several auto-immune diseases in real data application. These combined results suggest that multi-GPA-Tree is an effective tool for integrative analysis and can potentially be valuable to clinical genomic researchers for hypothesis generation and validation.
Suggested Citation
Aastha Khatiwada & Ayse Selen Yilmaz & Bethany J Wolf & Maciej Pietrzak & Dongjun Chung, 2023.
"multi-GPA-Tree: Statistical approach for pleiotropy informed and functional annotation tree guided prioritization of GWAS results,"
PLOS Computational Biology, Public Library of Science, vol. 19(12), pages 1-27, December.
Handle:
RePEc:plo:pcbi00:1011686
DOI: 10.1371/journal.pcbi.1011686
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1011686. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.