IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1006646.html
   My bibliography  Save this article

Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization

Author

Listed:
  • Xiaoquan Wen
  • Roger Pique-Regi
  • Francesca Luca

Abstract

We propose a novel statistical framework for integrating the result from molecular quantitative trait loci (QTL) mapping into genome-wide genetic association analysis of complex traits, with the primary objectives of quantitatively assessing the enrichment of the molecular QTLs in complex trait-associated genetic variants and the colocalizations of the two types of association signals. We introduce a natural Bayesian hierarchical model that treats the latent association status of molecular QTLs as SNP-level annotations for candidate SNPs of complex traits. We detail a computational procedure to seamlessly perform enrichment, fine-mapping and colocalization analyses, which is a distinct feature compared to the existing colocalization analysis procedures in the literature. The proposed approach is computationally efficient and requires only summary-level statistics. We evaluate and demonstrate the proposed computational approach through extensive simulation studies and analyses of blood lipid data and the whole blood eQTL data from the GTEx project. In addition, a useful utility from our proposed method enables the computation of expected colocalization signals using simple characteristics of the association data. Using this utility, we further illustrate the importance of enrichment analysis on the ability to discover colocalized signals and the potential limitations of currently available molecular QTL data. The software pipeline that implements the proposed computation procedures, enloc, is freely available at https://github.com/xqwen/integrative.Author summary: Genome-wide association studies (GWAS) have been tremendously successful in identifying genetic variants that impact complex diseases. However, the roles of such studies in disease etiology remain poorly understood, primarily because a large proportion of the GWAS findings are located in the non-coding region of the genome. Recent advancements in high-throughput sequencing technology enable the systematic investigation of molecular quantitative trait loci (QTLs), which are genetic variants that directly affect molecular phenotypes (e.g., gene expression, transcription factor binding and DNA methylation). Linking molecular QTLs to GWAS findings intuitively represents an important step for interpreting the biological and clinical relevance of the GWAS results. In this paper, we describe a rigorous and efficient computational approach that assesses the enrichment and overlap between the GWAS findings and molecular QTLs. Importantly, we illustrate that the accurate quantification of overlapping between molecular QTL and GWAS signals requires reliable enrichment estimation. Our proposed approach fully accounts for the intrinsic uncertainty embedded in the association analyses of GWAS and molecular QTL mapping, and it outperforms the existing state-of-the-art approaches. Applying the proposed approach to the GWAS data of blood lipid traits and the whole blood expression QTLs (eQTLs) yields some novel biological insights and also illustrates the potential limitations of the currently available molecular QTL data.

Suggested Citation

  • Xiaoquan Wen & Roger Pique-Regi & Francesca Luca, 2017. "Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization," PLOS Genetics, Public Library of Science, vol. 13(3), pages 1-25, March.
  • Handle: RePEc:plo:pgen00:1006646
    DOI: 10.1371/journal.pgen.1006646
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1006646
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1006646&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1006646?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jacob F. Degner & Athma A. Pai & Roger Pique-Regi & Jean-Baptiste Veyrieras & Daniel J. Gaffney & Joseph K. Pickrell & Sherryl De Leon & Katelyn Michelini & Noah Lewellen & Gregory E. Crawford & Matth, 2012. "DNase I sensitivity QTLs are a major determinant of human expression variation," Nature, Nature, vol. 482(7385), pages 390-394, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alexandra Barry & Michelle T. McNulty & Xiaoyuan Jia & Yask Gupta & Hanna Debiec & Yang Luo & China Nagano & Tomoko Horinouchi & Seulgi Jung & Manuela Colucci & Dina F. Ahram & Adele Mitrotti & Aditi , 2023. "Multi-population genome-wide association study implicates immune and non-immune factors in pediatric steroid-sensitive nephrotic syndrome," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Gianluca Ursini & Pasquale Di Carlo & Sreya Mukherjee & Qiang Chen & Shizhong Han & Jiyoung Kim & Maya Deyssenroth & Carmen J. Marsit & Jia Chen & Ke Hao & Giovanna Punzi & Daniel R. Weinberger, 2023. "Prioritization of potential causative genes for schizophrenia in placenta," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    3. Seong Kyu Han & Michelle T. McNulty & Christopher J. Benway & Pei Wen & Anya Greenberg & Ana C. Onuchic-Whitford & Dongkeun Jang & Jason Flannick & Noël P. Burtt & Parker C. Wilson & Benjamin D. Humph, 2023. "Mapping genomic regulation of kidney disease and traits through high-resolution and interpretable eQTLs," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    4. Yangqing Deng & Wei Pan, 2020. "A powerful and versatile colocalization test," PLOS Computational Biology, Public Library of Science, vol. 16(4), pages 1-18, April.
    5. Anqi Zhu & Nana Matoba & Emma P Wilson & Amanda L Tapia & Yun Li & Joseph G Ibrahim & Jason L Stein & Michael I Love, 2021. "MRLocus: Identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity," PLOS Genetics, Public Library of Science, vol. 17(4), pages 1-24, April.
    6. Xianyong Yin & Lap Sum Chan & Debraj Bose & Anne U. Jackson & Peter VandeHaar & Adam E. Locke & Christian Fuchsberger & Heather M. Stringham & Ryan Welch & Ketian Yu & Lilian Fernandes Silva & Susan K, 2022. "Genome-wide association studies of metabolites in Finnish men identify disease-relevant loci," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    7. Xinyuan Dong & Yu-Ru Su & Richard Barfield & Stephanie A Bien & Qianchuan He & Tabitha A Harrison & Jeroen R Huyghe & Temitope O Keku & Noralane M Lindor & Clemens Schafmayer & Andrew T Chan & Stephen, 2020. "A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study," PLOS Genetics, Public Library of Science, vol. 16(8), pages 1-21, August.
    8. Hyun-Jung Kim & Paul Cheng & Stanislao Travisano & Chad Weldy & João P. Monteiro & Ramendra Kundu & Trieu Nguyen & Disha Sharma & Huitong Shi & Yi Lin & Boxiang Liu & Saptarsi Haldar & Simon Jackson &, 2023. "Molecular mechanisms of coronary artery disease risk at the PDGFD locus," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    9. Andrew R. Hamel & Wenjun Yan & John M. Rouhana & Aboozar Monovarfeshani & Xinyi Jiang & Puja A. Mehta & Jayshree Advani & Yuyang Luo & Qingnan Liang & Skanda Rajasundaram & Arushi Shrivastava & Kather, 2024. "Integrating genetic regulation and single-cell expression with GWAS prioritizes causal genes and cell types for glaucoma," Nature Communications, Nature, vol. 15(1), pages 1-25, December.
    10. Naim Panjwani & Fan Wang & Scott Mastromatteo & Allen Bao & Cheng Wang & Gengming He & Jiafen Gong & Johanna M Rommens & Lei Sun & Lisa J Strug, 2020. "LocusFocus: Web-based colocalization for the annotation and functional follow-up of GWAS," PLOS Computational Biology, Public Library of Science, vol. 16(10), pages 1-8, October.
    11. Jayshree Advani & Puja A. Mehta & Andrew R. Hamel & Sudeep Mehrotra & Christina Kiel & Tobias Strunz & Ximena Corso-Díaz & Madeline Kwicklis & Freekje Asten & Rinki Ratnapriya & Emily Y. Chew & Dena G, 2024. "QTL mapping of human retina DNA methylation identifies 87 gene-epigenome interactions in age-related macular degeneration," Nature Communications, Nature, vol. 15(1), pages 1-20, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wendiao Zhang & Ming Zhang & Zhenhong Xu & Hongye Yan & Huimin Wang & Jiamei Jiang & Juan Wan & Beisha Tang & Chunyu Liu & Chao Chen & Qingtuan Meng, 2023. "Human forebrain organoid-based multi-omics analyses of PCCB as a schizophrenia associated gene linked to GABAergic pathways," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Iksoo Huh & Isabel Mendizabal & Taesung Park & Soojin V Yi, 2018. "Functional conservation of sequence determinants at rapidly evolving regulatory regions across mammals," PLOS Computational Biology, Public Library of Science, vol. 14(10), pages 1-21, October.
    3. Mary F Feitosa & Aldi T Kraja & Daniel I Chasman & Yun J Sung & Thomas W Winkler & Ioanna Ntalla & Xiuqing Guo & Nora Franceschini & Ching-Yu Cheng & Xueling Sim & Dina Vojinovic & Jonathan Marten & S, 2018. "Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-36, June.
    4. Schraiber, Joshua G. & Landis, Michael J., 2015. "Sensitivity of quantitative traits to mutational effects and number of loci," Theoretical Population Biology, Elsevier, vol. 102(C), pages 85-93.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1006646. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.