IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1001276.html
   My bibliography  Save this article

Joint Genetic Analysis of Gene Expression Data with Inferred Cellular Phenotypes

Author

Listed:
  • Leopold Parts
  • Oliver Stegle
  • John Winn
  • Richard Durbin

Abstract

Even within a defined cell type, the expression level of a gene differs in individual samples. The effects of genotype, measured factors such as environmental conditions, and their interactions have been explored in recent studies. Methods have also been developed to identify unmeasured intermediate factors that coherently influence transcript levels of multiple genes. Here, we show how to bring these two approaches together and analyse genetic effects in the context of inferred determinants of gene expression. We use a sparse factor analysis model to infer hidden factors, which we treat as intermediate cellular phenotypes that in turn affect gene expression in a yeast dataset. We find that the inferred phenotypes are associated with locus genotypes and environmental conditions and can explain genetic associations to genes in trans. For the first time, we consider and find interactions between genotype and intermediate phenotypes inferred from gene expression levels, complementing and extending established results.Author Summary: The first step in transmitting heritable information, expressing RNA molecules, is highly regulated and depends on activations of specific pathways and regulatory factors. The state of the cell is hard to measure, making it difficult to understand what drives the changes in the gene expression. To close this gap, we apply a statistical model to infer the state of the cell, such as activations of transcription factors and molecular pathways, from gene expression data. We demonstrate how the inferred state helps to explain the effects of variation in the DNA and environment on the expression trait via both direct regulatory effects and interactions with the genetic state. Such analysis, exploiting inferred intermediate phenotypes, will aid understanding effects of genetic variability on global traits and will help to interpret the data from existing and forthcoming large scale studies.

Suggested Citation

  • Leopold Parts & Oliver Stegle & John Winn & Richard Durbin, 2011. "Joint Genetic Analysis of Gene Expression Data with Inferred Cellular Phenotypes," PLOS Genetics, Public Library of Science, vol. 7(1), pages 1-10, January.
  • Handle: RePEc:plo:pgen00:1001276
    DOI: 10.1371/journal.pgen.1001276
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1001276
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1001276&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1001276?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Michael Morley & Cliona M. Molony & Teresa M. Weber & James L. Devlin & Kathryn G. Ewens & Richard S. Spielman & Vivian G. Cheung, 2004. "Genetic analysis of genome-wide variation in human gene expression," Nature, Nature, vol. 430(7001), pages 743-747, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Seong Kyu Han & Michelle T. McNulty & Christopher J. Benway & Pei Wen & Anya Greenberg & Ana C. Onuchic-Whitford & Dongkeun Jang & Jason Flannick & Noël P. Burtt & Parker C. Wilson & Benjamin D. Humph, 2023. "Mapping genomic regulation of kidney disease and traits through high-resolution and interpretable eQTLs," Nature Communications, Nature, vol. 14(1), pages 1-16, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Julia Schröder & Vitalia Schüller & Andrea May & Christian Gerges & Mario Anders & Jessica Becker & Timo Hess & Nicole Kreuser & René Thieme & Kerstin U Ludwig & Tania Noder & Marino Venerito & Lothar, 2019. "Identification of loci of functional relevance to Barrett’s esophagus and esophageal adenocarcinoma: Cross-referencing of expression quantitative trait loci data from disease-relevant tissues with gen," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-12, December.
    2. Bo Jiang & Jun S. Liu, 2015. "Bayesian Partition Models for Identifying Expression Quantitative Trait Loci," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1350-1361, December.
    3. Yixin Fang & Yang Feng & Ming Yuan, 2014. "Regularized principal components of heritability," Computational Statistics, Springer, vol. 29(3), pages 455-465, June.
    4. Witten Daniela M & Tibshirani Robert J., 2009. "Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-27, June.
    5. Lingxue Zhang & Seyoung Kim, 2014. "Learning Gene Networks under SNP Perturbations Using eQTL Datasets," PLOS Computational Biology, Public Library of Science, vol. 10(2), pages 1-20, February.
    6. Cipolli III, William & Hanson, Timothy & McLain, Alexander C., 2016. "Bayesian nonparametric multiple testing," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 64-79.
    7. Barbara E Stranger & Stephen B Montgomery & Antigone S Dimas & Leopold Parts & Oliver Stegle & Catherine E Ingle & Magda Sekowska & George Davey Smith & David Evans & Maria Gutierrez-Arcelus & Alkes P, 2012. "Patterns of Cis Regulatory Variation in Diverse Human Populations," PLOS Genetics, Public Library of Science, vol. 8(4), pages 1-13, April.
    8. Eric R Gamazon & Hae-Kyung Im & Shiwei Duan & Yves A Lussier & Nancy J Cox & M Eileen Dolan & Wei Zhang, 2010. "ExprTarget: An Integrative Approach to Predicting Human MicroRNA Targets," PLOS ONE, Public Library of Science, vol. 5(10), pages 1-8, October.
    9. Ryan Abo & Gregory D Jenkins & Liewei Wang & Brooke L Fridley, 2012. "Identifying the Genetic Variation of Gene Expression Using Gene Sets: Application of Novel Gene Set eQTL Approach to PharmGKB and KEGG," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-11, August.
    10. Mitsutaka Kadota & Howard H Yang & Nan Hu & Chaoyu Wang & Ying Hu & Philip R Taylor & Kenneth H Buetow & Maxwell P Lee, 2007. "Allele-Specific Chromatin Immunoprecipitation Studies Show Genetic Influence on Chromatin State in Human Genome," PLOS Genetics, Public Library of Science, vol. 3(5), pages 1-11, May.
    11. Oualkacha Karim & Labbe Aurelie & Ciampi Antonio & Roy Marc-Andre & Maziade Michel, 2012. "Principal Components of Heritability for High Dimension Quantitative Traits and General Pedigrees," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-27, January.
    12. Enrico Petretto & Leonardo Bottolo & Sarah R Langley & Matthias Heinig & Chris McDermott-Roe & Rizwan Sarwar & Michal Pravenec & Norbert Hübner & Timothy J Aitman & Stuart A Cook & Sylvia Richardson, 2010. "New Insights into the Genetic Control of Gene Expression using a Bayesian Multi-tissue Approach," PLOS Computational Biology, Public Library of Science, vol. 6(4), pages 1-13, April.
    13. Bergersen Linn Cecilie & Glad Ingrid K. & Lyng Heidi, 2011. "Weighted Lasso with Data Integration," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-29, August.
    14. Jin Hyun Ju & Sushila A Shenoy & Ronald G Crystal & Jason G Mezey, 2017. "An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci," PLOS Computational Biology, Public Library of Science, vol. 13(5), pages 1-26, May.
    15. Parkhomenko Elena & Tritchler David & Beyene Joseph, 2009. "Sparse Canonical Correlation Analysis with Application to Genomic Data Integration," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-34, January.
    16. Ning Jiang & Minghui Wang & Tianye Jia & Lin Wang & Lindsey Leach & Christine Hackett & David Marshall & Zewei Luo, 2011. "A Robust Statistical Method for Association-Based eQTL Analysis," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-11, August.
    17. Paul C Boutros & Ivy D Moffat & Allan B Okey & Raimo Pohjanvirta, 2011. "mRNA Levels in Control Rat Liver Display Strain-Specific, Hereditary, and AHR-Dependent Components," PLOS ONE, Public Library of Science, vol. 6(7), pages 1-15, July.
    18. Hui-Min Wang & Ching-Lin Hsiao & Ai-Ru Hsieh & Ying-Chao Lin & Cathy S J Fann, 2012. "Constructing Endophenotypes of Complex Diseases Using Non-Negative Matrix Factorization and Adjusted Rand Index," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-12, July.
    19. Xiaohong Li & Steven G Self & Patricia C Galipeau & Thomas G Paulson & Brian J Reid, 2007. "Direct Inference of SNP Heterozygosity Rates and Resolution of LOH Detection," PLOS Computational Biology, Public Library of Science, vol. 3(11), pages 1-10, November.
    20. Urmo Võsa & Tõnu Esko & Silva Kasela & Tarmo Annilo, 2015. "Altered Gene Expression Associated with microRNA Binding Site Polymorphisms," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-24, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1001276. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.