
Genotype-Based Test in Mapping Cis-Regulatory Variants from Allele-Specific Expression Data

Authors
  • Jean Francois Lefebvre
  • Emilio Vello
  • Bing Ge
  • Stephen B Montgomery
  • Emmanouil T Dermitzakis
  • Tomi Pastinen
  • Damian Labuda

Abstract

Identifying and understanding the impact of gene regulatory variation is of considerable importance in evolutionary and medical genetics; such variants are thought to be responsible for human-specific adaptation [1] and to have an important role in genetic disease. Regulatory variation in cis is readily detected in individuals showing uneven expression of a transcript from its two allelic copies, an observation referred to as allelic imbalance (AI). Identifying individuals exhibiting AI allows regulatory DNA regions to be mapped and, potentially, the underlying causal genetic variant(s) to be identified. However, existing mapping methods require knowledge of the haplotypes, which makes them sensitive to phasing errors. In this study, we introduce a genotype-based mapping test that does not require haplotype-phase inference to locate regulatory regions. The test relies on partitioning the genotypes of individuals exhibiting AI and of those not exhibiting AI in a 2×3 contingency table. The performance of this test in detecting linkage disequilibrium (LD) between a potential regulatory site and a SNP located in this region was examined by analyzing simulated and empirical AI datasets. In simulation experiments, the genotype-based test outperforms the haplotype-based tests as the distance separating the regulatory region from its regulated transcript increases. The genotype-based test performed equally well with the experimental AI datasets, whether from genome-wide cDNA hybridization arrays or from RNA sequencing. By avoiding the need for haplotype inference, the genotype-based test will suit AI analyses in population samples of unknown haplotype structure and will additionally facilitate the identification of cis-regulatory variants that are located far away from the regulated transcript.
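
As an illustration of the 2×3 partition described in the abstract, the Python sketch below tabulates the genotypes at one SNP for individuals with and without AI and tests the table for association. It is a minimal sketch, not the authors' implementation: the genotype labels `AA`, `AB`, `BB`, the function names `build_table` and `chi_square_2x3`, the toy data, and the choice of Pearson's chi-square statistic are all assumptions made for this example, since the abstract does not say which statistic the paper applies to the table.

```python
import math
from collections import Counter

# Assumed genotype coding for a biallelic SNP (not taken from the paper).
GENOTYPES = ("AA", "AB", "BB")


def build_table(genotypes, has_ai):
    """Count genotypes into a 2x3 table.

    Row 0 holds individuals exhibiting AI, row 1 those without AI;
    columns follow the order in GENOTYPES.
    """
    counts = Counter(zip(has_ai, genotypes))
    return [[counts[(status, g)] for g in GENOTYPES] for status in (True, False)]


def chi_square_2x3(table):
    """Pearson chi-square test of independence on a 2x3 table.

    Returns (statistic, p_value). Requires every row and column total to be
    non-zero, so that the test has (2-1)*(3-1) = 2 degrees of freedom.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    if 0 in row_totals or 0 in col_totals:
        raise ValueError("every row and column needs at least one individual")
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # For 2 degrees of freedom the chi-square survival function is exp(-x/2).
    p_value = math.exp(-stat / 2)
    return stat, p_value


# Toy data (invented): 10 individuals with AI, all heterozygous at the SNP,
# and 10 without AI, all homozygous.
genotypes = ["AB"] * 10 + ["AA"] * 5 + ["BB"] * 5
has_ai = [True] * 10 + [False] * 10

table = build_table(genotypes, has_ai)
stat, p_value = chi_square_2x3(table)
print(table)                                    # [[0, 10, 0], [5, 0, 5]]
print(f"chi2 = {stat:.1f}, p = {p_value:.2e}")  # chi2 = 20.0, p = 4.54e-05
```

Only unphased genotypes enter the table, which is why no haplotype inference is needed. With counts as small as in this toy example, an exact test would usually be preferable to the chi-square approximation.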

Suggested Citation

  • Jean Francois Lefebvre & Emilio Vello & Bing Ge & Stephen B Montgomery & Emmanouil T Dermitzakis & Tomi Pastinen & Damian Labuda, 2012. "Genotype-Based Test in Mapping Cis-Regulatory Variants from Allele-Specific Expression Data," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-15, June.
  • Handle: RePEc:plo:pone00:0038667
    DOI: 10.1371/journal.pone.0038667

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0038667
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0038667&type=printable
    Download Restriction: no


    References listed on IDEAS

    1. Joseph K. Pickrell & John C. Marioni & Athma A. Pai & Jacob F. Degner & Barbara E. Engelhardt & Everlyne Nkadori & Jean-Baptiste Veyrieras & Matthew Stephens & Yoav Gilad & Jonathan K. Pritchard, 2010. "Understanding mechanisms underlying human gene expression variation with RNA sequencing," Nature, Nature, vol. 464(7289), pages 768-772, April.
    2. Stephen B. Montgomery & Micha Sammeth & Maria Gutierrez-Arcelus & Radoslaw P. Lach & Catherine Ingle & James Nisbett & Roderic Guigo & Emmanouil T. Dermitzakis, 2010. "Transcriptome genetics using second generation sequencing in a Caucasian population," Nature, Nature, vol. 464(7289), pages 773-777, April.
    3. Noam Kaplan & Irene K. Moore & Yvonne Fondufe-Mittendorf & Andrea J. Gossett & Desiree Tillo & Yair Field & Emily M. LeProust & Timothy R. Hughes & Jason D. Lieb & Jonathan Widom & Eran Segal, 2009. "The DNA-encoded nucleosome organization of a eukaryotic genome," Nature, Nature, vol. 458(7236), pages 362-366, March.
    4. James R Wagner & Bing Ge & Dmitry Pokholok & Kevin L Gunderson & Tomi Pastinen & Mathieu Blanchette, 2010. "Computational Analysis of Whole-Genome Differential Allelic Expression Data in Human," PLOS Computational Biology, Public Library of Science, vol. 6(7), pages 1-12, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Faisal Shahla & Tutz Gerhard, 2017. "Missing value imputation for gene expression data by tailored nearest neighbors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(2), pages 95-106, April.
    2. Thanh Nguyen & Asim Bhatti & Samuel Yang & Saeid Nahavandi, 2016. "RNA-Seq Count Data Modelling by Grey Relational Analysis and Nonparametric Gaussian Process," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-18, October.
    3. Kensuke Yamaguchi & Kazuyoshi Ishigaki & Akari Suzuki & Yumi Tsuchida & Haruka Tsuchiya & Shuji Sumitomo & Yasuo Nagafuchi & Fuyuki Miya & Tatsuhiko Tsunoda & Hirofumi Shoda & Keishi Fujio & Kazuhiko , 2022. "Splicing QTL analysis focusing on coding sequences reveals mechanisms for disease susceptibility loci," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    4. Alexandra C Nica & Leopold Parts & Daniel Glass & James Nisbet & Amy Barrett & Magdalena Sekowska & Mary Travers & Simon Potter & Elin Grundberg & Kerrin Small & Åsa K Hedman & Veronique Bataille & Jo, 2011. "The Architecture of Gene Regulatory Variation across Multiple Human Tissues: The MuTHER Study," PLOS Genetics, Public Library of Science, vol. 7(2), pages 1-9, February.
    5. Daria V Zhernakova & Eleonora de Klerk & Harm-Jan Westra & Anastasios Mastrokolias & Shoaib Amini & Yavuz Ariyurek & Rick Jansen & Brenda W Penninx & Jouke J Hottenga & Gonneke Willemsen & Eco J de Ge, 2013. "DeepSAGE Reveals Genetic Variants Associated with Alternative Polyadenylation and Expression of Coding and Non-coding Transcripts," PLOS Genetics, Public Library of Science, vol. 9(6), pages 1-15, June.
    6. Sora Yoon & Seon-Young Kim & Dougu Nam, 2016. "Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-16, November.
    7. Kate E. Stanley & Tatjana Jatsenko & Stefania Tuveri & Dhanya Sudhakaran & Lore Lannoo & Kristel Calsteren & Marie Borre & Ilse Parijs & Leen Coillie & Kris Bogaert & Rodrigo Almeida Toledo & Liesbeth, 2024. "Cell type signatures in cell-free DNA fragmentation profiles reveal disease biology," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    8. Wolfram Möbius & Ulrich Gerland, 2010. "Quantitative Test of the Barrier Nucleosome Model for Statistical Positioning of Nucleosomes Up- and Downstream of Transcription Start Sites," PLOS Computational Biology, Public Library of Science, vol. 6(8), pages 1-11, August.
    9. Pingting Ying & Can Chen & Zequn Lu & Shuoni Chen & Ming Zhang & Yimin Cai & Fuwei Zhang & Jinyu Huang & Linyun Fan & Caibo Ning & Yanmin Li & Wenzhuo Wang & Hui Geng & Yizhuo Liu & Wen Tian & Zhiyong, 2023. "Genome-wide enhancer-gene regulatory maps link causal variants to target genes underlying human cancer risk," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    10. Kyung-Won Hong & Seok Won Jeong & Myungguen Chung & Seong Beom Cho, 2014. "Association between Expression Quantitative Trait Loci and Metabolic Traits in Two Korean Populations," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-13, December.
    11. Barbara E Stranger & Stephen B Montgomery & Antigone S Dimas & Leopold Parts & Oliver Stegle & Catherine E Ingle & Magda Sekowska & George Davey Smith & David Evans & Maria Gutierrez-Arcelus & Alkes P, 2012. "Patterns of Cis Regulatory Variation in Diverse Human Populations," PLOS Genetics, Public Library of Science, vol. 8(4), pages 1-13, April.
    12. Xiaodong Cai & Juan Andrés Bazerque & Georgios B Giannakis, 2013. "Inference of Gene Regulatory Networks with Sparse Structural Equation Models Exploiting Genetic Perturbations," PLOS Computational Biology, Public Library of Science, vol. 9(5), pages 1-13, May.
    13. Meijiang Gao & Marina Veil & Marcus Rosenblatt & Aileen Julia Riesle & Anna Gebhard & Helge Hass & Lenka Buryanova & Lev Y. Yampolsky & Björn Grüning & Sergey V. Ulianov & Jens Timmer & Daria Onichtch, 2022. "Pluripotency factors determine gene expression repertoire at zygotic genome activation," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    14. Nicoló Fusi & Oliver Stegle & Neil D Lawrence, 2012. "Joint Modelling of Confounding Factors and Prominent Genetic Regulators Provides Increased Accuracy in Genetical Genomics Studies," PLOS Computational Biology, Public Library of Science, vol. 8(1), pages 1-9, January.
    15. Bin Wang, 2020. "A Zipf-plot based normalization method for high-throughput RNA-seq data," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    16. Jin Hyun Ju & Sushila A Shenoy & Ronald G Crystal & Jason G Mezey, 2017. "An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci," PLOS Computational Biology, Public Library of Science, vol. 13(5), pages 1-26, May.
    17. Jungsoo Gim & Sungho Won & Taesung Park, 2016. "LPEseq: Local-Pooled-Error Test for RNA Sequencing Experiments with a Small Number of Replicates," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-15, August.
    18. Aileen Julia Riesle & Meijiang Gao & Marcus Rosenblatt & Jacques Hermes & Helge Hass & Anna Gebhard & Marina Veil & Björn Grüning & Jens Timmer & Daria Onichtchouk, 2023. "Activator-blocker model of transcriptional regulation by pioneer-like factors," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    19. Eric O Johnson & Dana B Hancock & Nathan C Gaddis & Joshua L Levy & Grier Page & Scott P Novak & Cristie Glasheen & Nancy L Saccone & John P Rice & Michael P Moreau & Kimberly F Doheny & Jane M Romm &, 2015. "Novel Genetic Locus Implicated for HIV-1 Acquisition with Putative Regulatory Links to HIV Replication and Infectivity: A Genome-Wide Association Study," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-15, March.
    20. Tang Clara S. & Ferreira Manuel A. R., 2012. "GENOVA: Gene Overlap Analysis of GWAS Results," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-15, February.
