IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/0040013.html

Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion

Author

Listed:
  • Guo-Cheng Yuan
  • Jun S Liu

Abstract

The regulation of DNA accessibility through nucleosome positioning is important for transcription control. Computational models have been developed to predict genome-wide nucleosome positions from DNA sequences, but these models consider only nucleosome sequences, which may have limited their power. We developed a statistical multi-resolution approach to identify a sequence signature, called the N-score, that distinguishes nucleosome binding DNA from non-nucleosome DNA. This new approach has significantly improved the prediction accuracy. The sequence information is highly predictive for local nucleosome enrichment or depletion, whereas predictions of the exact positions are only modestly more accurate than a null model, suggesting the importance of other regulatory factors in fine-tuning the nucleosome positions. The N-score in promoter regions is negatively correlated with gene expression levels. Regulatory elements are enriched in low N-score regions. While our model is derived from yeast data, the N-score pattern computed from this model agrees well with recent high-resolution protein-binding data in human.Author Summary: A eukaryotic genome is packaged into chromatin. The chromatin not only makes it possible to fit the relatively long genome into a tiny nucleus, but also plays an important regulatory role. The nucleosome is the fundamental repeating unit of chromatin. High-resolution tiling array experiments have shown that many nucleosomes are well-positioned in vivo, consistent with an important regulatory role. However, the mechanisms that determine nucleosome positioning are still poorly understood. We have developed a novel computational method for predicting nucleosome positions using only the genomic sequence information. The method detects periodic sequence signatures that discriminate nucleosome sequences from linker sequences. We show that this approach has significantly improved predictive power compared to previous studies. Interestingly, the most predictable regions tend to be located where stringent regulations are needed, i.e., the neighborhood of a transcription start site. This model predicts that nucleosome occupancy is not strongly controlled by short DNA sequence motifs but rather progressively controlled by regular organization of short elements into periodic patterns. We also provide evidence that sequence specificity for nucleosome binding is conserved from yeast to human.

Suggested Citation

  • Guo-Cheng Yuan & Jun S Liu, 2008. "Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion," PLOS Computational Biology, Public Library of Science, vol. 4(1), pages 1-11, January.
  • Handle: RePEc:plo:pcbi00:0040013
    DOI: 10.1371/journal.pcbi.0040013
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.0040013
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.0040013&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.0040013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Istvan Albert & Travis N. Mavrich & Lynn P. Tomsho & Ji Qi & Sara J. Zanton & Stephan C. Schuster & B. Franklin Pugh, 2007. "Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome," Nature, Nature, vol. 446(7135), pages 572-576, March.
    2. Christopher T. Harbison & D. Benjamin Gordon & Tong Ihn Lee & Nicola J. Rinaldi & Kenzie D. Macisaac & Timothy W. Danford & Nancy M. Hannett & Jean-Bosco Tagne & David B. Reynolds & Jane Yoo & Ezra G., 2004. "Transcriptional regulatory code of a eukaryotic genome," Nature, Nature, vol. 431(7004), pages 99-104, September.
    3. Hongkai Ji & Wing Hung Wong, 2006. "Computational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes," Biometrics, The International Biometric Society, vol. 62(3), pages 645-663, September.
    4. Eran Segal & Yvonne Fondufe-Mittendorf & Lingyi Chen & AnnChristine Thåström & Yair Field & Irene K. Moore & Ji-Ping Z. Wang & Jonathan Widom, 2006. "A genomic code for nucleosome positioning," Nature, Nature, vol. 442(7104), pages 772-778, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wei Chen & Hao Lin & Peng-Mian Feng & Chen Ding & Yong-Chun Zuo & Kuo-Chen Chou, 2012. "iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-9, October.
    2. Iksoo Huh & Isabel Mendizabal & Taesung Park & Soojin V Yi, 2018. "Functional conservation of sequence determinants at rapidly evolving regulatory regions across mammals," PLOS Computational Biology, Public Library of Science, vol. 14(10), pages 1-21, October.
    3. Wolfram Möbius & Ulrich Gerland, 2010. "Quantitative Test of the Barrier Nucleosome Model for Statistical Positioning of Nucleosomes Up- and Downstream of Transcription Start Sites," PLOS Computational Biology, Public Library of Science, vol. 6(8), pages 1-11, August.
    4. Alexander W. Blocker & Edoardo M. Airoldi, 2016. "Template-Based Models for Genome-Wide Analysis of Next-Generation Sequencing Data at Base-Pair Resolution," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 967-987, July.
    5. Moser Carlee & Gupta Mayetri, 2012. "A Generalized Hidden Markov Model for Determining Sequence-based Predictors of Nucleosome Positioning," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-23, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zing Tsung-Yeh Tsai & Shin-Han Shiu & Huai-Kuang Tsai, 2015. "Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-22, August.
    2. Ji-Ping Wang & Yvonne Fondufe-Mittendorf & Liqun Xi & Guei-Feng Tsai & Eran Segal & Jonathan Widom, 2008. "Preferentially Quantized Linker DNA Lengths in Saccharomyces cerevisiae," PLOS Computational Biology, Public Library of Science, vol. 4(9), pages 1-10, September.
    3. Wolfram Möbius & Ulrich Gerland, 2010. "Quantitative Test of the Barrier Nucleosome Model for Statistical Positioning of Nucleosomes Up- and Downstream of Transcription Start Sites," PLOS Computational Biology, Public Library of Science, vol. 6(8), pages 1-11, August.
    4. Leelavati Narlikar & Raluca Gordân & Alexander J Hartemink, 2007. "A Nucleosome-Guided Map of Transcription Factor Binding Sites in Yeast," PLOS Computational Biology, Public Library of Science, vol. 3(11), pages 1-10, November.
    5. Alexander W. Blocker & Edoardo M. Airoldi, 2016. "Template-Based Models for Genome-Wide Analysis of Next-Generation Sequencing Data at Base-Pair Resolution," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 967-987, July.
    6. Fang Liu & Eivind Tøstesen & Jostein K Sundet & Tor-Kristian Jenssen & Christoph Bock & Geir Ivar Jerstad & William G Thilly & Eivind Hovig, 2007. "The Human Genomic Melting Map," PLOS Computational Biology, Public Library of Science, vol. 3(5), pages 1-13, May.
    7. Yue Yuan & Qiang Huo & Ziru Zhang & Qun Wang & Juanxia Wang & Shuaikang Chang & Peng Cai & Karen M. Song & David W. Galbraith & Weixiao Zhang & Long Huang & Rentao Song & Zeyang Ma, 2024. "Decoding the gene regulatory network of endosperm differentiation in maize," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    8. repec:plo:pcbi00:1000010 is not listed on IDEAS
    9. Gong Chen & Qing Zhou, 2010. "Heterogeneity in DNA Multiple Alignments: Modeling, Inference, and Applications in Motif Finding," Biometrics, The International Biometric Society, vol. 66(3), pages 694-704, September.
    10. Matvei Khoroshkin & Andrey Buyan & Martin Dodel & Albertas Navickas & Johnny Yu & Fathima Trejo & Anthony Doty & Rithvik Baratam & Shaopu Zhou & Sean B. Lee & Tanvi Joshi & Kristle Garcia & Benedict C, 2024. "Systematic identification of post-transcriptional regulatory modules," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    11. Jiayi Fan & Andrew T. Moreno & Alexander S. Baier & Joseph J. Loparo & Craig L. Peterson, 2022. "H2A.Z deposition by SWR1C involves multiple ATP-dependent steps," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    12. Gross, Eitan, 2015. "Effect of environmental stress on regulation of gene expression in the yeast," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 430(C), pages 224-235.
    13. Harsh Nagpal & Ahmad Ali-Ahmad & Yasuhiro Hirano & Wei Cai & Mario Halic & Tatsuo Fukagawa & Nikolina Sekulić & Beat Fierz, 2023. "CENP-A and CENP-B collaborate to create an open centromeric chromatin state," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    14. repec:plo:pcbi00:1007496 is not listed on IDEAS
    15. Segal Mark R, 2008. "Re-Cracking the Nucleosome Positioning Code," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-24, April.
    16. Magali Prigent & Hélène Jean-Jacques & Delphine Naquin & Stéphane Chédin & Marie-Hélène Cuif & Renaud Legouis & Laurent Kuras, 2024. "Sulfur starvation-induced autophagy in Saccharomyces cerevisiae involves SAM-dependent signaling and transcription activator Met4," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    17. Adam M. Zahm & William S. Owens & Samuel R. Himes & Braden S. Fallon & Kathleen E. Rondem & Alexa N. Gormick & Joshua S. Bloom & Sriram Kosuri & Henry Chan & Justin G. English, 2024. "A massively parallel reporter assay library to screen short synthetic promoters in mammalian cells," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    18. Timothy E Reddy & Charles DeLisi & Boris E Shakhnovich, 2007. "Binding Site Graphs: A New Graph Theoretical Framework for Prediction of Transcription Factor Binding Sites," PLOS Computational Biology, Public Library of Science, vol. 3(5), pages 1-11, May.
    19. Kroll, K.M. & Ferrantini, A. & Domany, E., 2010. "Introduction to biology and chromosomal instabilities in cancer," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(20), pages 4374-4388.
    20. Moser Carlee & Gupta Mayetri, 2012. "A Generalized Hidden Markov Model for Determining Sequence-based Predictors of Nucleosome Positioning," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-23, January.
    21. Xianfu Yi & Yu-Dong Cai & Zhisong He & WeiRen Cui & Xiangyin Kong, 2010. "Prediction of Nucleosome Positioning Based on Transcription Factor Binding Sites," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-7, September.
    22. Behrouz Eslami-Mossallam & Raoul D Schram & Marco Tompitak & John van Noort & Helmut Schiessel, 2016. "Multiplexing Genetic and Nucleosome Positioning Codes: A Computational Approach," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-14, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:0040013. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.