
LMethyR-SVM: Predict Human Enhancers Using Low Methylated Regions based on Weighted Support Vector Machines

Authors

  • Jingting Xu
  • Hong Hu
  • Yang Dai

Abstract

Background: The identification of enhancers is a challenging task. Various types of epigenetic information, including histone modifications, have been used to construct enhancer prediction models based on a diverse panel of machine learning schemes. However, DNA methylation profiles generated by whole-genome bisulfite sequencing (WGBS) have not been fully explored for enhancer prediction, even though low methylated regions (LMRs) have been implicated as distal active regulatory regions.

Method: In this work, we propose a prediction framework, LMethyR-SVM, which uses LMRs identified from cell-type-specific WGBS DNA methylation profiles together with a weighted support vector machine. In LMethyR-SVM, the set of cell-type-specific LMRs is further divided into three sets (reliable positive, likely positive and likely negative) according to their resemblance to a small set of experimentally validated enhancers in the VISTA database, as measured by an estimated non-parametric density distribution. The prediction model is then obtained by solving a weighted support vector machine.

Results: We demonstrate the performance of LMethyR-SVM using WGBS DNA methylation profiles derived from the human embryonic stem cell type (H1) and the fetal lung fibroblast cell type (IMR90). The predicted enhancers are highly conserved, with a reasonable validation rate based on a set of commonly used positive markers including transcription factors, p300 binding and DNase-I hypersensitive sites. In addition, we show evidence that a large fraction of the LMethyR-SVM predicted enhancers are not predicted by ChromHMM in the H1 cell type and that they are more enriched for the FANTOM5 enhancers.

Conclusion: Our work suggests that low methylated regions detected from WGBS data are useful as complementary resources to histone modification marks in developing models for the prediction of cell-type-specific enhancers.
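
The two steps named in the Method (a density-based three-way split of candidate regions, followed by a weighted support vector machine) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the features, the density estimator, the thresholds or the weights, so everything below is an assumption made for illustration. The sketch assumes a Gaussian kernel density score against the validated set, hypothetical cut-offs `hi` and `lo`, hypothetical per-sample weights, and a linear weighted SVM that minimises 0.5*||w||^2 + C * sum_i c_i * max(0, 1 - y_i (w.x_i + b)) by plain subgradient descent.

```python
import math


def kde_score(x, refs, bandwidth=1.0):
    """Average Gaussian kernel between x and each reference point
    (normalising constant dropped, so this is a relative score)."""
    total = 0.0
    for r in refs:
        d2 = sum((a - b) ** 2 for a, b in zip(x, r))
        total += math.exp(-d2 / (2.0 * bandwidth ** 2))
    return total / len(refs)


def partition(candidates, refs, hi=0.5, lo=0.1, bandwidth=1.0):
    """Split candidates into three sets by their density score.
    The thresholds hi and lo are hypothetical, not taken from the paper."""
    groups = {"reliable_pos": [], "likely_pos": [], "likely_neg": []}
    for x in candidates:
        s = kde_score(x, refs, bandwidth)
        if s >= hi:
            groups["reliable_pos"].append(x)
        elif s >= lo:
            groups["likely_pos"].append(x)
        else:
            groups["likely_neg"].append(x)
    return groups


def train_weighted_svm(X, y, sample_weight, C=1.0, epochs=200, lr=0.01):
    """Linear SVM with per-sample weights c_i on the hinge loss,
    trained by full-batch subgradient descent."""
    dim = len(X[0])
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        grad_w = list(w)  # gradient of the 0.5*||w||^2 regulariser
        grad_b = 0.0
        for xi, yi, ci in zip(X, y, sample_weight):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1.0:  # sample violates the margin: hinge is active
                for j in range(dim):
                    grad_w[j] -= C * ci * yi * xi[j]
                grad_b -= C * ci * yi
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    """Return +1 (enhancer-like) or -1 from the sign of the decision value."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Toy 2-D feature vectors standing in for validated enhancers and LMRs.
validated = [(2.0, 2.0), (2.2, 1.8)]
lmrs = [(2.1, 1.9), (1.9, 2.1), (1.0, 1.0), (0.8, 1.2),
        (-2.0, -2.0), (-1.8, -2.2), (-2.1, -1.9)]
groups = partition(lmrs, validated)

# Hypothetical scheme: trusted labels get full weight, uncertain ones half.
label_and_weight = {"reliable_pos": (1, 1.0),
                    "likely_pos": (1, 0.5),
                    "likely_neg": (-1, 0.5)}
X, y, c = [], [], []
for name, points in groups.items():
    label, weight = label_and_weight[name]
    for p in points:
        X.append(p)
        y.append(label)
        c.append(weight)

w, b = train_weighted_svm(X, y, c)
```

The design point the sketch tries to capture is that a margin violation on an uncertain label costs less than one on a trusted label, so the decision boundary is shaped mainly by the regions that most resemble validated enhancers. In practice a library SVM that accepts per-sample weights would replace the hand-written trainer.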

Suggested Citation

  • Jingting Xu & Hong Hu & Yang Dai, 2016. "LMethyR-SVM: Predict Human Enhancers Using Low Methylated Regions based on Weighted Support Vector Machines," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-18, September.
  • Handle: RePEc:plo:pone00:0163491
    DOI: 10.1371/journal.pone.0163491

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0163491
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0163491&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Claire Vinel & Gabriel Rosser & Loredana Guglielmi & Myrianni Constantinou & Nicola Pomella & Xinyu Zhang & James R. Boot & Tania A. Jones & Thomas O. Millner & Anaelle A. Dumas & Vardhman Rakyan & Je, 2021. "Comparative epigenetic analysis of tumour initiating cells and syngeneic EPSC-derived neural stem cells in glioblastoma," Nature Communications, Nature, vol. 12(1), pages 1-20, December.
    2. Xinhao Hou & Mingjing Xu & Chengming Zhu & Jianing Gao & Meili Li & Xiangyang Chen & Cheng Sun & Björn Nashan & Jianye Zang & Ying Zhou & Shouhong Guang & Xuezhu Feng, 2023. "Systematic characterization of chromodomain proteins reveals an H3K9me1/2 reader regulating aging in C. elegans," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    3. Yayoi Natsume-Kitatani & Hiroshi Mamitsuka, 2016. "Classification of Promoters Based on the Combination of Core Promoter Elements Exhibits Different Histone Modification Patterns," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-18, March.
    4. Anyou Wang & Ying Du & Qianchuan He & Chunxiao Zhou, 2013. "A Quantitative System for Discriminating Induced Pluripotent Stem Cells, Embryonic Stem Cells and Somatic Cells," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-10, February.
    5. Dafne Ibarra-Morales & Michael Rauer & Piergiuseppe Quarato & Leily Rabbani & Fides Zenk & Mariana Schulte-Sasse & Francesco Cardamone & Alejandro Gomez-Auli & Germano Cecere & Nicola Iovino, 2021. "Histone variant H2A.Z regulates zygotic genome activation," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    6. Stefano Tonellato, 2019. "Bayesian nonparametric clustering as a community detection problem," Working Papers 2019: 20, Department of Economics, University of Venice "Ca' Foscari".
    7. Patricia Gerdes & Sue Mei Lim & Adam D. Ewing & Michael R. Larcombe & Dorothy Chan & Francisco J. Sanchez-Luque & Lucinda Walker & Alexander L. Carleton & Cini James & Anja S. Knaupp & Patricia E. Car, 2022. "Retrotransposon instability dominates the acquired mutation landscape of mouse induced pluripotent stem cells," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    8. Dahong Chen & Catherine E. McManus & Behram Radmanesh & Leah H. Matzat & Elissa P. Lei, 2021. "Temporal inhibition of chromatin looping and enhancer accessibility during neuronal remodeling," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    9. Lakhal-Chaieb Lajmi & Greenwood Celia M.T. & Ouhourane Mohamed & Zhao Kaiqiong & Abdous Belkacem & Oualkacha Karim, 2017. "A smoothed EM-algorithm for DNA methylation profiles from sequencing-based methods in cell lines or for a single cell type," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(5-6), pages 333-347, December.
    10. Marko Dunjić & Felix Jonas & Gilad Yaakov & Roye More & Yoav Mayshar & Yoach Rais & Ayelet-Hashahar Orenbuch & Saifeng Cheng & Naama Barkai & Yonatan Stelzer, 2023. "Histone exchange sensors reveal variant specific dynamics in mouse embryonic stem cells," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    11. Shijia Zhu & Guohua Wang & Bo Liu & Yadong Wang, 2013. "Modeling Exon Expression Using Histone Modifications," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-15, June.
    12. Graeme J. Thorn & Christopher T. Clarkson & Anne Rademacher & Hulkar Mamayusupova & Gunnar Schotta & Karsten Rippe & Vladimir B. Teif, 2022. "DNA sequence-dependent formation of heterochromatin nanodomains," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    13. Yanting Luo & Jianlin He & Xiguang Xu & Ming-an Sun & Xiaowei Wu & Xuemei Lu & Hehuang Xie, 2018. "Integrative single-cell omics analyses reveal epigenetic heterogeneity in mouse embryonic stem cells," PLOS Computational Biology, Public Library of Science, vol. 14(3), pages 1-21, March.
    14. Adelchi Azzalini & Giovanna Menardi, 2016. "Density-based clustering with non-continuous data," Computational Statistics, Springer, vol. 31(2), pages 771-798, June.
    15. Nina Schmolka & Ino D. Karemaker & Richard Cardoso da Silva & Davide C. Recchia & Vincent Spegg & Jahnavi Bhaskaran & Michael Teske & Nathalie P. Wagenaar & Matthias Altmeyer & Tuncay Baubec, 2023. "Dissecting the roles of MBD2 isoforms and domains in regulating NuRD complex function during cellular differentiation," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    16. Hao Wu & Hongkai Ji, 2014. "PolyaPeak: Detecting Transcription Factor Binding Sites from ChIP-seq Using Peak Shape Information," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-9, March.
    17. Anne Senabouth & Maciej Daniszewski & Grace E. Lidgerwood & Helena H. Liang & Damián Hernández & Mehdi Mirzaei & Stacey N. Keenan & Ran Zhang & Xikun Han & Drew Neavin & Louise Rooney & Maria Isabel G, 2022. "Transcriptomic and proteomic retinal pigment epithelium signatures of age-related macular degeneration," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    18. Abby Flynt & Nema Dean, 2016. "A Survey of Popular R Packages for Cluster Analysis," Journal of Educational and Behavioral Statistics, vol. 41(2), pages 205-225, April.
    19. van Iterson Maarten & Duijkers Floor A.M. & Meijerink Jules P.P. & Admiraal Pieter & van Ommen Gert-Jan B. & Boer Judith M. & van Noesel Max M. & Menezes Renee X., 2012. "A Novel and Fast Normalization Method for High-Density Arrays," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-31, July.
    20. Sun Shuying & Yu Xiaoqing, 2016. "HMM-Fisher: identifying differential methylation using a hidden Markov model and Fisher’s exact test," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(1), pages 55-67, March.
