IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0020059.html
   My bibliography  Save this article

A Linear Model for Transcription Factor Binding Affinity Prediction in Protein Binding Microarrays

Author

Listed:
  • Matti Annala
  • Kirsti Laurila
  • Harri Lähdesmäki
  • Matti Nykter

Abstract

Protein binding microarrays (PBM) are a high throughput technology used to characterize protein-DNA binding. The arrays measure a protein's affinity toward thousands of double-stranded DNA sequences at once, producing a comprehensive binding specificity catalog. We present a linear model for predicting the binding affinity of a protein toward DNA sequences based on PBM data. Our model represents the measured intensity of an individual probe as a sum of the binding affinity contributions of the probe's subsequences. These subsequences characterize a DNA binding motif and can be used to predict the intensity of protein binding against arbitrary DNA sequences. Our method was the best performer in the Dialogue for Reverse Engineering Assessments and Methods 5 (DREAM5) transcription factor/DNA motif recognition challenge. For the DREAM5 bonus challenge, we also developed an approach for the identification of transcription factors based on their PBM binding profiles. Our approach for TF identification achieved the best performance in the bonus challenge.

Suggested Citation

  • Matti Annala & Kirsti Laurila & Harri Lähdesmäki & Matti Nykter, 2011. "A Linear Model for Transcription Factor Binding Affinity Prediction in Protein Binding Microarrays," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-13, May.
  • Handle: RePEc:plo:pone00:0020059
    DOI: 10.1371/journal.pone.0020059
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0020059
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0020059&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0020059?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Harri Lähdesmäki & Alistair G Rust & Ilya Shmulevich, 2008. "Probabilistic Inference of Transcription Factor Binding from Multiple Data Sources," PLOS ONE, Public Library of Science, vol. 3(3), pages 1-24, March.
    2. Phaedra Agius & Aaron Arvey & William Chang & William Stafford Noble & Christina Leslie, 2010. "High Resolution Models of Transcription Factor-DNA Affinities Improve In Vitro and In Vivo Binding Predictions," PLOS Computational Biology, Public Library of Science, vol. 6(9), pages 1-12, September.
    3. Eran Segal & Yvonne Fondufe-Mittendorf & Lingyi Chen & AnnChristine Thåström & Yair Field & Irene K. Moore & Ji-Ping Z. Wang & Jonathan Widom, 2006. "A genomic code for nucleosome positioning," Nature, Nature, vol. 442(7104), pages 772-778, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ji-Ping Wang & Yvonne Fondufe-Mittendorf & Liqun Xi & Guei-Feng Tsai & Eran Segal & Jonathan Widom, 2008. "Preferentially Quantized Linker DNA Lengths in Saccharomyces cerevisiae," PLOS Computational Biology, Public Library of Science, vol. 4(9), pages 1-10, September.
    2. Zing Tsung-Yeh Tsai & Shin-Han Shiu & Huai-Kuang Tsai, 2015. "Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-22, August.
    3. Segal Mark R, 2008. "Re-Cracking the Nucleosome Positioning Code," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-24, April.
    4. Moser Carlee & Gupta Mayetri, 2012. "A Generalized Hidden Markov Model for Determining Sequence-based Predictors of Nucleosome Positioning," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-23, January.
    5. Monica Naughtin & Zofia Haftek-Terreau & Johan Xavier & Sam Meyer & Maud Silvain & Yan Jaszczyszyn & Nicolas Levy & Vincent Miele & Mohamed Salah Benleulmi & Marc Ruff & Vincent Parissi & Cédric Vaill, 2015. "DNA Physical Properties and Nucleosome Positions Are Major Determinants of HIV-1 Integrase Selectivity," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-28, June.
    6. Anthony Mathelier & Wyeth W Wasserman, 2013. "The Next Generation of Transcription Factor Binding Site Prediction," PLOS Computational Biology, Public Library of Science, vol. 9(9), pages 1-18, September.
    7. Wolfram Möbius & Ulrich Gerland, 2010. "Quantitative Test of the Barrier Nucleosome Model for Statistical Positioning of Nucleosomes Up- and Downstream of Transcription Start Sites," PLOS Computational Biology, Public Library of Science, vol. 6(8), pages 1-11, August.
    8. Guo-Cheng Yuan & Jun S Liu, 2008. "Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion," PLOS Computational Biology, Public Library of Science, vol. 4(1), pages 1-11, January.
    9. Leelavati Narlikar & Raluca Gordân & Alexander J Hartemink, 2007. "A Nucleosome-Guided Map of Transcription Factor Binding Sites in Yeast," PLOS Computational Biology, Public Library of Science, vol. 3(11), pages 1-10, November.
    10. Wei Chen & Hao Lin & Peng-Mian Feng & Chen Ding & Yong-Chun Zuo & Kuo-Chen Chou, 2012. "iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-9, October.
    11. Joke J F A van Vugt & Martijn de Jager & Magdalena Murawska & Alexander Brehm & John van Noort & Colin Logie, 2009. "Multiple Aspects of ATP-Dependent Nucleosome Translocation by RSC and Mi-2 Are Directed by the Underlying DNA Sequence," PLOS ONE, Public Library of Science, vol. 4(7), pages 1-14, July.
    12. Fang Liu & Eivind Tøstesen & Jostein K Sundet & Tor-Kristian Jenssen & Christoph Bock & Geir Ivar Jerstad & William G Thilly & Eivind Hovig, 2007. "The Human Genomic Melting Map," PLOS Computational Biology, Public Library of Science, vol. 3(5), pages 1-13, May.
    13. Harsh Nagpal & Ahmad Ali-Ahmad & Yasuhiro Hirano & Wei Cai & Mario Halic & Tatsuo Fukagawa & Nikolina Sekulić & Beat Fierz, 2023. "CENP-A and CENP-B collaborate to create an open centromeric chromatin state," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    14. Kroll, K.M. & Ferrantini, A. & Domany, E., 2010. "Introduction to biology and chromosomal instabilities in cancer," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(20), pages 4374-4388.
    15. Xianfu Yi & Yu-Dong Cai & Zhisong He & WeiRen Cui & Xiangyin Kong, 2010. "Prediction of Nucleosome Positioning Based on Transcription Factor Binding Sites," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-7, September.
    16. Behrouz Eslami-Mossallam & Raoul D Schram & Marco Tompitak & John van Noort & Helmut Schiessel, 2016. "Multiplexing Genetic and Nucleosome Positioning Codes: A Computational Approach," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-14, June.
    17. Manu Setty & Christina S Leslie, 2015. "SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps," PLOS Computational Biology, Public Library of Science, vol. 11(5), pages 1-21, May.
    18. Jiangning Song & Hao Tan & Andrew J Perry & Tatsuya Akutsu & Geoffrey I Webb & James C Whisstock & Robert N Pike, 2012. "PROSPER: An Integrated Feature-Based Tool for Predicting Protease Substrate Cleavage Sites," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-23, November.
    19. Alexander W. Blocker & Edoardo M. Airoldi, 2016. "Template-Based Models for Genome-Wide Analysis of Next-Generation Sequencing Data at Base-Pair Resolution," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 967-987, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0020059. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.