IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1004091.html
   My bibliography  Save this article

Protein Sectors: Statistical Coupling Analysis versus Conservation

Author

Listed:
  • Tiberiu Teşileanu
  • Lucy J Colwell
  • Stanislas Leibler

Abstract

Statistical coupling analysis (SCA) is a method for analyzing multiple sequence alignments that was used to identify groups of coevolving residues termed “sectors”. The method applies spectral analysis to a matrix obtained by combining correlation information with sequence conservation. It has been asserted that the protein sectors identified by SCA are functionally significant, with different sectors controlling different biochemical properties of the protein. Here we reconsider the available experimental data and note that it involves almost exclusively proteins with a single sector. We show that in this case sequence conservation is the dominating factor in SCA, and can alone be used to make statistically equivalent functional predictions. Therefore, we suggest shifting the experimental focus to proteins for which SCA identifies several sectors. Correlations in protein alignments, which have been shown to be informative in a number of independent studies, would then be less dominated by sequence conservation.Author Summary: Statistical analyses of alignments of evolutionarily related protein sequences have been proposed as a method for obtaining information about protein structure and function. One such method, called statistical coupling analysis, identifies patterns of correlated mutations and uses them to find groups of coevolving residues. These groups, called protein sectors, have been reported to be relevant for various functional aspects, such as enzymatic efficiency, protein stability, or allostery. Here, we reanalyze existing data in order to assess the relative importance of two factors contributing to statistical coupling analysis, namely single-site amino acid frequencies and pairwise correlations. Although correlations have been shown to be informative in other studies, we point out that in existing large-scale data that has been analyzed with statistical coupling analysis, single-site statistics seems to be a dominating factor.

Suggested Citation

  • Tiberiu Teşileanu & Lucy J Colwell & Stanislas Leibler, 2015. "Protein Sectors: Statistical Coupling Analysis versus Conservation," PLOS Computational Biology, Public Library of Science, vol. 11(2), pages 1-20, February.
  • Handle: RePEc:plo:pcbi00:1004091
    DOI: 10.1371/journal.pcbi.1004091
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004091
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004091&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1004091?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. J. P. Bouchaud & M. Potters, 2009. "Financial Applications of Random Matrix Theory: a short review," Papers 0910.1205, arXiv.org.
    2. Richard N. McLaughlin Jr & Frank J. Poelwijk & Arjun Raman & Walraj S. Gosal & Rama Ranganathan, 2012. "The spatial architecture of protein function and adaptation," Nature, Nature, vol. 491(7422), pages 138-142, November.
    3. Michael Socolich & Steve W. Lockless & William P. Russ & Heather Lee & Kevin H. Gardner & Rama Ranganathan, 2005. "Evolutionary information for specifying a protein fold," Nature, Nature, vol. 437(7058), pages 512-518, September.
    4. Lucy J Colwell & Michael P Brenner & Andrew W Murray, 2014. "Conservation Weighting Functions Enable Covariance Analyses to Detect Functionally Important Amino Acids," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-9, November.
    5. William P. Russ & Drew M. Lowery & Prashant Mishra & Michael B. Yaffe & Rama Ranganathan, 2005. "Natural-like function in artificial WW domains," Nature, Nature, vol. 437(7058), pages 579-583, September.
    6. James S. Fraser & Michael W. Clarkson & Sheena C. Degnan & Renske Erion & Dorothee Kern & Tom Alber, 2009. "Hidden alternative structures of proline isomerase essential for catalysis," Nature, Nature, vol. 462(7273), pages 669-673, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yasser Roudi & Sheila Nirenberg & Peter E Latham, 2009. "Pairwise Maximum Entropy Models for Studying Large Biological Systems: When They Can Work and When They Can't," PLOS Computational Biology, Public Library of Science, vol. 5(5), pages 1-18, May.
    2. Shou-Wen Wang & Anne-Florence Bitbol & Ned S Wingreen, 2019. "Revealing evolutionary constraints on proteins through sequence analysis," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-16, April.
    3. Hugo Jacquin & Amy Gilson & Eugene Shakhnovich & Simona Cocco & Rémi Monasson, 2016. "Benchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Models," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-18, May.
    4. Xu, Xiu-Lian & Shi, Jin-Xuan & Wang, Jun & Li, Wenfei, 2021. "Long-range correlation and critical fluctuations in coevolution networks of protein sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 562(C).
    5. Pierre-Alain Reigneron & Romain Allez & Jean-Philippe Bouchaud, 2010. "Principal Regression Analysis and the index leverage effect," Papers 1011.5810, arXiv.org, revised Feb 2011.
    6. Alistair Bailey & Andy van Hateren & Tim Elliott & Jörn M Werner, 2014. "Two Polymorphisms Facilitate Differences in Plasticity between Two Chicken Major Histocompatibility Complex Class I Proteins," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-11, February.
    7. Luu, Duc Thi & Yanovski, Boyan & Lux, Thomas, 2018. "An analysis of systematic risk in worldwide econonomic sentiment indices," Economics Working Papers 2018-03, Christian-Albrechts-University of Kiel, Department of Economics.
    8. Duc Thi Luu, 2022. "Portfolio Correlations in the Bank-Firm Credit Market of Japan," Computational Economics, Springer;Society for Computational Economics, vol. 60(2), pages 529-569, August.
    9. Ahmed Abdul Quadeer & David Morales-Jimenez & Matthew R McKay, 2018. "Co-evolution networks of HIV/HCV are modular with direct association to structure and function," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-29, September.
    10. Vincent Tan & Stefan Zohren, 2020. "Estimation of Large Financial Covariances: A Cross-Validation Approach," Papers 2012.05757, arXiv.org, revised Jan 2023.
    11. M. Raddant & T. Di Matteo, 2023. "A look at financial dependencies by means of econophysics and financial economics," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 18(4), pages 701-734, October.
    12. Mattia Guerini & Duc Thi Luu & Mauro Napoletano, 2023. "Synchronization patterns in the European Union," Applied Economics, Taylor & Francis Journals, vol. 55(18), pages 2038-2059, April.
    13. repec:hal:spmain:info:hdl:2441/5q8fnecj1u87ka099dc571bhi2 is not listed on IDEAS
    14. Khaled Daqrouq & Rami Alhmouz & Ahmed Balamesh & Adnan Memic, 2015. "Application of Wavelet Transform for PDZ Domain Classification," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-16, April.
    15. Liu Ziyin & Kentaro Minami & Kentaro Imajo, 2021. "Theoretically Motivated Data Augmentation and Regularization for Portfolio Construction," Papers 2106.04114, arXiv.org, revised Dec 2022.
    16. Sven Husmann & Antoniya Shivarova & Rick Steinert, 2022. "Sparsity and stability for minimum-variance portfolios," Risk Management, Palgrave Macmillan, vol. 24(3), pages 214-235, September.
    17. Sandoval, Leonidas Junior & Bruscato, Adriana & Venezuela, Maria Kelly, 2012. "Building portfolios of stocks in the São Paulo Stock Exchange using Random Matrix Theory," Insper Working Papers wpe_270, Insper Working Paper, Insper Instituto de Ensino e Pesquisa.
    18. Yongcheng Qi & Mengzi Xie, 2020. "Spectral Radii of Products of Random Rectangular Matrices," Journal of Theoretical Probability, Springer, vol. 33(4), pages 2185-2212, December.
    19. F. Pozzi & T. Matteo & T. Aste, 2012. "Exponential smoothing weighted correlations," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 85(6), pages 1-21, June.
    20. Zeng, Xingyuan, 2017. "Limiting empirical distribution for eigenvalues of products of random rectangular matrices," Statistics & Probability Letters, Elsevier, vol. 126(C), pages 33-40.
    21. Erik van Nimwegen, 2016. "Inferring Contacting Residues within and between Proteins: What Do the Probabilities Mean?," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-10, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1004091. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.