IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v24y2005i1p115-123.html
   My bibliography  Save this article

Scaling behaviors of CG clusters in coding and noncoding DNA sequences

Author

Listed:
  • Zhang, Linxi
  • Chen, Jin

Abstract

In this paper the statistical properties of CG clusters in coding and non-coding DNA sequences are investigated through calculating the cluster-size distribution of CG clusters P(S) and the breadth of the distribution of the root-mean-square size of CG clusters σm in consecutive, non-overlapping blocks of m bases. There do exist some differences between coding and non-coding sequences. The cluster-size distribution of CG clusters P(S) for both coding and noncoding sequences follows an exponential decay of P(S)∝e−αS, and the value of α depends on the percentage of C–G content for coding sequences. It can fit into a linear line regularly but the case is contrary for noncoding sequences. We find that ξ(m)=σmm of CG clusters all obeys the good power-law decay of ξ(m)∝m−γ in both coding and non-coding sequences, and the value of γ is 0.949±0.014 and 0.826±0.011 for coding and noncoding sequences, respectively. Therefore, we can distinguish between coding and non-coding sequences on the basis of the value of γ. At the meantime, we also discuss the power-law of ξ(m)∝m−γ for random sequence, and find that the value of γ for random sequence is very close to 1.00. So we can know that the value of γ for coding sequences is more close to the random sequence, and obtain the conclusion that the behavior of coding sequence trends to random sequence more similarly. This investigation can provide some insights into DNA sequences.

Suggested Citation

  • Zhang, Linxi & Chen, Jin, 2005. "Scaling behaviors of CG clusters in coding and noncoding DNA sequences," Chaos, Solitons & Fractals, Elsevier, vol. 24(1), pages 115-123.
  • Handle: RePEc:eee:chsofr:v:24:y:2005:i:1:p:115-123
    DOI: 10.1016/j.chaos.2004.07.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S096007790400462X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2004.07.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Katsaloulis, P & Theoharis, T & Provata, A, 2002. "Statistical distributions of oligonucleotide combinations: applications in human chromosomes 21 and 22," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 316(1), pages 380-396.
    2. Provata, A. & Almirantis, Y., 1997. "Scaling properties of coding and non-coding DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 247(1), pages 482-496.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cheng, Jun & Zhang, Linxi, 2005. "Scaling behaviors of CG clusters for chromosomes," Chaos, Solitons & Fractals, Elsevier, vol. 25(2), pages 339-346.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Katsaloulis, P. & Theoharis, T. & Zheng, W.M. & Hao, B.L. & Bountis, A. & Almirantis, Y. & Provata, A., 2006. "Long-range correlations of RNA polymerase II promoter sequences across organisms," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 366(C), pages 308-322.
    2. Cheng, Jun & Zhang, Linxi, 2005. "Scaling behaviors of CG clusters for chromosomes," Chaos, Solitons & Fractals, Elsevier, vol. 25(2), pages 339-346.
    3. Thanos, Dimitrios & Li, Wentian & Provata, Astero, 2018. "Entropic fluctuations in DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 493(C), pages 444-457.
    4. Silva, R. & Silva, J.R.P. & Anselmo, D.H.A.L. & Alcaniz, J.S. & da Silva, W.J.C. & Costa, M.O., 2020. "An alternative description of power law correlations in DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    5. Wu, Zuo-Bing, 2010. "Global transposable characteristics in the complete DNA sequence of the yeast," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(24), pages 5698-5705.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:24:y:2005:i:1:p:115-123. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.