IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000057.html
   My bibliography  Save this article

Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes

Author

Listed:
  • Jon Bohlin
  • Eystein Skjerve
  • David W Ussery

Abstract

Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA ‘word-sizes’ and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than free-living archaea and bacteria.Author Summary: There are potentially many factors responsible for how archaeal and bacterial genomes are composed. Recent advances in DNA sequencing have made it possible to use computational and statistical methods to examine the interplay between evolution and genomic composition. We wished to see whether particular properties could be extracted that would provide clues on how prokaryotic DNA is composed. For instance, we wondered whether or not protein coding regions carried a greater information potential than non-coding regions, if there is a link between genome size and GC content, whether GC content is different in coding and non-coding regions, and possible associations between DNA composition and environment. Our results indicated that genomic nucleotide frequencies are a determinant of many DNA compositional properties, but also that other influences are at work. For instance, bacteria are known to frequently exchange DNA with the environment and other organisms. Acquired DNA can therefore have different compositional properties than host DNA, and since pathogenicity and antibiotic resistance in bacteria is often associated with foreign DNA, advancing the knowledge of DNA composition is of great importance.

Suggested Citation

  • Jon Bohlin & Eystein Skjerve & David W Ussery, 2008. "Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes," PLOS Computational Biology, Public Library of Science, vol. 4(4), pages 1-9, April.
  • Handle: RePEc:plo:pcbi00:1000057
    DOI: 10.1371/journal.pcbi.1000057
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000057
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000057&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000057?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jon Bohlin & Ola Brynildsrud & Tammi Vesth & Eystein Skjerve & David W Ussery, 2013. "Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-10, July.
    2. Colin F Davenport & Burkhard Tümmler, 2010. "Abundant Oligonucleotides Common to Most Bacteria," PLOS ONE, Public Library of Science, vol. 5(3), pages 1-8, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000057. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.