A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes

Author

Listed:
  • Miklos Csuros
  • Igor B Rogozin
  • Eugene V Koonin

Abstract

Protein-coding genes in eukaryotes are interrupted by introns, but intron densities differ widely between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. The MCMC and ML inferences agree closely, whereas Dollo parsimony introduces a noticeable bias in the estimates, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing.

Author Summary

In eukaryotes, protein-coding genes are interrupted by non-coding introns. Intron densities differ widely, from 6–7 introns per kilobase of coding sequence in vertebrates, some invertebrates and plants, to only a few introns across the entire genome in many unicellular forms. We applied a robust statistical methodology, Markov Chain Monte Carlo, to reconstruct the history of intron gain and loss throughout the evolution of eukaryotes using a set of 245 homologous genes from 99 genomes that represent the diversity of eukaryotes. Intron-rich ancestors were confidently inferred for each major eukaryotic group, including 53% to 74% of the human intron density for the last eukaryotic common ancestor and 120% to 130% of the human value for the last common ancestor of animals. Evolution of eukaryotic genes involved primarily intron loss, with substantial gain only at the bases of several major branches including plants and animals. Thus, the common ancestor of all extant eukaryotes was a complex organism with a gene architecture resembling that of multicellular organisms. The line of descent from the last common ancestor to mammals remained in an uninterrupted intron-rich state that, given the error-prone splicing in intron-rich organisms, was conducive to the elaboration of functional alternative splicing.
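For readers unfamiliar with Dollo parsimony, one of the three methods compared in the abstract, the following is a minimal sketch of the general technique on a toy tree. It is not the authors' implementation: the tree, the node names (`root`, `n1`, `n2`, `A` to `D`) and the function name `dollo_presence` are all hypothetical. Under the Dollo assumption an intron is gained at most once, so it is placed in an ancestor only when that ancestor lies between intron-bearing leaves. An intron that survives in a single lineage is therefore treated as a recent gain, which may be one reason the method tends to give lower ancestral densities.

```python
def dollo_presence(children, root, leaf_has_intron):
    """Infer presence of one intron site at every node under Dollo parsimony.

    children: dict mapping an internal node to its list of child nodes
              (hypothetical toy format; leaves do not appear as keys).
    leaf_has_intron: dict mapping leaf name to True/False.
    """
    counts = {}

    def count(node):
        # Post-order pass: number of intron-bearing leaves below each node.
        kids = children.get(node, [])
        if not kids:
            counts[node] = 1 if leaf_has_intron.get(node, False) else 0
        else:
            counts[node] = sum(count(k) for k in kids)
        return counts[node]

    total = count(root)

    states = {}
    for node, n in counts.items():
        kids = children.get(node, [])
        if not kids:
            states[node] = n == 1
        else:
            # An internal node carries the intron if it has an intron-bearing
            # leaf below it and either another such leaf lies outside its
            # subtree, or the bearing leaves split across two or more children.
            carrying = sum(1 for k in kids if counts[k] > 0)
            states[node] = n > 0 and (n < total or carrying >= 2)
    return states


# Toy rooted tree: ((( A, B ) n2, C ) n1, D ) root
toy_tree = {"root": ["n1", "D"], "n1": ["n2", "C"], "n2": ["A", "B"]}

# Intron observed in leaves A and C only.
states = dollo_presence(toy_tree, "root",
                        {"A": True, "B": False, "C": True, "D": False})
```

In this toy case the intron is placed at `n1` and `n2` but not at `root`, with a single loss on the branch to `B`. The probabilistic methods described in the abstract instead assign a probability of presence to each ancestor, which this sketch does not attempt.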

Suggested Citation

  • Miklos Csuros & Igor B Rogozin & Eugene V Koonin, 2011. "A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes," PLOS Computational Biology, Public Library of Science, vol. 7(9), pages 1-9, September.
  • Handle: RePEc:plo:pcbi00:1002150
    DOI: 10.1371/journal.pcbi.1002150

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002150
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002150&type=printable
    Download Restriction: no


    Citations

    Cited by:

    1. Maria E Gallegos & Sanjeev Balakrishnan & Priya Chandramouli & Shaily Arora & Aruna Azameera & Anitha Babushekar & Emilee Bargoma & Abdulmalik Bokhari & Siva Kumari Chava & Pranti Das & Meetali Desai, 2012. "The C. elegans Rab Family: Identification, Classification and Toolkit Construction," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-19, November.
    2. Poonam Kashyap & Kalyani R. Aswale & Abhijit S. Deshmukh, 2025. "Deletion of splicing factor Cdc5 in Toxoplasma disrupts transcriptome integrity, induces abortive bradyzoite formation, and prevents acute infection in mice," Nature Communications, Nature, vol. 16(1), pages 1-23, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Megan D. Schertzer & Andrew Stirn & Keren Isaev & Laura Pereira & Stella H. Park & Anjali Das & Aline Réal & Erin D. Jeffery & Claire Harbison & Hans-Hermann Wessels & Gloria M. Sheynkman & Neville E., 2025. "Cas13d-mediated isoform-specific RNA knockdown with a unified computational and experimental toolbox," Nature Communications, Nature, vol. 16(1), pages 1-19, December.
    2. Gustavo Glusman & Juan Caballero & Max Robinson & Burak Kutlu & Leroy Hood, 2013. "Optimal Scaling of Digital Transcriptomes," PLOS ONE, Public Library of Science, vol. 8(11), pages 1-12, November.
    3. Wei Sun & Yufeng Liu & James J. Crowley & Ting-Huei Chen & Hua Zhou & Haitao Chu & Shunping Huang & Pei-Fen Kuan & Yuan Li & Darla Miller & Ginger Shaw & Yichao Wu & Vasyl Zhabotynsky & Leonard McMill, 2015. "IsoDOT Detects Differential RNA-Isoform Expression/Usage With Respect to a Categorical or Continuous Covariate With High Sensitivity and Specificity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 975-986, September.
    4. Xiaohong Li & Guy N Brock & Eric C Rouchka & Nigel G F Cooper & Dongfeng Wu & Timothy E O’Toole & Ryan S Gill & Abdallah M Eteleeb & Liz O’Brien & Shesh N Rai, 2017. "A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-22, May.
    5. Michelle M. Kameda-Smith & Helen Zhu & En-Ching Luo & Yujin Suk & Agata Xella & Brian Yee & Chirayu Chokshi & Sansi Xing & Frederick Tan & Raymond G. Fox & Ashley A. Adile & David Bakhshinyan & Kevin, 2022. "Characterization of an RNA binding protein interactome reveals a context-specific post-transcriptional landscape of MYC-amplified medulloblastoma," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Hannah N. Jacobs & Bram L. Gorissen & Jeremy Guez & Masahiro Kanai & Kavi Gupta & Hilary K. Finucane & Konrad J. Karczewski & Christopher B. Burge, 2025. "Widespread naturally variable human exons aid genetic interpretation," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    7. Justin Bo-Kai Hsu & Neil Arvin Bretaña & Tzong-Yi Lee & Hsien-Da Huang, 2011. "Incorporating Evolutionary Information and Functional Domains for Identifying RNA Splicing Factors in Humans," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-11, November.
    8. Patryk Poliński & Marta Miret Cuesta & Alfonsa Zamora-Moratalla & Federica Mantica & Gerard Cantero-Recasens & Carlotta Viana & Miguel Sabariego-Navarro & Davide Normanno & Luis P. Iñiguez & Cruz More, 2025. "A highly conserved neuronal microexon in DAAM1 controls actin dynamics, RHOA/ROCK signaling, and memory formation," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    9. Jun Inamo & Akari Suzuki & Mahoko Takahashi Ueda & Kensuke Yamaguchi & Hiroshi Nishida & Katsuya Suzuki & Yuko Kaneko & Tsutomu Takeuchi & Hiroaki Hatano & Kazuyoshi Ishigaki & Yasushi Ishihama & Kazu, 2024. "Long-read sequencing for 29 immune cell subsets reveals disease-linked isoforms," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    10. Feng Wang & Yang Xu & Robert Wang & Beatrice Zhang & Noah Smith & Amber Notaro & Samantha Gaerlan & Eric Kutschera & Kathryn E. Kadash-Edmondson & Yi Xing & Lan Lin, 2023. "TEQUILA-seq: a versatile and low-cost method for targeted long-read RNA sequencing," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    11. Elizabeth A. Werren & Geneva R. LaForce & Anshika Srivastava & Delia R. Perillo & Shaokun Li & Katherine Johnson & Safa Baris & Brandon Berger & Samantha L. Regan & Christian D. Pfennig & Sonja Munnik, 2024. "TREX tetramer disruption alters RNA processing necessary for corticogenesis in THOC6 Intellectual Disability Syndrome," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    12. Yvonne L. Chao & Katherine I. Zhou & Kwame K. Forbes & Alessandro Porrello & Gabrielle M. Gentile & Yinzhou Zhu & Aaron C. Chack & Dixcy J. S. John Mary & Haizhou Liu & Eric Cockman & Lincy Edatt & Gr, 2025. "Snord67 promotes breast cancer metastasis by guiding U6 modification and modulating the splicing landscape," Nature Communications, Nature, vol. 16(1), pages 1-23, December.
    13. Patricia González-Rodríguez & Daniel J. Klionsky & Bertrand Joseph, 2022. "Autophagy regulation by RNA alternative splicing and implications in human diseases," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    14. Stacey D Wagner & Adam J Struck & Riti Gupta & Dylan R Farnsworth & Amy E Mahady & Katy Eichinger & Charles A Thornton & Eric T Wang & J Andrew Berglund, 2016. "Dose-Dependent Regulation of Alternative Splicing by MBNL Proteins Reveals Biomarkers for Myotonic Dystrophy," PLOS Genetics, Public Library of Science, vol. 12(9), pages 1-24, September.
    15. Nysia I George & John F Bowyer & Nathaniel M Crabtree & Ching-Wei Chang, 2015. "An Iterative Leave-One-Out Approach to Outlier Detection in RNA-Seq Data," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-10, June.
    16. Hung D Nguyen & Maki Yoshihama & Naoya Kenmochi, 2006. "Authors' Reply," PLOS Computational Biology, Public Library of Science, vol. 2(7), pages 1-2, July.
    17. Alberto Riva & Graziano Pesole, 2009. "A Unique, Consistent Identifier for Alternatively Spliced Transcript Variants," PLOS ONE, Public Library of Science, vol. 4(10), pages 1-10, October.
    18. Liguo Wang & Yuanxin Xi & Jun Yu & Liping Dong & Laising Yen & Wei Li, 2010. "A Statistical Method for the Detection of Alternative Splicing Using RNA-Seq," PLOS ONE, Public Library of Science, vol. 5(1), pages 1-8, January.
    19. Jeremy Vicencio & Daisuke Chihara & Matthias Eder & Lucia Sedlackova & Julie Ahringer & Nicholas Stroustrup, 2025. "Engineering the auxin-inducible degron system for tunable in vivo control of organismal physiology," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    20. Thomas A Richards & Laura Eme & John M Archibald & Guy Leonard & Susana M Coelho & Alex de Mendoza & Christophe Dessimoz & Pavel Dolezal & Lillian K Fritz-Laylin & Toni Gabaldón & Vladimír Hampl & Gee, 2024. "Reconstructing the last common ancestor of all eukaryotes," PLOS Biology, Public Library of Science, vol. 22(11), pages 1-24, November.
