Author
Listed:
- Artem S Kasianov
- Anna V Klepikova
- Alexey V Mayorov
- Gleb S Buzanov
- Maria D Logacheva
- Aleksey A Penin
Abstract
Interspecific gene comparisons are the keystones for many areas of biological research and are especially important for the translation of knowledge from model organisms to economically important species. Currently they are hampered by the low resolution of methods based on sequence analysis and by the complex evolutionary history of eukaryotic genes. This is especially critical for plants, whose genomes are shaped by multiple whole genome duplications and subsequent gene loss. This requires the development of new methods for comparing the functions of genes in different species. Here, we report ISEEML (Interspecific Similarity of Expression Evaluated using Machine Learning)–a novel machine learning-based algorithm for interspecific gene classification. In contrast to previous studies focused on sequence similarity, our algorithm focuses on functional similarity inferred from the comparison of gene expression profiles. We propose novel metrics for expression pattern similarity–expression score (ES)–that is suitable for species with differing morphologies. As a proof of concept, we compare detailed transcriptome maps of Arabidopsis thaliana, the model species, Zea mays (maize) and Fagopyrum esculentum (common buckwheat), which are species that represent distant clades within flowering plants. The classifier resulted in an AUC of 0.91; under the ES threshold of 0.5, the specificity was 94%, and sensitivity was 72%.Author summary: Interspecific gene comparisons are keystone for many areas of biological research, being especially important for the translation of knowledge from model organisms to economically important species. Currently, they are based on the concept of orthology–the orthologs are assumed to have similar functions (the so called Ortholog Conjecture). This approach is problematic for two reasons: 1) the universal applicability of Ortholog Conjecture is arguable 2) the accuracy of orthology inference is complicated due to multiple whole genome duplications and subsequent gene loss–the typical processes for most eukaryotic organisms. We report a novel machine-learning-based algorithm for the interspecific gene comparison. In contrast to previous studies, which focus on sequence similarity, it focuses on the similarity of function at the organismic level approximated by the expression patterns. As source of information for the classification, we use detailed gene expression maps. Our study for the first time proposes a metrics for comparison of expression maps suitable for species with differing morphologies and/or developmental rates. Without this, the comparisons of expression maps were possible only either for closely related species with similar morphology or for very low resolution maps. In contrast, our approach is suitable for the wide range of organisms with no limitations of their morphology and the resolution of expression maps.
Suggested Citation
Artem S Kasianov & Anna V Klepikova & Alexey V Mayorov & Gleb S Buzanov & Maria D Logacheva & Aleksey A Penin, 2023.
"Interspecific comparison of gene expression profiles using machine learning,"
PLOS Computational Biology, Public Library of Science, vol. 19(1), pages 1-20, January.
Handle:
RePEc:plo:pcbi00:1010743
DOI: 10.1371/journal.pcbi.1010743
Download full text from publisher
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
- Marjolein Carron & Attila Placido Sachslehner & Munevver Burcu Cicekdal & Inge Bruggeman & Suzan Demuynck & Bahar Golabi & Elfride Baere & Wim Declercq & Erwin Tschachler & Kris Vleminckx & Leopold Ec, 2024.
"Evolutionary origin of Hoxc13-dependent skin appendages in amphibians,"
Nature Communications, Nature, vol. 15(1), pages 1-12, December.
- Maxence Lanoizelet & Léo Michel & Ronan Lagadec & Hélène Mayeur & Lucile Guichard & Valentin Logeux & Dany Séverac & Kyle Martin & Christophe Klopp & Sylvain Marcellini & Héctor Castillo & Nicolas Pol, 2024.
"Analysis of a shark reveals ancient, Wnt-dependent, habenular asymmetries in vertebrates,"
Nature Communications, Nature, vol. 15(1), pages 1-16, December.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1010743. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.