IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003254.html
   My bibliography  Save this article

Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses

Author

Listed:
  • Ricardo Aguas
  • Neil M Ferguson

Abstract

Despite environmental, social and ecological dependencies, emergence of zoonotic viruses in human populations is clearly also affected by genetic factors which determine cross-species transmission potential. RNA viruses pose an interesting case study given their mutation rates are orders of magnitude higher than any other pathogen – as reflected by the recent emergence of SARS and Influenza for example. Here, we show how feature selection techniques can be used to reliably classify viral sequences by host species, and to identify the crucial minority of host-specific sites in pathogen genomic data. The variability in alleles at those sites can be translated into prediction probabilities that a particular pathogen isolate is adapted to a given host. We illustrate the power of these methods by: 1) identifying the sites explaining SARS coronavirus differences between human, bat and palm civet samples; 2) showing how cross species jumps of rabies virus among bat populations can be readily identified; and 3) de novo identification of likely functional influenza host discriminant markers.Author Summary: Moving away from genome scan methods used for human GWAS (ultimately inappropriate for the short highly polymorphic genomes of RNA viruses), our work shows the power and potential of multi-class machine learning algorithms in inferring the functional genetic changes associated with phenotypic change (e.g. crossing a species barrier). We show that even distantly related viruses within a viral family share highly conserved genetic signatures of host specificity; reinforce how fitness landscapes of host adaptation are shaped by host phylogeny; and highlight the evolutionary trajectories of RNA viruses in rapid expansion and under great evolutionary pressure. We do so by (for each dataset) unveiling a set of phenotype characteristic mutations which are shown to be functionally relevant, thus providing new insights into phenotypic relationships between RNA viruses. These methods also provide a solid statistical framework with which the degree of host adaptation can be inferred, thus serving as a valuable tool for studying host transition events with particular relevance for emerging infectious diseases. These methods can then serve as rigorous tools of emergence potential assessment, specifically in scenarios where rapid host classification of newly emerging viruses can be more important than identifying putative functional sites.

Suggested Citation

  • Ricardo Aguas & Neil M Ferguson, 2013. "Feature Selection Methods for Identifying Genetic Determinants of Host Species in RNA Viruses," PLOS Computational Biology, Public Library of Science, vol. 9(10), pages 1-10, October.
  • Handle: RePEc:plo:pcbi00:1003254
    DOI: 10.1371/journal.pcbi.1003254
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003254
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003254&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003254?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gabriele Neumann & Takeshi Noda & Yoshihiro Kawaoka, 2009. "Emergence and pandemic potential of swine-origin H1N1 influenza virus," Nature, Nature, vol. 459(7249), pages 931-939, June.
    2. Kate E. Jones & Nikkita G. Patel & Marc A. Levy & Adam Storeygard & Deborah Balk & John L. Gittleman & Peter Daszak, 2008. "Global trends in emerging infectious diseases," Nature, Nature, vol. 451(7181), pages 990-993, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tobias S Brett & Pejman Rohani, 2020. "Dynamical footprints enable detection of disease emergence," PLOS Biology, Public Library of Science, vol. 18(5), pages 1-20, May.
    2. Nikolett Orosz & Tünde Tóthné Tóth & Gyöngyi Vargáné Gyuró & Zsoltné Tibor Nábrádi & Klára Hegedűsné Sorosi & Zsuzsa Nagy & Éva Rigó & Ádám Kaposi & Gabriella Gömöri & Cornelia Melinda Adi Santoso & A, 2022. "Comparison of Length of Hospital Stay for Community-Acquired Infections Due to Enteric Pathogens, Influenza Viruses and Multidrug-Resistant Bacteria: A Cross-Sectional Study in Hungary," IJERPH, MDPI, vol. 19(23), pages 1-16, November.
    3. Mudassar Arsalan & Omar Mubin & Fady Alnajjar & Belal Alsinglawi, 2020. "COVID-19 Global Risk: Expectation vs. Reality," IJERPH, MDPI, vol. 17(15), pages 1-10, August.
    4. Ceddia, M.G. & Bardsley, N.O. & Goodwin, R. & Holloway, G.J. & Nocella, G. & Stasi, A., 2013. "A complex system perspective on the emergence and spread of infectious diseases: Integrating economic and ecological aspects," Ecological Economics, Elsevier, vol. 90(C), pages 124-131.
    5. John M Drake & Tobias S Brett & Shiyang Chen & Bogdan I Epureanu & Matthew J Ferrari & Éric Marty & Paige B Miller & Eamon B O’Dea & Suzanne M O’Regan & Andrew W Park & Pejman Rohani, 2019. "The statistics of epidemic transitions," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-14, May.
    6. Ongolo, Symphorien & Giessen, Lukas & Karsenty, Alain & Tchamba, Martin & Krott, Max, 2021. "Forestland policies and politics in Africa: Recent evidence and new challenges," Forest Policy and Economics, Elsevier, vol. 127(C).
    7. Paige, Sarah B. & Malavé, Carly & Mbabazi, Edith & Mayer, Jonathan & Goldberg, Tony L., 2015. "Uncovering zoonoses awareness in an emerging disease ‘hotspot’," Social Science & Medicine, Elsevier, vol. 129(C), pages 78-86.
    8. Jianhua Wang & Guan-Zhu Han, 2023. "Genome mining shows that retroviruses are pervasively invading vertebrate genomes," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    9. Livia Marchetti & Valentina Cattivelli & Claudia Cocozza & Fabio Salbitano & Marco Marchetti, 2020. "Beyond Sustainability in Food Systems: Perspectives from Agroecology and Social Innovation," Sustainability, MDPI, vol. 12(18), pages 1-24, September.
    10. Ivan Montiel & Junghoon Park & Bryan W. Husted & Andres Velez-Calle, 2022. "Tracing the connections between international business and communicable diseases," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 53(8), pages 1785-1804, October.
    11. Maxwell B Joseph & William E Stutz & Pieter T J Johnson, 2016. "Multilevel Models for the Distribution of Hosts and Symbionts," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-15, November.
    12. Laure Bonnaud & Nicolas Fortané, 2017. "Serge Morand and Muriel Figuié (eds), 2016, Emergence de maladies infectieuses. Risques et enjeux de société (The emergence of infectious diseases. Societal risks and stakes)," Review of Agricultural, Food and Environmental Studies, Springer, vol. 98(3), pages 225-228, December.
    13. Chen, Xiaowei & Chong, Wing Fung & Feng, Runhuan & Zhang, Linfeng, 2021. "Pandemic risk management: Resources contingency planning and allocation," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 359-383.
    14. Lin Zhang & Jason Rohr & Ruina Cui & Yusi Xin & Lixia Han & Xiaona Yang & Shimin Gu & Yuanbao Du & Jing Liang & Xuyu Wang & Zhengjun Wu & Qin Hao & Xuan Liu, 2022. "Biological invasions facilitate zoonotic disease emergences," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    15. Elisa Giannone & Nuno Paixao & Xinle Pang, 2021. "The Geography of Pandemic Containment," Staff Working Papers 21-26, Bank of Canada.
    16. Katarzyna Kubiak & Hanna Szymańska & Małgorzata Dmitryjuk & Ewa Dzika, 2022. "Abundance of Ixodes ricinus Ticks (Acari: Ixodidae) and the Diversity of Borrelia Species in Northeastern Poland," IJERPH, MDPI, vol. 19(12), pages 1-18, June.
    17. Michael George Roberts & Hiroshi Nishiura, 2011. "Early Estimation of the Reproduction Number in the Presence of Imported Cases: Pandemic Influenza H1N1-2009 in New Zealand," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-9, May.
    18. Christian Sieben & Erdinc Sezgin & Christian Eggeling & Suliana Manley, 2020. "Influenza A viruses use multivalent sialic acid clusters for cell binding and receptor activation," PLOS Pathogens, Public Library of Science, vol. 16(7), pages 1-27, July.
    19. Anna C. Peterson & Himanshu Sharma & Arvind Kumar & Bruno M. Ghersi & Scott J. Emrich & Kurt J. Vandegrift & Amit Kapoor & Michael J. Blum, 2021. "Rodent Virus Diversity and Differentiation across Post-Katrina New Orleans," Sustainability, MDPI, vol. 13(14), pages 1-18, July.
    20. Blanco, Esther & Baier, Alexandra & Holzmeister, Felix & Jaber-Lopez, Tarek & Struwe, Natalie, 2022. "Substitution of social sustainability concerns under the Covid-19 pandemic," Ecological Economics, Elsevier, vol. 192(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003254. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.