IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0096984.html
   My bibliography  Save this article

Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein

Author

Listed:
  • Mansour Ebrahimi
  • Parisa Aghagolzadeh
  • Narges Shamabadi
  • Ahmad Tahmasebi
  • Mohammed Alsharifi
  • David L Adelson
  • Farhid Hemmatzadeh
  • Esmaeil Ebrahimie

Abstract

The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics.

Suggested Citation

  • Mansour Ebrahimi & Parisa Aghagolzadeh & Narges Shamabadi & Ahmad Tahmasebi & Mohammed Alsharifi & David L Adelson & Farhid Hemmatzadeh & Esmaeil Ebrahimie, 2014. "Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-14, May.
  • Handle: RePEc:plo:pone00:0096984
    DOI: 10.1371/journal.pone.0096984
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0096984
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0096984&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0096984?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Andrea Freyer Dugas & Mehdi Jalalpour & Yulia Gel & Scott Levin & Fred Torcaso & Takeru Igusa & Richard E Rothman, 2013. "Influenza Forecasting with Google Flu Trends," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-7, February.
    2. Xia-Yu Xia & Meng Ge & Zhi-Xin Wang & Xian-Ming Pan, 2012. "Accurate Prediction of Protein Structural Class," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-8, June.
    3. Tokiko Watanabe & Maki Kiso & Satoshi Fukuyama & Noriko Nakajima & Masaki Imai & Shinya Yamada & Shin Murakami & Seiya Yamayoshi & Kiyoko Iwatsuki-Horimoto & Yoshihiro Sakoda & Emi Takashita & Ryan Mc, 2013. "Characterization of H7N9 influenza A viruses isolated from humans," Nature, Nature, vol. 501(7468), pages 551-555, September.
    4. Mansour Ebrahimi & Amir Lakizadeh & Parisa Agha-Golzadeh & Esmaeil Ebrahimie & Mahdi Ebrahimi, 2011. "Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-11, August.
    5. Ron A. M. Fouchier & Yoshihiro Kawaoka, 2013. "Gain-of-function experiments on H7N9," Nature, Nature, vol. 500(7461), pages 150-151, August.
    6. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    7. Lembke B., 1918. "√ a. p," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 111(1), pages 709-712, February.
    8. Masaki Imai & Tokiko Watanabe & Masato Hatta & Subash C. Das & Makoto Ozawa & Kyoko Shinya & Gongxun Zhong & Anthony Hanson & Hiroaki Katsura & Shinji Watanabe & Chengjun Li & Eiryo Kawakami & Shinya , 2012. "Experimental adaptation of an influenza H5 HA confers respiratory droplet transmission to a reassortant H5 HA/H1N1 virus in ferrets," Nature, Nature, vol. 486(7403), pages 420-428, June.
    9. David A. Steinhauer, 2013. "Pathways to human adaptation," Nature, Nature, vol. 499(7459), pages 412-413, July.
    10. Faezeh Hosseinzadeh & Mansour Ebrahimi & Bahram Goliaei & Narges Shamabadi, 2012. "Classification of Lung Cancer Tumors Based on Structural and Physicochemical Properties of Proteins by Bioinformatics Models," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-8, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    2. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    3. Jose L Herrera & Ravi Srinivasan & John S Brownstein & Alison P Galvani & Lauren Ancel Meyers, 2016. "Disease Surveillance on Complex Social Networks," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-16, July.
    4. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    5. Daniel Alejandro Gónzalez-Bandala & Juan Carlos Cuevas-Tello & Daniel E. Noyola & Andreu Comas-García & Christian A García-Sepúlveda, 2020. "Computational Forecasting Methodology for Acute Respiratory Infectious Disease Dynamics," IJERPH, MDPI, vol. 17(12), pages 1-20, June.
    6. Halousková, Martina & Stašek, Daniel & Horváth, Matúš, 2022. "The role of investor attention in global asset price variation during the invasion of Ukraine," Finance Research Letters, Elsevier, vol. 50(C).
    7. Livio Fenga, 2020. "Filtering and prediction of noisy and unstable signals: The case of Google Trends data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 281-295, March.
    8. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2015. "Flexible Modeling of Epidemics with an Empirical Bayes Framework," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-18, August.
    9. Jichang Dong & Wei Dai & Ying Liu & Lean Yu & Jie Wang, 2019. "Forecasting Chinese Stock Market Prices using Baidu Search Index with a Learning-Based Data Collection Method," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(05), pages 1605-1629, September.
    10. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2018. "Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-29, June.
    11. Baek, Changryong & Davis, Richard A. & Pipiras, Vladas, 2017. "Sparse seasonal and periodic vector autoregressive modeling," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 103-126.
    12. Sergei Rogosin & Maryna Dubatovskaya, 2017. "Letnikov vs. Marchaud: A Survey on Two Prominent Constructions of Fractional Derivatives," Mathematics, MDPI, vol. 6(1), pages 1-15, December.
    13. , Aisdl, 2019. "What Citizenship for What Transition?: Contradictions, Ambivalence, and Promises in Post-Socialist Citizenship Education in Vietnam," OSF Preprints jyqp5, Center for Open Science.
    14. Clarke, Matthew, 2011. "Innovative Delivery Mechanisms for Increased Aid Budgets," WIDER Working Paper Series 073, World Institute for Development Economic Research (UNU-WIDER).
    15. Patrick E. Shea, 2016. "Borrowing Trouble: Sovereign Credit, Military Regimes, and Conflict," International Interactions, Taylor & Francis Journals, vol. 42(3), pages 401-428, May.
    16. Valerio Antonelli & Raffaele D'Alessio & Roberto Rossi, 2014. "Budgetary practices in the Ministry of War and the Ministry of Munitions in Italy, 1915-1918," Accounting History Review, Taylor & Francis Journals, vol. 24(2-3), pages 139-160, November.
    17. Karlsson, Martin & Nilsson, Therese & Pichler, Stefan, 2012. "What Doesn't Kill You Makes You Stronger? The Impact of the 1918 Spanish Flu Epidemic on Economic Performance in Sweden," Working Paper Series 911, Research Institute of Industrial Economics.
    18. Roger R. Betancourt, 1969. "R. A. EASTERLIN. Population, Labor Force, and Long Swings in Economic Growth: The American Experience. Pp. xx, 298. New York: National Bureau of Economic Research (Distributed by Columbia University P," The ANNALS of the American Academy of Political and Social Science, , vol. 384(1), pages 183-192, July.
    19. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    20. Ilan Noy & Toshihiro Okubo & Eric Strobl, 2023. "The Japanese textile sector and the influenza pandemic of 1918–1920," Journal of Regional Science, Wiley Blackwell, vol. 63(5), pages 1192-1227, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0096984. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.