IDEAS home Printed from https://ideas.repec.org/a/spr/endesu/v25y2023i11d10.1007_s10668-022-02646-3.html
   My bibliography  Save this article

Understanding the COVID-19 pandemic prevalence in Africa through optimal feature selection and clustering: evidence from a statistical perspective

Author

Listed:
  • Mohamed Lamine Sidibé

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Roland Yonaba

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Fowé Tazen

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Héla Karoui

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Ousmane Koanda

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Babacar Lèye

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Harinaivo Anderson Andrianisa

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

  • Harouna Karambiri

    (Institut International d’Ingénierie de l’Eau Et de l’Environnement (2iE))

Abstract

The COVID-19 pandemic, which outbroke in Wuhan (China) in December 2019, severely hit almost all sectors of activity in the world as a consequence of the restrictive measures imposed. Two years later, Africa still emerges as the least affected continent by the pandemic. This study analyzed COVID-19 prevalence across African countries through country-level variables prior to clustering. Using Spearman-rank correlation, multicollinearity analysis and univariate filtering, 9 country-level variables were identified from an initial set of 34 variables. These variables relate to socioeconomic status, population structure, healthcare system and environment and the climatic setting. A clustering of the 54 African countries is further carried out through the use of agglomerative hierarchical clustering (AHC) method, which generated 3 distinctive clusters. Cluster 1 (11 countries) is the most affected by COVID-19 (median of 63,508.6 confirmed cases and 946.5 deaths per million) and is composed of countries with the highest socioeconomic status. Cluster 2 (27 countries) is the least affected (median of 4473.7 confirmed cases and 81.2 deaths per million), and mainly features countries with the least socioeconomic features and international exposure. Cluster 3 (16 countries) is intermediate in terms of COVID-19 prevalence (median of 2569.3 confirmed cases and 35.7 deaths per million) and features countries the least urbanized and geographically close to the equator, with intermediate international exposure and socioeconomic features. These findings shed light on the main features of COVID-19 prevalence in Africa and might help refine effectively coping management strategies of the ongoing pandemic.

Suggested Citation

  • Mohamed Lamine Sidibé & Roland Yonaba & Fowé Tazen & Héla Karoui & Ousmane Koanda & Babacar Lèye & Harinaivo Anderson Andrianisa & Harouna Karambiri, 2023. "Understanding the COVID-19 pandemic prevalence in Africa through optimal feature selection and clustering: evidence from a statistical perspective," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 25(11), pages 13565-13593, November.
  • Handle: RePEc:spr:endesu:v:25:y:2023:i:11:d:10.1007_s10668-022-02646-3
    DOI: 10.1007/s10668-022-02646-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10668-022-02646-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10668-022-02646-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Benjamin F. Zaitchik & Neville Sweijd & Joy Shumake-Guillemot & Andy Morse & Chris Gordon & Aileen Marty & Juli Trtanj & Juerg Luterbacher & Joel Botai & Swadhin Behera & Yonglong Lu & Jane Olwoch & K, 2020. "A framework for research linking weather, climate and COVID-19," Nature Communications, Nature, vol. 11(1), pages 1-3, December.
    2. Burim Ramosaj & Markus Pauly, 2019. "Predicting missing values: a comparative study on non-parametric approaches for imputation," Computational Statistics, Springer, vol. 34(4), pages 1741-1764, December.
    3. Glenn Milligan & Martha Cooper, 1985. "An examination of procedures for determining the number of clusters in a data set," Psychometrika, Springer;The Psychometric Society, vol. 50(2), pages 159-179, June.
    4. Charrad, Malika & Ghazzali, Nadia & Boiteau, Véronique & Niknafs, Azam, 2014. "NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i06).
    5. Anthony Medford & Sergi Trias-Llimós, 2020. "Population age structure only partially explains the large number of COVID-19 deaths at the oldest ages," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 43(19), pages 533-544.
    6. Mizanur Rahman & Mahmuda Islam & Mehedi Hasan Shimanto & Jannatul Ferdous & Abdullah Al-Nur Shanto Rahman & Pabitra Singha Sagor & Tahasina Chowdhury, 2021. "A global analysis on the effect of temperature, socio-economic and environmental factors on the spread and mortality rate of the COVID-19 pandemic," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(6), pages 9352-9366, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cyril Atkinson-Clement & Eléonore Pigalle, 2021. "What can we learn from Covid-19 pandemic’s impact on human behaviour? The case of France’s lockdown," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-12, December.
    2. Tae Kyung Yoon & SoEun Ahn, 2020. "Clustering Koreans’ Environmental Awareness and Attitudes into Seven Groups: Environmentalists, Dissatisfieds, Inactivators, Bystanders, Honeybees, Optimists, and Moderates," Sustainability, MDPI, vol. 12(20), pages 1-18, October.
    3. Bolívar, Fernando & Duran, Miguel A. & Lozano-Vivas, Ana, 2023. "Business model contributions to bank profit performance: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    4. Weinand, J.M. & McKenna, R. & Fichtner, W., 2019. "Developing a municipality typology for modelling decentralised energy systems," Utilities Policy, Elsevier, vol. 57(C), pages 75-96.
    5. Lyu, Guowei & Bertolini, Luca & Pfeffer, Karin, 2016. "Developing a TOD typology for Beijing metro station areas," Journal of Transport Geography, Elsevier, vol. 55(C), pages 40-50.
    6. Teichgraeber, Holger & Brandt, Adam R., 2022. "Time-series aggregation for the optimization of energy systems: Goals, challenges, approaches, and opportunities," Renewable and Sustainable Energy Reviews, Elsevier, vol. 157(C).
    7. Andrea De Mauro & Marco Greco & Michele Grimaldi, 2019. "Understanding Big Data Through a Systematic Literature Review: The ITMI Model," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(04), pages 1433-1461, July.
    8. Maki Ikegami & Bart Neuts, 2020. "Strategic Options for Campus Sustainability: Cluster Analysis on Higher Education Institutions in Japan," Sustainability, MDPI, vol. 12(6), pages 1-13, March.
    9. Igor Kravchuk & Viktoriia Stoika, 2021. "Business Μodels of Βanks for the Financial Markets in the EU," European Research Studies Journal, European Research Studies Journal, vol. 0(2B), pages 371-382.
    10. Liu, Pei-chen Barry & Hansen, Mark & Mukherjee, Avijit, 2008. "Scenario-based air traffic flow management: From theory to practice," Transportation Research Part B: Methodological, Elsevier, vol. 42(7-8), pages 685-702, August.
    11. Hélène Syed Zwick & S. Ali Shah Syed, 2017. "The polarization impact of the crisis on the Eurozone labour markets: a hierarchical cluster analysis," Applied Economics Letters, Taylor & Francis Journals, vol. 24(7), pages 472-476, April.
    12. Goethner, Maximilian & Hornuf, Lars & Regner, Tobias, 2021. "Protecting investors in equity crowdfunding: An empirical analysis of the small investor protection act," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    13. Bolívar, Fernando & Duran, Miguel A. & Lozano-Vivas, Ana, 2023. "Bank business models, size, and profitability," Finance Research Letters, Elsevier, vol. 53(C).
    14. Pennings, J.S.J. & van Kranenburg, H.L. & Hagedoorn, J., 2005. "Past, present and future of the telecommunications industry," Research Memorandum 016, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    15. Li-Xuan Qin & Steven G. Self, 2006. "The Clustering of Regression Models Method with Applications in Gene Expression Data," Biometrics, The International Biometric Society, vol. 62(2), pages 526-533, June.
    16. Caroline Méjean & Pauline Macouillard & Sandrine Péneau & Camille Lassale & Serge Hercberg & Katia Castetbon, 2014. "Association of Perception of Front-of-Pack Labels with Dietary, Lifestyle and Health Characteristics," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-11, March.
    17. Reder, Maik & Yürüşen, Nurseda Y. & Melero, Julio J., 2018. "Data-driven learning framework for associating weather conditions and wind turbine failures," Reliability Engineering and System Safety, Elsevier, vol. 169(C), pages 554-569.
    18. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    19. Marcin Gąsior, 2021. "Environmental Attitudes and Willingness to Purchase Online—Classification Approach," Sustainability, MDPI, vol. 13(15), pages 1-17, August.
    20. Marin, Giovanni & Vona, Francesco, 2019. "Climate policies and skill-biased employment dynamics: Evidence from EU countries," Journal of Environmental Economics and Management, Elsevier, vol. 98(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:endesu:v:25:y:2023:i:11:d:10.1007_s10668-022-02646-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.