IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v21y2023i1p2-d1303380.html

Classification of Obesity among South African Female Adolescents: Comparative Analysis of Logistic Regression and Random Forest Algorithms

Author

Listed:
  • Ronel Sewpaul

    (Public Health, Societies and Belonging, Human Sciences Research Council, Merchant House, 2 Dock Rail Road, Cape Town 8001, South Africa)

  • Olushina Olawale Awe

    (Institute of Mathematics, Statistics and Scientific Computing (IMECC), University of Campinas, Campinas 13083-859, Brazil)

  • Dennis Makafui Dogbey

    (Medical Biotechnology and Immunotherapy Research Unit, Institute of Infectious Diseases and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town 7700, South Africa)

  • Machoene Derrick Sekgala

    (Non-Communicable Diseases, South African Medical Research Council, Cape Town 7505, South Africa)

  • Natisha Dukhi

    (Public Health, Societies and Belonging, Human Sciences Research Council, Merchant House, 2 Dock Rail Road, Cape Town 8001, South Africa)

Abstract

Background: This study evaluates the performance of logistic regression (LR) and random forest (RF) algorithms to model obesity among female adolescents in South Africa. Methods: Data was analysed on 375 females aged 15–17 from the South African National Health and Nutrition Examination Survey 2011/2012. The primary outcome was obesity, defined as body mass index (BMI) ≥ 30 kg/m 2 . A total of 31 explanatory variables were included, ranging from socio-economic, demographic, family history, dietary and health behaviour. RF and LR models were run using imbalanced data as well as after oversampling, undersampling, and hybrid sampling of the data. Results: Using the imbalanced data, the RF model performed better with higher precision, recall, F1 score, and balanced accuracy. Balanced accuracy was highest with the hybrid data (0.618 for RF and 0.668 for LR). Using the hybrid balanced data, the RF model performed better (F1-score = 0.940 for RF vs. 0.798 for LR). Conclusion: The model with the highest overall performance metrics was the RF model both before balancing the data and after applying hybrid balancing. Future work would benefit from using larger datasets on adolescent female obesity to assess the robustness of the models.

Suggested Citation

  • Ronel Sewpaul & Olushina Olawale Awe & Dennis Makafui Dogbey & Machoene Derrick Sekgala & Natisha Dukhi, 2023. "Classification of Obesity among South African Female Adolescents: Comparative Analysis of Logistic Regression and Random Forest Algorithms," IJERPH, MDPI, vol. 21(1), pages 1-15, December.
  • Handle: RePEc:gam:jijerp:v:21:y:2023:i:1:p:2-:d:1303380
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/21/1/2/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/21/1/2/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Shaoyan Zhang & Christos Tjortjis & Xiaojun Zeng & Hong Qiao & Iain Buchan & John Keane, 2009. "Comparing data mining methods with logistic regression in childhood obesity prediction," Information Systems Frontiers, Springer, vol. 11(4), pages 449-460, September.
    2. Deon Filmer & Lant Pritchett, 2001. "Estimating Wealth Effects Without Expenditure Data—Or Tears: An Application To Educational Enrollments In States Of India," Demography, Springer;Population Association of America (PAA), vol. 38(1), pages 115-132, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Diane Coffey & Ashwini Deshpande & Jeffrey Hammer & Dean Spears, 2019. "Local Social Inequality, Economic Inequality, and Disparities in Child Height in India," Demography, Springer;Population Association of America (PAA), vol. 56(4), pages 1427-1452, August.
    2. Alejandro Lopez-Feldman, 2013. "Climate change, agriculture, and poverty: A household level analysis for rural Mexico," Economics Bulletin, AccessEcon, vol. 33(2), pages 1126-1139.
    3. Cornelie Nienaber-Rousseau & Olusola F. Sotunde & Patricia O. Ukegbu & P. Hermanus Myburgh & Hattie H. Wright & Lize Havemann-Nel & Sarah J. Moss & Iolanthé M. Kruger & H. Salomé Kruger, 2017. "Socio-Demographic and Lifestyle Factors Predict 5-Year Changes in Adiposity among a Group of Black South African Adults," IJERPH, MDPI, vol. 14(9), pages 1-16, September.
    4. Kumar, Kaushalendra & Shukla, Ankita & Singh, Abhishek & Ram, Faujdar & Kowal, Paul, 2016. "Association between wealth and health among older adults in rural China and India," The Journal of the Economics of Ageing, Elsevier, vol. 7(C), pages 43-52.
    5. Winters, P. & Kafle, K. & Benfica, R., 2018. "IFAD RESEARCH SERIES 21 - Does relative deprivation induce migration? Evidence from sub-Saharan Africa," IFAD Research Series 280070, International Fund for Agricultural Development (IFAD).
    6. Langyintuo, Augustine S. & Mungoma, Catherine, 2008. "The effect of household wealth on the adoption of improved maize varieties in Zambia," Food Policy, Elsevier, vol. 33(6), pages 550-559, December.
    7. Ravi Prakash & Abhishek Singh, 2014. "Who Marries Whom? Changing Mate Selection Preferences in Urban India and Emerging Implications on Social Institutions," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 33(2), pages 205-227, April.
    8. Samikshya Poudel & Timothy Dobbins & Husna Razee & Blessing Akombi-Inyang, 2023. "Adolescent Pregnancy in South Asia: A Pooled Analysis of Demographic and Health Surveys," IJERPH, MDPI, vol. 20(12), pages 1-14, June.
    9. Tuccio, Michele & Wahba, Jackline & Hamdouch, Bachir, 2016. "International Migration: Driver of Political and Social Change?," IZA Discussion Papers 9794, IZA Network @ LISER.
    10. Pritchett, Lant & Sumarto, Sudarno & Suryahadi, Asep, 2001. "Targeted Programs in an Economic Crisis: Empirical Findings from Indonesia’s Experience," MPRA Paper 58727, University Library of Munich, Germany.
    11. Derek Headey & David Stifel & Liangzhi You & Zhe Guo, 2018. "Remoteness, urbanization, and child nutrition in sub‐Saharan Africa," Agricultural Economics, International Association of Agricultural Economists, vol. 49(6), pages 765-775, November.
    12. Barik, Debasis & Desai, Sonalde & Vanneman, Reeve, 2018. "Economic Status and Adult Mortality in India: Is the Relationship Sensitive to Choice of Indicators?," World Development, Elsevier, vol. 103(C), pages 176-187.
    13. repec:plo:pone00:0121767 is not listed on IDEAS
    14. Laetitia Duval & François-Charles Wolff, 2016. "Emigration intentions of Roma: evidence from Central and South-East Europe," Post-Communist Economies, Taylor & Francis Journals, vol. 28(1), pages 87-107, January.
    15. M Mahmud Khan & Sebastian Taylor & Chris Morry & Shyamkumar Sriram & Ibrahim Demir & Mizan Siddiqi, 2023. "How reliable is the asset score in measuring socioeconomic status? Comparing asset ownership reported by male and female heads of households," PLOS ONE, Public Library of Science, vol. 18(2), pages 1-15, February.
    16. Paschalis Arvanitidis & Athina Economou & Christos Kollias, 2016. "Terrorism’s effects on social capital in European countries," Public Choice, Springer, vol. 169(3), pages 231-250, December.
    17. Yi Rong Hoo & George Joseph & Rafael Rivera & Susanna Smets & Hanh Nguyen & Per Ljung & Sreymom Um & Georgia Davis & Jeff Albert, 2022. "Strategic complements: Poverty-targeted subsidy programs show additive benefits on household toilet purchases in rural Cambodia when coupled with sanitation marketing," PLOS ONE, Public Library of Science, vol. 17(6), pages 1-27, June.
    18. Christopher H. Herbst & Monique Vledder & Karen Campbell & Mirja Sjöblom & Agnes Soucat, 2011. "The Human Resources for Health Crisis in Zambia : An Outcome of Health Worker Entry, Exit, and Performance within the National Health Labor Market," World Bank Publications - Books, The World Bank Group, number 5938, April.
    19. Maria C. Lo Bue, 2024. "Drivers of changes in child nutritional conditions: A panel data‐based study on Indonesian households, 1997–2014," Review of Development Economics, Wiley Blackwell, vol. 28(2), pages 741-776, May.
    20. Janina Isabel Steinert & Lucie Dale Cluver & G. J. Melendez-Torres & Sebastian Vollmer, 2018. "One Size Fits All? The Validity of a Composite Poverty Index Across Urban and Rural Households in South Africa," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 136(1), pages 51-72, February.
    21. Pulkit Sharma & Achut Manandhar & Patrick Thomson & Jacob Katuva & Robert Hope & David A. Clifton, 2019. "Combining Multi-Modal Statistics for Welfare Prediction Using Deep Learning," Sustainability, MDPI, vol. 11(22), pages 1-15, November.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:21:y:2023:i:1:p:2-:d:1303380. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.