IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0167055.html
   My bibliography  Save this article

Into the Bowels of Depression: Unravelling Medical Symptoms Associated with Depression by Applying Machine-Learning Techniques to a Community Based Population Sample

Author

Listed:
  • Joanna F Dipnall
  • Julie A Pasco
  • Michael Berk
  • Lana J Williams
  • Seetal Dodd
  • Felice N Jacka
  • Denny Meyer

Abstract

Background: Depression is commonly comorbid with many other somatic diseases and symptoms. Identification of individuals in clusters with comorbid symptoms may reveal new pathophysiological mechanisms and treatment targets. The aim of this research was to combine machine-learning (ML) algorithms with traditional regression techniques by utilising self-reported medical symptoms to identify and describe clusters of individuals with increased rates of depression from a large cross-sectional community based population epidemiological study. Methods: A multi-staged methodology utilising ML and traditional statistical techniques was performed using the community based population National Health and Nutrition Examination Study (2009–2010) (N = 3,922). A Self-organised Mapping (SOM) ML algorithm, combined with hierarchical clustering, was performed to create participant clusters based on 68 medical symptoms. Binary logistic regression, controlling for sociodemographic confounders, was used to then identify the key clusters of participants with higher levels of depression (PHQ-9≥10, n = 377). Finally, a Multiple Additive Regression Tree boosted ML algorithm was run to identify the important medical symptoms for each key cluster within 17 broad categories: heart, liver, thyroid, respiratory, diabetes, arthritis, fractures and osteoporosis, skeletal pain, blood pressure, blood transfusion, cholesterol, vision, hearing, psoriasis, weight, bowels and urinary. Results: Five clusters of participants, based on medical symptoms, were identified to have significantly increased rates of depression compared to the cluster with the lowest rate: odds ratios ranged from 2.24 (95% CI 1.56, 3.24) to 6.33 (95% CI 1.67, 24.02). The ML boosted regression algorithm identified three key medical condition categories as being significantly more common in these clusters: bowel, pain and urinary symptoms. Bowel-related symptoms was found to dominate the relative importance of symptoms within the five key clusters. Conclusion: This methodology shows promise for the identification of conditions in general populations and supports the current focus on the potential importance of bowel symptoms and the gut in mental health research.

Suggested Citation

  • Joanna F Dipnall & Julie A Pasco & Michael Berk & Lana J Williams & Seetal Dodd & Felice N Jacka & Denny Meyer, 2016. "Into the Bowels of Depression: Unravelling Medical Symptoms Associated with Depression by Applying Machine-Learning Techniques to a Community Based Population Sample," PLOS ONE, Public Library of Science, vol. 11(12), pages 1-19, December.
  • Handle: RePEc:plo:pone00:0167055
    DOI: 10.1371/journal.pone.0167055
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0167055
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0167055&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0167055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Joanna F Dipnall & Julie A Pasco & Michael Berk & Lana J Williams & Seetal Dodd & Felice N Jacka & Denny Meyer, 2016. "Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-23, February.
    2. Lumley, Thomas, 2004. "Analysis of Complex Survey Samples," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 9(i08).
    3. Chris Poulin & Brian Shiner & Paul Thompson & Linas Vepstas & Yinong Young-Xu & Benjamin Goertzel & Bradley Watts & Laura Flashman & Thomas McAllister, 2014. "Predicting the Risk of Suicide by Analyzing the Text of Clinical Notes," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-7, January.
    4. Matthias Schonlau, 2005. "Boosted regression (boosting): An introductory tutorial and a Stata plugin," Stata Journal, StataCorp LLC, vol. 5(3), pages 330-354, September.
    5. Lawrence A. David & Corinne F. Maurice & Rachel N. Carmody & David B. Gootenberg & Julie E. Button & Benjamin E. Wolfe & Alisha V. Ling & A. Sloan Devlin & Yug Varma & Michael A. Fischbach & Sudha B. , 2014. "Diet rapidly and reproducibly alters the human gut microbiome," Nature, Nature, vol. 505(7484), pages 559-563, January.
    6. Wehrens, Ron & Buydens, Lutgarde M. C., 2007. "Self- and Super-organizing Maps in R: The kohonen Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 21(i05).
    7. Kellie J. Archer & Stanley Lemeshow, 2006. "Goodness-of-fit test for a logistic regression model fitted using survey sample data," Stata Journal, StataCorp LLC, vol. 6(1), pages 97-105, March.
    8. Felice N Jacka & Nicolas Cherbuin & Kaarin J Anstey & Peter Butterworth, 2014. "Dietary Patterns and Depressive Symptoms over Time: Examining the Relationships with Socioeconomic Position, Health Behaviours and Cardiovascular Risk," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-9, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Irene Mosca & Alan Barrett, 2016. "The impact of adult child emigration on the mental health of older parents," Journal of Population Economics, Springer;European Society for Population Economics, vol. 29(3), pages 687-719, July.
    2. Maciej Berk{e}sewicz & Herman Cherniaiev & Robert Pater, 2021. "Estimating the number of entities with vacancies using administrative and online data," Papers 2106.03263, arXiv.org.
    3. Bo Qiu & JiaXu Liang & Cong Li, 2023. "Effects of fecal microbiota transplantation in metabolic syndrome: A meta-analysis of randomized controlled trials," PLOS ONE, Public Library of Science, vol. 18(7), pages 1-16, July.
    4. Janvier Gasana & Boubakari Ibrahimou & Ahmed N. Albatineh & Mustafa Al-Zoughool & Dina Zein, 2021. "Exposures in the Indoor Environment and Prevalence of Allergic Conditions in the United States of America," IJERPH, MDPI, vol. 18(9), pages 1-13, May.
    5. Petri K M Purola & Joonas Taipale & Saku Väätäinen & Mika Harju & Seppo V P Koskinen & Hannu M T Uusitalo, 2023. "Price tag of glaucoma care is minor compared with the total direct and indirect costs of glaucoma: Results from nationwide survey and register data," PLOS ONE, Public Library of Science, vol. 18(12), pages 1-18, December.
    6. Michał Brzozowski & Grzegorz Tchorek, 2017. "Exchange Rate Risk as an Obstacle to Export Activity," Gospodarka Narodowa. The Polish Journal of Economics, Warsaw School of Economics, issue 3, pages 115-141.
    7. Jonathan Wakefield & Taylor Okonek & Jon Pedersen, 2020. "Small Area Estimation for Disease Prevalence Mapping," International Statistical Review, International Statistical Institute, vol. 88(2), pages 398-418, August.
    8. Fenton, Alex, 2013. "Small-area measures of income poverty," LSE Research Online Documents on Economics 58053, London School of Economics and Political Science, LSE Library.
    9. repec:cep:sticas:/173 is not listed on IDEAS
    10. repec:plo:pone00:0077941 is not listed on IDEAS
    11. Preetam Debasish Saha Roy & Prabhat Kumar Tiwari, 2019. "Knowledge discovery and predictive accuracy comparison of different classification algorithms for mould level fluctuation phenomenon in thin slab caster," Journal of Intelligent Manufacturing, Springer, vol. 30(1), pages 241-254, January.
    12. Camelia Herman & Colleen M. Leonard & Perpetua Uhomoibhi & Mark Maire & Delynn Moss & Uwem Inyang & Ado Abubakar & Abiodun Ogunniyi & Nwando Mba & Stacie M. Greby & McPaul I. Okoye & Nnaemeka C. Iriem, 2023. "Non-falciparum malaria infection and IgG seroprevalence among children under 15 years in Nigeria, 2018," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Elijah O. Onsomu & DaKysha Moore & Benta A. Abuya & Peggy Valentine & Vanessa Duren-Winfield, 2013. "Importance of the Media in Scaling-Up HIV Testing in Kenya," SAGE Open, , vol. 3(3), pages 21582440134, July.
    14. Vinas-Forcade, Jennifer & Seijas, María Noé, 2021. "To teach or not to teach: Negative selection into the teaching profession in Uruguay," International Journal of Educational Development, Elsevier, vol. 84(C).
    15. Zhongqi Fan & Amy M. Yang & Marcus Lehr & Ana B. Ronan & Ryan B. Simpson & Kimberly H. Nguyen & Elena N. Naumova & Naglaa H. El-Abbadi, 2024. "Food Insecurity across Age Groups in the United States during the COVID-19 Pandemic," IJERPH, MDPI, vol. 21(8), pages 1-19, August.
    16. Olusola Sanwo & Ihoghosa Iyamu & Augustine Idemudia & Titilope Badru & Sylvia Ekponimo & Dorothy Oqua & Olusesan A Makinde & Gambo G Aliyu & Abimbola Kola-Jebutu & Jemeh Egwuagu-Pius & Chika Obiora-Ok, 2023. "Willingness to pay for antiretroviral therapy, viral load, and premium services; A contingent valuation survey of people living with HIV in southern Nigeria," PLOS ONE, Public Library of Science, vol. 18(11), pages 1-16, November.
    17. Matthew R. Williams & Terrance D. Savitsky, 2021. "Uncertainty Estimation for Pseudo‐Bayesian Inference Under Complex Sampling," International Statistical Review, International Statistical Institute, vol. 89(1), pages 72-107, April.
    18. Mehmet Güney Celbiş & Pui-Hang Wong & Karima Kourtit & Peter Nijkamp, 2021. "Innovativeness, Work Flexibility, and Place Characteristics: A Spatial Econometric and Machine Learning Approach," Sustainability, MDPI, vol. 13(23), pages 1-29, December.
    19. Wang, Jianqiang C., 2012. "Sample distribution function based goodness-of-fit test for complex surveys," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 664-679.
    20. Alejandro Aybar-Flores & Alvaro Talavera & Elizabeth Espinoza-Portilla, 2023. "Predicting the HIV/AIDS Knowledge among the Adolescent and Young Adult Population in Peru: Application of Quasi-Binomial Logistic Regression and Machine Learning Algorithms," IJERPH, MDPI, vol. 20(7), pages 1-29, March.
    21. Joseph R Starnes & Chiara Di Gravio & Rebecca Irlmeier & Ryan Moore & Vincent Okoth & Ash Rogers & Daniele J Ressler & Troy D Moon, 2021. "Characterizing multidimensional poverty in Migori County, Kenya and its association with depression," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-10, November.
    22. Christian A. Maino Vieytes & Ruoqing Zhu & Francesca Gany & Amirah Burton-Obanla & Anna E. Arthur, 2022. "Empirical Dietary Patterns Associated with Food Insecurity in U.S. Cancer Survivors: NHANES 1999–2018," IJERPH, MDPI, vol. 19(21), pages 1-21, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0167055. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.