Printed from https://ideas.repec.org/a/plo/pone00/0266516.html

Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies

Author

Listed:
  • Daniel A Adler
  • Fei Wang
  • David C Mohr
  • Tanzeem Choudhury

Abstract

Mobile sensing data processed using machine learning models can passively and remotely assess mental health symptoms from the context of patients’ lives. Prior work has trained models using data from single longitudinal studies, collected from demographically homogeneous populations, over short time periods, using a single data collection platform or mobile application. The generalizability of model performance across studies has not been assessed. This study presents a first analysis of whether models trained using combined longitudinal study data to predict mental health symptoms generalize across current publicly available data. We combined data from the CrossCheck (individuals living with schizophrenia) and StudentLife (university students) studies. In addition to assessing generalizability, we explored whether personalizing models to align mobile sensing data, and oversampling less-represented severe symptoms, improved model performance. Leave-one-subject-out cross-validation (LOSO-CV) results were reported. Two symptoms (sleep quality and stress) had similar question-response structures across studies and were used as outcomes to explore cross-dataset prediction. Models trained with combined data were more likely to be predictive (that is, to improve significantly over predicting the training data mean) than models trained with single-study data. Expected model performance improved when using combined rather than single-study data decreased the distance between training and validation feature distributions. Personalization aligned each LOSO-CV participant with training data, but improved prediction only for CrossCheck stress. Oversampling significantly improved severe symptom classification sensitivity and positive predictive value, but decreased model specificity. Taken together, these results show that machine learning models trained on combined longitudinal study data may generalize across heterogeneous datasets. We encourage researchers to disseminate collected de-identified mobile sensing and mental health symptom data, and to further standardize data types collected across studies to enable better assessment of model generalizability.
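
The evaluation scheme named in the abstract, LOSO-CV with a comparison against predicting the training data mean, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the data are synthetic, the model is a closed-form ridge regression, the error metric is mean absolute error, and the names `make_synthetic` and `loso_cv` are hypothetical.

```python
import numpy as np


def make_synthetic(n_subjects=8, n_days=30, n_features=4, seed=0):
    """Synthetic stand-in for daily sensing features and a symptom score."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n_subjects * n_days, n_features))
    subjects = np.repeat(np.arange(n_subjects), n_days)
    # Assumed linear signal plus noise; not the structure of the real data.
    y = X @ np.ones(n_features) + rng.normal(scale=0.5, size=X.shape[0])
    return X, y, subjects


def loso_cv(X, y, subjects, alpha=1.0):
    """Leave-one-subject-out CV.

    Returns {subject_id: (model_mae, baseline_mae)}, where the baseline
    predicts the mean of the training targets for every held-out sample.
    """
    results = {}
    for s in np.unique(subjects):
        test = subjects == s
        train = ~test

        # Standardize with training statistics only, to avoid leakage.
        mu = X[train].mean(axis=0)
        sd = X[train].std(axis=0)
        sd[sd == 0] = 1.0
        X_tr = (X[train] - mu) / sd
        X_te = (X[test] - mu) / sd

        # Closed-form ridge regression on mean-centered targets.
        y_mean = y[train].mean()
        A = X_tr.T @ X_tr + alpha * np.eye(X_tr.shape[1])
        w = np.linalg.solve(A, X_tr.T @ (y[train] - y_mean))

        pred = X_te @ w + y_mean
        model_mae = float(np.abs(pred - y[test]).mean())
        baseline_mae = float(np.abs(y_mean - y[test]).mean())
        results[int(s)] = (model_mae, baseline_mae)
    return results


if __name__ == "__main__":
    X, y, subjects = make_synthetic()
    for subject, (model_mae, baseline_mae) in loso_cv(X, y, subjects).items():
        print(f"subject {subject}: model MAE {model_mae:.3f}, "
              f"mean-baseline MAE {baseline_mae:.3f}")
```

Holding out all of one participant's samples at a time means the model is always scored on a person it has never seen. A paired test over the per-subject errors would be one way to judge whether the improvement over the mean baseline is significant; the sketch does not perform such a test.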

Suggested Citation

  • Daniel A Adler & Fei Wang & David C Mohr & Tanzeem Choudhury, 2022. "Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies," PLOS ONE, Public Library of Science, vol. 17(4), pages 1-20, April.
  • Handle: RePEc:plo:pone00:0266516
    DOI: 10.1371/journal.pone.0266516

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266516
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0266516&type=printable
    Download Restriction: no


    References listed on IDEAS

    1. Bernard Rosner & Robert J. Glynn & Mei-Ling T. Lee, 2006. "The Wilcoxon Signed Rank Test for Paired Comparisons of Clustered Data," Biometrics, The International Biometric Society, vol. 62(1), pages 185-192, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Somnath Datta & Glen A. Satten, 2008. "A Signed-Rank Test for Clustered Data," Biometrics, The International Biometric Society, vol. 64(2), pages 501-507, June.
    2. Syed Emad Azhar Ali & Fong-Woon Lai & Ahmad Ali Jan & Haseeb ur Rahman & Syed Quaid Ali Shah & Salaheldin Hamad, 2024. "Does intellectual capital curb the long-term effect of information security breaches on firms’ market value?," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(4), pages 3673-3702, August.
    3. Anna Urbanek & Anna Losa & Monika Wieczorek-Kosmala & Karel Hlaváček & Aleš Lokaj, 2023. "Did the Quality of Digital Communication Skills in Education Improve after the Pandemic? Evidence from HEIs," Sustainability, MDPI, vol. 15(15), pages 1-22, August.
    4. Bagkavos, Dimitrios & Patil, Prakash N., 2021. "Improving the Wilcoxon signed rank test by a kernel smooth probability integral transformation," Statistics & Probability Letters, Elsevier, vol. 171(C).
    5. Slepicka, Jessie, 2022. "Reassessing the missing link in general deterrence research: A behavioral economic approach," Journal of Criminal Justice, Elsevier, vol. 82(C).
    6. del Campo, Cristina & Urquía-Grande, Elena & Pascual-Ezama, David, 2023. "Internationalizing the business school: A comparative analysis of English-medium and Spanish-medium instruction impact on student performance," Evaluation and Program Planning, Elsevier, vol. 98(C).
    7. Sandipan Dutta, 2022. "Robust Testing of Paired Outcomes Incorporating Covariate Effects in Clustered Data with Informative Cluster Size," Stats, MDPI, vol. 5(4), pages 1-13, December.
    8. Jasleen Kaur & Khushdeep Dharni, 2022. "Assessing efficacy of association rules for predicting global stock indices," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 49(3), pages 329-339, September.
    9. Haataja, Riina & Larocque, Denis & Nevalainen, Jaakko & Oja, Hannu, 2009. "A weighted multivariate signed-rank test for cluster-correlated data," Journal of Multivariate Analysis, Elsevier, vol. 100(6), pages 1107-1119, July.
    10. Brigitte Fong Yeong Woo & Wilson Wai San Tam & Taiju Rangpa & Wei Fong Liau & Jennifer Nathania & Toon Wei Lim, 2022. "A Nurse-Led Integrated Chronic Care E-Enhanced Atrial Fibrillation (NICE-AF) Clinic in the Community: A Preliminary Evaluation," IJERPH, MDPI, vol. 19(8), pages 1-15, April.
    11. Thuy-Ninh Dao & Po-Han Chen & The-Quan Nguyen, 2020. "Enhancement of Mutual Recognition and Mobility of BIM Experts in ASEAN Countries," Sustainability, MDPI, vol. 12(18), pages 1-20, September.
    12. Hamza Zubair & Ampol Karoonsoontawong & Kunnawee Kanitpong, 2022. "Effects of COVID-19 on Travel Behavior and Mode Choice: A Case Study for the Bangkok Metropolitan Area," Sustainability, MDPI, vol. 14(15), pages 1-26, July.
    13. Saif Uddin & Montaha Behbehani & Nazima Habibi & Scott W. Fowler & Hanan A. Al-Sarawi & Carlos Alonso-Hernandez, 2023. "Microplastics Residence Time in Marine Copepods: An Experimental Study," Sustainability, MDPI, vol. 15(20), pages 1-12, October.
    14. Peng Zeng & Ming Wei & Xiaoyang Liu, 2020. "Investigating the Spatiotemporal Dynamics of Urban Vitality Using Bicycle-Sharing Data," Sustainability, MDPI, vol. 12(5), pages 1-14, February.
    15. Tarek Numair & Daniel Toshio Harrell & Nguyen Tien Huy & Futoshi Nishimoto & Yvonne Muthiani & Samson Muuo Nzou & Angkhana Lasaphonh & Khomsonerasinh Palama & Tiengkham Pongvongsa & Kazuhiko Moji & Ke, 2021. "Barriers to the Digitization of Health Information: A Qualitative and Quantitative Study in Kenya and Lao PDR Using a Cloud-Based Maternal and Child Registration System," IJERPH, MDPI, vol. 18(12), pages 1-15, June.
    16. Dohyun Kim & Sungmin You & Soonwon So & Jongshill Lee & Sunhyun Yook & Dong Pyo Jang & In Young Kim & Eunkyoung Park & Kyeongwon Cho & Won Chul Cha & Dong Wook Shin & Baek Hwan Cho & Hoon-Ki Park, 2018. "A data-driven artificial intelligence model for remote triage in the prehospital environment," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-14, October.
    17. Mohd Sakib & Tamanna Siddiqui & Suhel Mustajab & Reemiah Muneer Alotaibi & Nouf Mohammad Alshareef & Mohammad Zunnun Khan, 2025. "An ensemble deep learning framework for energy demand forecasting using genetic algorithm-based feature selection," PLOS ONE, Public Library of Science, vol. 20(1), pages 1-28, January.
    18. Lan Wang & Runze Li, 2009. "Weighted Wilcoxon-Type Smoothly Clipped Absolute Deviation Method," Biometrics, The International Biometric Society, vol. 65(2), pages 564-571, June.
    19. Ana Lazcano & Pedro Javier Herrera & Manuel Monge, 2023. "A Combined Model Based on Recurrent Neural Networks and Graph Convolutional Networks for Financial Time Series Forecasting," Mathematics, MDPI, vol. 11(1), pages 1-21, January.
    20. Maul, D. & Schiereck, D., 2017. "The bond event study methodology since 1974," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 80723, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
