IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i12p7023-d834120.html
   My bibliography  Save this article

Methodological Issues in Analyzing Real-World Longitudinal Occupational Health Data: A Useful Guide to Approaching the Topic

Author

Listed:
  • Rémi Colin-Chevalier

    (CNRS, LaPSCo, Physiological and Psychosocial Stress, Université Clermont Auvergne, F-63000 Clermont-Ferrand, France
    Preventive and Occupational Medicine, CHU Clermont-Ferrand, F-63000 Clermont-Ferrand, France
    Wittyfit, F-75000 Paris, France)

  • Frédéric Dutheil

    (CNRS, LaPSCo, Physiological and Psychosocial Stress, Université Clermont Auvergne, F-63000 Clermont-Ferrand, France
    Preventive and Occupational Medicine, CHU Clermont-Ferrand, F-63000 Clermont-Ferrand, France
    Wittyfit, F-75000 Paris, France)

  • Sébastien Cambier

    (Biostatistics Unit, The Clinical Research and Innovation Direction, University Hospital of Clermont-Ferrand, CHU Clermont-Ferrand, F-63000 Clermont-Ferrand, France)

  • Samuel Dewavrin

    (Wittyfit, F-75000 Paris, France)

  • Thomas Cornet

    (Wittyfit, F-75000 Paris, France)

  • Julien Steven Baker

    (Centre for Health and Exercise Science Research, Hong Kong Baptist University, Kowloon Tong, Hong Kong 999077, China)

  • Bruno Pereira

    (Biostatistics Unit, The Clinical Research and Innovation Direction, University Hospital of Clermont-Ferrand, CHU Clermont-Ferrand, F-63000 Clermont-Ferrand, France)

Abstract

Ever greater technological advances and democratization of digital tools such as computers and smartphones offer researchers new possibilities to collect large amounts of health data in order to conduct clinical research. Such data, called real-world data, appears to be a perfect complement to traditional randomized clinical trials and has become more important in health decisions. Due to its longitudinal nature, real-world data is subject to specific and well-known methodological issues, namely issues with the analysis of cluster-correlated data, missing data and longitudinal data itself. These concepts have been widely discussed in the literature and many methods and solutions have been proposed to cope with these issues. As examples, mixed and trajectory models have been developed to explore longitudinal data sets, imputation methods can resolve missing data issues, and multilevel models facilitate the treatment of cluster-correlated data. Nevertheless, the analysis of real-world longitudinal occupational health data remains difficult, especially when the methodological challenges overlap. The purpose of this article is to present various solutions developed in the literature to deal with cluster-correlated data, missing data and longitudinal data, sometimes overlapped, in an occupational health context. The novelty and usefulness of our approach is supported by a step-by-step search strategy and an example from the Wittyfit database, which is an epidemiological database of occupational health data. Therefore, we hope that this article will facilitate the work of researchers in the field and improve the accuracy of future studies.

Suggested Citation

  • Rémi Colin-Chevalier & Frédéric Dutheil & Sébastien Cambier & Samuel Dewavrin & Thomas Cornet & Julien Steven Baker & Bruno Pereira, 2022. "Methodological Issues in Analyzing Real-World Longitudinal Occupational Health Data: A Useful Guide to Approaching the Topic," IJERPH, MDPI, vol. 19(12), pages 1-20, June.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:12:p:7023-:d:834120
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/12/7023/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/12/7023/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. P. Diggle & M. G. Kenward, 1994. "Informative Drop‐Out in Longitudinal Data Analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 43(1), pages 49-73, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David M. Murray & Jonathan L. Blitstein, 2003. "Methods To Reduce The Impact Of Intraclass Correlation In Group-Randomized Trials," Evaluation Review, , vol. 27(1), pages 79-103, February.
    2. Patrick E. B. FitzGerald, 2002. "Extended Generalized Estimating Equations for Binary Familial Data with Incomplete Families," Biometrics, The International Biometric Society, vol. 58(4), pages 718-726, December.
    3. Pourahmadi, Mohsen & Daniels, Michael J. & Park, Trevor, 2007. "Simultaneous modelling of the Cholesky decomposition of several covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 98(3), pages 568-587, March.
    4. Sinha, Sanjoy K. & Kaushal, Amit & Xiao, Wenzhong, 2014. "Inference for longitudinal data with nonignorable nonmonotone missing responses," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 77-91.
    5. E. Michael Foster & Grace Y. Fang, 2004. "Alternative Methods for Handling Attrition," Evaluation Review, , vol. 28(5), pages 434-464, October.
    6. Mette Ejrnæs & Anders Holm, 2006. "Comparing Fixed Effects and Covariance Structure Estimators for Panel Data," Sociological Methods & Research, , vol. 35(1), pages 61-83, August.
    7. Geert Verbeke & Geert Molenberghs & Herbert Thijs & Emmanuel Lesaffre & Michael G. Kenward, 2001. "Sensitivity Analysis for Nonrandom Dropout: A Local Influence Approach," Biometrics, The International Biometric Society, vol. 57(1), pages 7-14, March.
    8. Rebecca E. Anthony & Amy L. Paine & Katherine H. Shelton, 2019. "Depression and Anxiety Symptoms of British Adoptive Parents: A Prospective Four-Wave Longitudinal Study," IJERPH, MDPI, vol. 16(24), pages 1-14, December.
    9. Miran A. Jaffa & Ayad A. Jaffa, 2019. "A Likelihood-Based Approach with Shared Latent Random Parameters for the Longitudinal Binary and Informative Censoring Processes," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(3), pages 597-613, December.
    10. Molenberghs, Geert & Verbeke, Geert & Thijs, Herbert & Lesaffre, Emmanuel & Kenward, Michael G., 2001. "Influence analysis to assess sensitivity of the dropout process," Computational Statistics & Data Analysis, Elsevier, vol. 37(1), pages 93-113, July.
    11. Shu Xu & Shelley A. Blozis, 2011. "Sensitivity Analysis of Mixed Models for Incomplete Longitudinal Data," Journal of Educational and Behavioral Statistics, , vol. 36(2), pages 237-256, April.
    12. Sebastian Domhof & Edgar Brunner & D. Wayne Osgood, 2002. "Rank Procedures for Repeated Measures with Missing Values," Sociological Methods & Research, , vol. 30(3), pages 367-393, February.
    13. Amelia M. Haviland & Bobby L. Jones & Daniel S. Nagin, 2011. "Group-based Trajectory Modeling Extended to Account for Nonrandom Participant Attrition," Sociological Methods & Research, , vol. 40(2), pages 367-390, May.
    14. Shelley A. Blozis & Jeffrey R. Harring, 2017. "Understanding Individual-level Change Through the Basis Functions of a Latent Curve Model," Sociological Methods & Research, , vol. 46(4), pages 793-820, November.
    15. Michael J. Daniels & Joseph W. Hogan, 2000. "Reparameterizing the Pattern Mixture Model for Sensitivity Analyses Under Informative Dropout," Biometrics, The International Biometric Society, vol. 56(4), pages 1241-1248, December.
    16. Lars Relund Nielsen & Erik Jørgensen & Søren Højsgaard, 2011. "Embedding a state space model into a Markov decision process," Annals of Operations Research, Springer, vol. 190(1), pages 289-309, October.
    17. Jennifer Chan & Wai Wan, 2011. "Bayesian approach to analysing longitudinal bivariate binary data with informative dropout," Computational Statistics, Springer, vol. 26(1), pages 121-144, March.
    18. Jayajit Chakraborty & Timothy W. Collins & Sara E. Grineski & Alejandra Maldonado, 2017. "Racial Differences in Perceptions of Air Pollution Health Risk: Does Environmental Exposure Matter?," IJERPH, MDPI, vol. 14(2), pages 1-16, January.
    19. Jolene Birmingham & Garrett M. Fitzmaurice, 2002. "A Pattern-Mixture Model for Longitudinal Binary Responses with Nonignorable Nonresponse," Biometrics, The International Biometric Society, vol. 58(4), pages 989-996, December.
    20. Anders Skrondal & Sophia Rabe-Hesketh, 2022. "The Role of Conditional Likelihoods in Latent Variable Modeling," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 799-834, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:12:p:7023-:d:834120. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.