IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v183y2020i3p1293-1311.html
   My bibliography  Save this article

Improving external validity of epidemiologic cohort analyses: a kernel weighting approach

Author

Listed:
  • Lingxiao Wang
  • Barry I. Graubard
  • Hormuzd A. Katki
  • and Yan Li

Abstract

For various reasons, cohort studies generally forgo probability sampling required to obtain population representative samples. However, such cohorts lack population representativeness, which invalidates estimates of population prevalences for novel health factors that are only available in cohorts. To improve external validity of estimates from cohorts, we propose a kernel weighting (KW) approach that uses survey data as a reference to create pseudoweights for cohorts. A jackknife variance is proposed for the KW estimates. In simulations, the KW method outperformed two existing propensity‐score‐based weighting methods in mean‐squared error while maintaining confidence interval coverage. We applied all methods to estimating US population mortality and prevalences of various diseases from the non‐representative US National Institutes of Health–American Association of Retired Persons cohort, using the sample from the US‐representative National Health Interview Survey as the reference. Assuming that the survey estimates are correct, the KW approach yielded generally less biased estimates compared with the existing propensity‐score‐based weighting methods.

Suggested Citation

  • Lingxiao Wang & Barry I. Graubard & Hormuzd A. Katki & and Yan Li, 2020. "Improving external validity of epidemiologic cohort analyses: a kernel weighting approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1293-1311, June.
  • Handle: RePEc:bla:jorssa:v:183:y:2020:i:3:p:1293-1311
    DOI: 10.1111/rssa.12564
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12564
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12564?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Greg Duncan, 2008. "When to promote, and when to avoid, a population perspective," Demography, Springer;Population Association of America (PAA), vol. 45(4), pages 763-784, November.
    2. Yan Li & Orestis A. Panagiotou & Amanda Black & Dandan Liao & Sholom Wacholder, 2016. "Multivariate piecewise exponential survival modeling," Biometrics, The International Biometric Society, vol. 72(2), pages 546-553, June.
    3. Czajka, John L, et al, 1992. "Projecting from Advance Data Using Propensity Modeling: An Application to Income and Tax Statistics," Journal of Business & Economic Statistics, American Statistical Association, vol. 10(2), pages 117-131, April.
    4. Brian K Lee & Justin Lessler & Elizabeth A Stuart, 2011. "Weight Trimming and Propensity Score Weighting," PLOS ONE, Public Library of Science, vol. 6(3), pages 1-6, March.
    5. Elizabeth A. Stuart & Stephen R. Cole & Catherine P. Bradshaw & Philip J. Leaf, 2011. "The use of propensity scores to assess the generalizability of results from randomized trials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 174(2), pages 369-386, April.
    6. Yan Li & Barry I. Graubard & Ralph DiGaetano, 2011. "Weighting methods for population‐based case–control studies with complex sampling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(2), pages 165-185, March.
    7. Niels Keiding & Thomas A. Louis, 2016. "Perils and potentials of self-selected entry to epidemiological studies and surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 319-376, February.
    8. Jean-François Beaumont, 2008. "A new approach to weighting and inference in sample surveys," Biometrika, Biometrika Trust, vol. 95(3), pages 539-553.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. María del Mar Rueda & Sergio Martínez-Puertas & Luis Castro-Martín, 2022. "Methods to Counter Self-Selection Bias in Estimations of the Distribution Function and Quantiles," Mathematics, MDPI, vol. 10(24), pages 1-19, December.
    2. Lingxiao Wang & Barry I. Graubard & Hormuzd A. Katki & Yan Li, 2022. "Efficient and robust propensity‐score‐based methods for population inference using epidemiologic cohorts," International Statistical Review, International Statistical Institute, vol. 90(1), pages 146-164, April.
    3. Luis Castro-Martín & María del Mar Rueda & Ramón Ferri-García & César Hernando-Tamayo, 2021. "On the Use of Gradient Boosting Methods to Improve the Estimation with Data Obtained with Self-Selection Procedures," Mathematics, MDPI, vol. 9(23), pages 1-23, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wendy Chan, 2018. "Applications of Small Area Estimation to Generalization With Subclassification by Propensity Scores," Journal of Educational and Behavioral Statistics, , vol. 43(2), pages 182-224, April.
    2. Ashley L. Buchanan & Michael G. Hudgens & Stephen R. Cole & Katie R. Mollan & Paul E. Sax & Eric S. Daar & Adaora A. Adimora & Joseph J. Eron & Michael J. Mugavero, 2018. "Generalizing evidence from randomized trials using inverse probability of sampling weights," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1193-1209, October.
    3. Ray Chambers & Setareh Ranjbar & Nicola Salvati & Barbara Pacini, 2022. "Weighting, informativeness and causal inference, with an application to rainfall enhancement," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1584-1612, October.
    4. Amang Sukasih & Donsig Jang & Sonya Vartivarian & Stephen Cohen & Fan Zhang, "undated". "A Simulation Study to Compare Weighting Methods for Survey Nonresponses in the National Survey of Recent College Graduates," Mathematica Policy Research Reports 613f000cac94492f91b53813f, Mathematica Policy Research.
    5. Amarendra Sharma, 2019. "Indira Awas Yojana and Housing Adequacy: An Evaluation using Propensity Score Matching," ASARC Working Papers 2019-05, The Australian National University, Australia South Asia Research Centre.
    6. Fabian Kosse & Thomas Deckers & Pia Pinger & Hannah Schildberg-Hörisch & Armin Falk, 2020. "The Formation of Prosociality: Causal Evidence on the Role of Social Environment," Journal of Political Economy, University of Chicago Press, vol. 128(2), pages 434-467.
    7. Glover, Steven & Jones, Sam, 2019. "Can commercial farming promote rural dynamism in sub-Saharan Africa? Evidence from Mozambique," World Development, Elsevier, vol. 114(C), pages 110-121.
    8. Nazmul Islam & Natalie E. Sheils & Megan S. Jarvis & Kenneth Cohen, 2022. "Comparative effectiveness over time of the mRNA-1273 (Moderna) vaccine and the BNT162b2 (Pfizer-BioNTech) vaccine," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    9. Bonnie E. Shook‐Sa & Michael G. Hudgens, 2022. "Power and sample size for observational studies of point exposure effects," Biometrics, The International Biometric Society, vol. 78(1), pages 388-398, March.
    10. Fan Li & Ashley L. Buchanan & Stephen R. Cole, 2022. "Generalizing trial evidence to target populations in non‐nested designs: Applications to AIDS clinical trials," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(3), pages 669-697, June.
    11. Elizabeth Tipton & Robert B. Olsen, "undated". "Enhancing the Generalizability of Impact Studies in Education," Mathematica Policy Research Reports 35d5625333dc480aba9765b3b, Mathematica Policy Research.
    12. Jörg Peters & Jörg Langbein & Gareth Roberts, 2018. "Generalization in the Tropics – Development Policy, Randomized Controlled Trials, and External Validity," The World Bank Research Observer, World Bank, vol. 33(1), pages 34-64.
    13. Xu Qin & Jonah Deutsch & Guanglei Hong, 2021. "Unpacking Complex Mediation Mechanisms And Their Heterogeneity Between Sites In A Job Corps Evaluation," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 40(1), pages 158-190, January.
    14. Robert W. Hahn & Robert D. Metcalfe, 2021. "Efficiency and Equity Impacts of Energy Subsidies," American Economic Review, American Economic Association, vol. 111(5), pages 1658-1688, May.
    15. Adesina, Adedoyin & Akogun, Oladele & Dillon, Andrew & Friedman, Jed & Njobdi, Sani & Serneels, Pieter, 2017. "Robustness and External Validity: What do we Learn from Repeated Study Designs over Time?," 2018 Allied Social Sciences Association (ASSA) Annual Meeting, January 5-7, 2018, Philadelphia, Pennsylvania 266292, Agricultural and Applied Economics Association.
    16. Kara E. Rudolph & Jonathan Levy & Mark J. van der Laan, 2021. "Transporting stochastic direct and indirect effects to new populations," Biometrics, The International Biometric Society, vol. 77(1), pages 197-211, March.
    17. Dasom Lee & Shu Yang & Lin Dong & Xiaofei Wang & Donglin Zeng & Jianwen Cai, 2023. "Improving trial generalizability using observational studies," Biometrics, The International Biometric Society, vol. 79(2), pages 1213-1225, June.
    18. Sharples, Linda D., 2018. "The role of statistics in the era of big data: Electronic health records for healthcare research," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 105-110.
    19. Denis Fougère & Nicolas Jacquemet, 2020. "Policy Evaluation Using Causal Inference Methods," SciencePo Working papers Main hal-03455978, HAL.
    20. D. R. Cox, 2009. "Randomization in the Design of Experiments," International Statistical Review, International Statistical Institute, vol. 77(3), pages 415-429, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:183:y:2020:i:3:p:1293-1311. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.