IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v4y2008i1n19.html

Simple Optimal Weighting of Cases and Controls in Case-Control Studies

Author

Listed:
  • Rose Sherri

    (University of California, Berkeley)

  • van der Laan Mark J.

    (University of California, Berkeley)

Abstract

Researchers of uncommon diseases are often interested in assessing potential risk factors. Given the low incidence of disease, these studies are frequently case-control in design. Such a design allows a sufficient number of cases to be obtained without extensive sampling and can increase efficiency; however, these case-control samples are then biased since the proportion of cases in the sample is not the same as the population of interest. Methods for analyzing case-control studies have focused on utilizing logistic regression models that provide conditional and not causal estimates of the odds ratio. This article will demonstrate the use of the prevalence probability and case-control weighted targeted maximum likelihood estimation (MLE), as described by van der Laan (2008), in order to obtain causal estimates of the parameters of interest (risk difference, relative risk, and odds ratio). It is meant to be used as a guide for researchers, with step-by-step directions to implement this methodology. We will also present simulation studies that show the improved efficiency of the case-control weighted targeted MLE compared to other techniques.

Suggested Citation

  • Rose Sherri & van der Laan Mark J., 2008. "Simple Optimal Weighting of Cases and Controls in Case-Control Studies," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-26, September.
  • Handle: RePEc:bpj:ijbist:v:4:y:2008:i:1:n:19
    DOI: 10.2202/1557-4679.1115
    as

    Download full text from publisher

    File URL: https://doi.org/10.2202/1557-4679.1115
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.2202/1557-4679.1115?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Anthony P. Morise & George A. Diamond & Robert Detrano & Marco Bobbio & Erdogan Gunel, 1996. "The Effect of Disease-prevalence Adjustments on the Accuracy of a Logistic Prediction Model," Medical Decision Making, , vol. 16(2), pages 133-142, June.
    2. van der Laan Mark J., 2006. "Statistical Inference for Variable Importance," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-33, February.
    3. Paul W. Holland & Donald B. Rubin, 1988. "Causal Inference in Retrospective Studies," Evaluation Review, , vol. 12(3), pages 203-231, June.
    4. van der Laan Mark J., 2008. "Estimation Based on Case-Control Designs with Known Prevalence Probability," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-59, September.
    5. van der Laan Mark J. & Rubin Daniel, 2006. "Targeted Maximum Likelihood Learning," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-40, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. van der Laan Mark J., 2010. "Targeted Maximum Likelihood Based Causal Inference: Part I," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-45, February.
    2. Rose Sherri & van der Laan Mark J., 2011. "A Targeted Maximum Likelihood Estimator for Two-Stage Designs," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-21, March.
    3. Etienne Volatier & Francesco Salvo & Antoine Pariente & Émeline Courtois & Sylvie Escolano & Pascale Tubert-Bitter & Ismaïl Ahmed, 2022. "High-Dimensional Propensity Score-Adjusted Case-Crossover for Discovering Adverse Drug Reactions from Computerized Administrative Healthcare Databases," Drug Safety, Springer, vol. 45(3), pages 275-285, March.
    4. van der Laan Mark J. & Gruber Susan, 2010. "Collaborative Double Robust Targeted Maximum Likelihood Estimation," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-71, May.
    5. van der Laan Mark J. & Gruber Susan, 2012. "Targeted Minimum Loss Based Estimation of Causal Effects of Multiple Time Point Interventions," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-41, May.
    6. O. Saarela & L. R. Belzile & D. A. Stephens, 2016. "A Bayesian view of doubly robust causal inference," Biometrika, Biometrika Trust, vol. 103(3), pages 667-681.
    7. Sherri Rose & Julie Shi & Thomas G. McGuire & Sharon-Lise T. Normand, 2017. "Matching and Imputation Methods for Risk Adjustment in the Health Insurance Marketplaces," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 525-542, December.
    8. Noah Zaitlen & Sara Lindström & Bogdan Pasaniuc & Marilyn Cornelis & Giulio Genovese & Samuela Pollack & Anne Barton & Heike Bickeböller & Donald W Bowden & Steve Eyre & Barry I Freedman & David J Fri, 2012. "Informed Conditioning on Clinical Covariates Increases Power in Case-Control Association Studies," PLOS Genetics, Public Library of Science, vol. 8(11), pages 1-13, November.
    9. van der Laan Mark J. & Petersen Maya & Zheng Wenjing, 2013. "Estimating the Effect of a Community-Based Intervention with Two Communities," Journal of Causal Inference, De Gruyter, vol. 1(1), pages 83-106, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tuglus Catherine & van der Laan Mark J., 2011. "Repeated Measures Semiparametric Regression Using Targeted Maximum Likelihood Methodology with Application to Transcription Factor Activity Discovery," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-31, January.
    2. Geeven Geert & van der Laan Mark J. & de Gunst Mathisca C.M., 2012. "Comparison of Targeted Maximum Likelihood and Shrinkage Estimators of Parameters in Gene Networks," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-29, September.
    3. Elise D Riley & Torsten B Neilands & Kelly Moore & Jennifer Cohen & David R Bangsberg & Diane Havlir, 2012. "Social, Structural and Behavioral Determinants of Overall Health Status in a Cohort of Homeless and Unstably Housed HIV-Infected Men," PLOS ONE, Public Library of Science, vol. 7(4), pages 1-7, April.
    4. van der Laan Mark J. & Petersen Maya & Zheng Wenjing, 2013. "Estimating the Effect of a Community-Based Intervention with Two Communities," Journal of Causal Inference, De Gruyter, vol. 1(1), pages 83-106, June.
    5. Tuglus Catherine & van der Laan Mark J., 2009. "Modified FDR Controlling Procedure for Multi-Stage Analyses," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-17, February.
    6. van der Laan Mark J., 2010. "Targeted Maximum Likelihood Based Causal Inference: Part I," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-45, February.
    7. Helene C. W. Rytgaard & Mark J. Laan, 2024. "Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 30(1), pages 4-33, January.
    8. Brooks Jordan & van der Laan Mark J. & Go Alan S., 2012. "Targeted Maximum Likelihood Estimation for Prediction Calibration," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-35, October.
    9. Antoine Chambaz & Mark J. Laan, 2014. "Inference in Targeted Group-Sequential Covariate-Adjusted Randomized Clinical Trials," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(1), pages 104-140, March.
    10. Iván Díaz & Alan Hubbard & Anna Decker & Mitchell Cohen, 2015. "Variable Importance and Prediction Methods for Longitudinal Problems with Missing Variables," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-17, March.
    11. Guanbo Wang & Mireille E. Schnitzer & Dick Menzies & Piret Viiklepp & Timothy H. Holtz & Andrea Benedetti, 2020. "Estimating treatment importance in multidrug‐resistant tuberculosis using Targeted Learning: An observational individual patient data network meta‐analysis," Biometrics, The International Biometric Society, vol. 76(3), pages 1007-1016, September.
    12. Justin Whitehouse & Qizhao Chen & Morgane Austern & Vasilis Syrgkanis, 2025. "Inference on Optimal Policy Values and Other Irregular Functionals via Softmax Smoothing," Papers 2507.11780, arXiv.org, revised Mar 2026.
    13. Harsh Parikh & Carlos Varjao & Louise Xu & Eric Tchetgen Tchetgen, 2022. "Validating Causal Inference Methods," Papers 2202.04208, arXiv.org, revised Jul 2022.
    14. Martin Huber, 2019. "An introduction to flexible methods for policy evaluation," Papers 1910.00641, arXiv.org.
    15. Wei Zhao & Ying Qing Chen & Li Hsu, 2017. "On estimation of time-dependent attributable fraction from population-based case-control studies," Biometrics, The International Biometric Society, vol. 73(3), pages 866-875, September.
    16. Tran Linh & Petersen Maya & Schwab Joshua & van der Laan Mark J., 2023. "Robust variance estimation and inference for causal effect estimation," Journal of Causal Inference, De Gruyter, vol. 11(1), pages 1-27, January.
    17. S Ariane Christie & Amanda S Conroy & Rachael A Callcut & Alan E Hubbard & Mitchell J Cohen, 2019. "Dynamic multi-outcome prediction after injury: Applying adaptive machine learning for precision medicine in trauma," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-13, April.
    18. Waverly Wei & Maya Petersen & Mark J van der Laan & Zeyu Zheng & Chong Wu & Jingshen Wang, 2023. "Efficient targeted learning of heterogeneous treatment effects for multiple subgroups," Biometrics, The International Biometric Society, vol. 79(3), pages 1934-1946, September.
    19. Michael Rosenblum & Nicholas P. Jewell & Mark van der Laan & Stephen Shiboski & Ariane van der Straten & Nancy Padian, 2009. "Analysing direct effects in randomized trials with secondary interventions: an application to human immunodeficiency virus prevention trials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(2), pages 443-465, April.
    20. Jonathan Fuhr & Philipp Berens & Dominik Papies, 2024. "Estimating Causal Effects with Double Machine Learning -- A Method Evaluation," Papers 2403.14385, arXiv.org, revised Apr 2024.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:4:y:2008:i:1:n:19. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyterbrill.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.