IDEAS home Printed from https://ideas.repec.org/p/ifs/cemmap/19-20.html
   My bibliography  Save this paper

Causal inference in case-control studies

Author

Listed:
  • Sung Jae Jun

    (Institute for Fiscal Studies and Pennsylvania State University)

  • Sokbae (Simon) Lee

    (Institute for Fiscal Studies and Columbia University and IFS)

Abstract

We investigate identi?cation of causal parameters in case-control and related studies. The odds ratio in the sample is our main estimand of interest and we articulate its relationship with causal parameters under various scenarios. It turns out that the odds ratio is generally a sharp upper bound for counterfactual relative risk under some monotonicity assumptions, without resorting to strong ig-norability, nor to the rare-disease assumption. Further, we propose semparametrically ef?cient, easy-to-implement, machine-learning-friendly estimators of the aggregated (log) odds ratio by exploiting an explicit form of the ef?cient in?uence function. Using our new estimators, we develop methods for causal inference and illustrate the usefulness of our methods by a real-data example.

Suggested Citation

  • Sung Jae Jun & Sokbae (Simon) Lee, 2020. "Causal inference in case-control studies," CeMMAP working papers CWP19/20, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:cemmap:19/20
    as

    Download full text from publisher

    File URL: https://www.ifs.org.uk/uploads/CWP1920-Causal-inference-in-case-control-studies.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Imbens, Guido W, 1992. "An Efficient Method of Moments Estimator for Discrete Choice Models with Choice-Based Sampling," Econometrica, Econometric Society, vol. 60(5), pages 1187-1214, September.
    2. James J. Heckman & Petra E. Todd, 2009. "A note on adapting propensity score matching and selection models to choice based samples," Econometrics Journal, Royal Economic Society, vol. 12(s1), pages 230-234, January.
    3. Charles F. Manski, 1997. "Monotone Treatment Response," Econometrica, Econometric Society, vol. 65(6), pages 1311-1334, November.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    5. Alexandre Belloni & Victor Chernozhukov & Ying Wei, 2016. "Post-Selection Inference for Generalized Linear Models With Many Controls," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 606-619, October.
    6. repec:hrv:faseco:34728615 is not listed on IDEAS
    7. Sung Jae Jun & Sokbae (Simon) Lee, 2018. "Identifying the effect of persuasion," CeMMAP working papers CWP19/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    8. Cosslett, Stephen R, 1981. "Maximum Likelihood Estimator for Choice-Based Samples," Econometrica, Econometric Society, vol. 49(5), pages 1289-1316, September.
    9. Janet Currie & Matthew Neidell, 2005. "Air Pollution and Infant Health: What Can We Learn from California's Recent Experience?," The Quarterly Journal of Economics, Oxford University Press, vol. 120(3), pages 1003-1030.
    10. Machado, Cecilia & Shaikh, Azeem M. & Vytlacil, Edward J., 2019. "Instrumental variables and the sign of the average treatment effect," Journal of Econometrics, Elsevier, vol. 212(2), pages 522-555.
    11. Victor Chernozhukov & Sokbae Lee & Adam M. Rosen, 2013. "Intersection Bounds: Estimation and Inference," Econometrica, Econometric Society, vol. 81(2), pages 667-737, March.
    12. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    13. Yu Xie & Charles F. Manski, 1989. "The Logit Model and Response-Based Samples," Sociological Methods & Research, , vol. 17(3), pages 283-302, February.
    14. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    15. Lee, Sokbae & Song, Kyungchul & Whang, Yoon-Jae, 2018. "Testing For A General Class Of Functional Inequalities," Econometric Theory, Cambridge University Press, vol. 34(5), pages 1018-1064, October.
    16. Eric J. Tchetgen Tchetgen & James M. Robins & Andrea Rotnitzky, 2010. "On doubly robust estimation in a semiparametric odds ratio model," Biometrika, Biometrika Trust, vol. 97(1), pages 171-180.
    17. Newey, Whitney K, 1990. "Semiparametric Efficiency Bounds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 5(2), pages 99-135, April-Jun.
    18. Tsunao Okumura & Emiko Usui, 2014. "Concave‐monotone treatment response and monotone treatment selection: With an application to the returns to schooling," Quantitative Economics, Econometric Society, vol. 5, pages 175-194, March.
    19. Lancaster, Tony & Imbens, Guido, 1996. "Case-control studies with contaminated controls," Journal of Econometrics, Elsevier, vol. 71(1-2), pages 145-160.
    20. Daniel Ackerberg & Xiaohong Chen & Jinyong Hahn, 2012. "A Practical Asymptotic Variance Estimator for Two-Step Semiparametric Estimators," The Review of Economics and Statistics, MIT Press, vol. 94(2), pages 481-498, May.
    21. Imbens, Guido W. & Lancaster, Tony, 1996. "Efficient estimation and stratified sampling," Journal of Econometrics, Elsevier, vol. 74(2), pages 289-318, October.
    22. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    23. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    24. Daniel Ackerberg & Xiaohong Chen & Jinyong Hahn & Zhipeng Liao, 2014. "Asymptotic Efficiency of Semiparametric Two-step GMM," Review of Economic Studies, Oxford University Press, vol. 81(3), pages 919-943.
    25. Wooyoung Kim & Koohyun Kwon & Soonwoo Kwon & Sokbae Lee, 2018. "The identification power of smoothness assumptions in models with counterfactual outcomes," Quantitative Economics, Econometric Society, vol. 9(2), pages 617-642, July.
    26. Kani Chen, 2001. "Parametric models for response‐biased sampling," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(4), pages 775-789.
    27. Hua Yun Chen, 2007. "A Semiparametric Odds Ratio Model for Measuring Association," Biometrics, The International Biometric Society, vol. 63(2), pages 413-421, June.
    28. Jay Bhattacharya & Azeem M. Shaikh & Edward Vytlacil, 2008. "Treatment Effect Bounds under Monotonicity Assumptions: An Application to Swan-Ganz Catheterization," American Economic Review, American Economic Association, vol. 98(2), pages 351-356, May.
    29. Heckman, James J. & Robb, Richard Jr., 1985. "Alternative methods for evaluating the impact of interventions : An overview," Journal of Econometrics, Elsevier, vol. 30(1-2), pages 239-267.
    30. E. H. Kennedy & A. Sjölander & D. S. Small, 2015. "Semiparametric causal inference in matched cohort studies," Biometrika, Biometrika Trust, vol. 102(3), pages 739-746.
    31. Bhattacharya, Jay & Shaikh, Azeem M. & Vytlacil, Edward, 2012. "Treatment effect bounds: An application to Swan–Ganz catheterization," Journal of Econometrics, Elsevier, vol. 168(2), pages 223-243.
    32. Ian Domowitz & Robert L. Sartain, 1999. "Determinants of the Consumer Bankruptcy Decision," Journal of Finance, American Finance Association, vol. 54(1), pages 403-420, February.
    33. Elie Tamer, 2010. "Partial Identification in Econometrics," Annual Review of Economics, Annual Reviews, vol. 2(1), pages 167-195, September.
    34. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881, December.
    35. Jing Zhou & Amy H. Herring & Anirban Bhattacharya & Andrew F. Olshan & David B. Dunson, 2016. "Nonparametric Bayes modeling for case control studies with many predictors," Biometrics, The International Biometric Society, vol. 72(1), pages 184-192, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sung Jae Jun & Sokbae Lee, 2020. "Causal Inference under Outcome-Based Sampling with Monotonicity Assumptions," Papers 2004.08318, arXiv.org, revised Aug 2021.
    2. Kyungchul Song, 2009. "Efficient Estimation of Average Treatment Effects under Treatment-Based Sampling," PIER Working Paper Archive 09-011, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    3. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," Review of Economic Studies, Oxford University Press, vol. 79(3), pages 1053-1079.
    4. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    6. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    7. Halbert White & Karim Chalak, 2013. "Identification and Identification Failure for Treatment Effects Using Structural Systems," Econometric Reviews, Taylor & Francis Journals, vol. 32(3), pages 273-317, November.
    8. Daniel McFadden, 2001. "Economic Choices," American Economic Review, American Economic Association, vol. 91(3), pages 351-378, June.
    9. Prokhorov, Artem & Schmidt, Peter, 2009. "GMM redundancy results for general missing data problems," Journal of Econometrics, Elsevier, vol. 151(1), pages 47-55, July.
    10. Bryan S. Graham & Cristine Campos de Xavier Pinto, 2018. "Semiparametrically efficient estimation of the average linear regression function," Papers 1810.12511, arXiv.org.
    11. Neng-Chieh Chang, 2020. "The Mode Treatment Effect," Papers 2007.11606, arXiv.org.
    12. Rahul Singh & Liyuan Xu & Arthur Gretton, 2020. "Reproducing Kernel Methods for Nonparametric and Semiparametric Treatment Effects," Papers 2010.04855, arXiv.org, revised Apr 2021.
    13. Wooyoung Kim & Koohyun Kwon & Soonwoo Kwon & Sokbae Lee, 2018. "The identification power of smoothness assumptions in models with counterfactual outcomes," Quantitative Economics, Econometric Society, vol. 9(2), pages 617-642, July.
    14. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    15. Chen, Xuan & Flores, Carlos A. & Flores-Lagunes, Alfonso, 2015. "Going Beyond LATE: Bounding Average Treatment Effects of Job Corps Training," IZA Discussion Papers 9511, Institute of Labor Economics (IZA).
    16. Possebom, Vitor, 2018. "Sharp bounds on the MTE with sample selection," MPRA Paper 89785, University Library of Munich, Germany.
    17. Esmeralda A. Ramalho & Richard J. Smith, 2013. "Discrete Choice Non-Response," Review of Economic Studies, Oxford University Press, vol. 80(1), pages 343-364.
    18. Jelena Bradic & Stefan Wager & Yinchu Zhu, 2019. "Sparsity Double Robust Inference of Average Treatment Effects," Papers 1905.00744, arXiv.org.
    19. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Dec 2017.
    20. Masahiro Kato, 2021. "Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability," Papers 2102.08975, arXiv.org, revised Mar 2021.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:19/20. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: . General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.