IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v12y2016i1p333-349n14.html
   My bibliography  Save this article

Second-Order Inference for the Mean of a Variable Missing at Random

Author

Listed:
  • Díaz Iván

    (Google Inc. New York, NY)

  • Carone Marco

    (Department of Biostatistics, University of Washington, Seattle, WA, USA)

  • van der Laan Mark J.

    (Division of Biostatistics, University of California, Berkeley, CA, USA)

Abstract

We present a second-order estimator of the mean of a variable subject to missingness, under the missing at random assumption. The estimator improves upon existing methods by using an approximate second-order expansion of the parameter functional, in addition to the first-order expansion employed by standard doubly robust methods. This results in weaker assumptions about the convergence rates necessary to establish consistency, local efficiency, and asymptotic linearity. The general estimation strategy is developed under the targeted minimum loss-based estimation (TMLE) framework. We present a simulation comparing the sensitivity of the first and second-order estimators to the convergence rate of the initial estimators of the outcome regression and missingness score. In our simulation, the second-order TMLE always had a coverage probability equal or closer to the nominal value 0.95, compared to its first-order counterpart. In the best-case scenario, the proposed second-order TMLE had a coverage probability of 0.86 when the first-order TMLE had a coverage probability of zero. We also present a novel first-order estimator inspired by a second-order expansion of the parameter functional. This estimator only requires one-dimensional smoothing, whereas implementation of the second-order TMLE generally requires kernel smoothing on the covariate space. The first-order estimator proposed is expected to have improved finite sample performance compared to existing first-order estimators. In the best-case scenario of our simulation study, the novel first-order TMLE improved the coverage probability from 0 to 0.90. We provide an illustration of our methods using a publicly available dataset to determine the effect of an anticoagulant on health outcomes of patients undergoing percutaneous coronary intervention. We provide R code implementing the proposed estimator.

Suggested Citation

  • Díaz Iván & Carone Marco & van der Laan Mark J., 2016. "Second-Order Inference for the Mean of a Variable Missing at Random," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 333-349, May.
  • Handle: RePEc:bpj:ijbist:v:12:y:2016:i:1:p:333-349:n:14
    DOI: 10.1515/ijb-2015-0031
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/ijb-2015-0031
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/ijb-2015-0031?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Porter Kristin E. & Gruber Susan & van der Laan Mark J. & Sekhon Jasjeet S., 2011. "The Relative Performance of Targeted Maximum Likelihood Estimators," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-34, August.
    2. Heejung Bang & James M. Robins, 2005. "Doubly Robust Estimation in Missing Data and Causal Inference Models," Biometrics, The International Biometric Society, vol. 61(4), pages 962-973, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Susan Gruber & Mark J. van der Laan, 2013. "An Application of Targeted Maximum Likelihood Estimation to the Meta-Analysis of Safety Data," Biometrics, The International Biometric Society, vol. 69(1), pages 254-262, March.
    2. Stitelman Ori M. & De Gruttola Victor & van der Laan Mark J., 2012. "A General Implementation of TMLE for Longitudinal Data Applied to Causal Inference in Survival Analysis," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-39, September.
    3. Lars van der Laan & Wenbo Zhang & Peter B. Gilbert, 2023. "Nonparametric estimation of the causal effect of a stochastic threshold‐based intervention," Biometrics, The International Biometric Society, vol. 79(2), pages 1014-1028, June.
    4. Radice Rosalba & Ramsahai Roland & Grieve Richard & Kreif Noemi & Sadique Zia & Sekhon Jasjeet S., 2012. "Evaluating treatment effectiveness in patient subgroups: a comparison of propensity score methods with an automated matching approach," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-45, August.
    5. Díaz Iván & Rosenblum Michael, 2015. "Targeted Maximum Likelihood Estimation using Exponential Families," The International Journal of Biostatistics, De Gruyter, vol. 11(2), pages 233-251, November.
    6. Stitelman Ori M & Wester C. William & De Gruttola Victor & van der Laan Mark J., 2011. "Targeted Maximum Likelihood Estimation of Effect Modification Parameters in Survival Analysis," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-34, March.
    7. van der Laan Mark J. & Gruber Susan, 2012. "Targeted Minimum Loss Based Estimation of Causal Effects of Multiple Time Point Interventions," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-41, May.
    8. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    9. Görg Holger & Marchal Léa, 2019. "Die Effekte deutscher Direktinvestitionen im Empfängerland vor dem Hintergrund des Leistungsbilanzüberschusses: Empirische Evidenz mit Mikrodaten für Frankreich," Perspektiven der Wirtschaftspolitik, De Gruyter, vol. 20(1), pages 53-69, June.
    10. Léa Marchal & Clément Nedoncelle, 2019. "Immigrants, occupations and firm export performance," Review of International Economics, Wiley Blackwell, vol. 27(5), pages 1480-1509, November.
    11. Hisaki Kono & Yasuyuki Sawada & Abu S. Shonchoy, 2016. "DVD-based Distance-learning Program for University Entrance Exams: Experimental Evidence from Rural Bangladesh," CIRJE F-Series CIRJE-F-1027, CIRJE, Faculty of Economics, University of Tokyo.
    12. Paul Frédéric Blanche & Anders Holt & Thomas Scheike, 2023. "On logistic regression with right censored data, with or without competing risks, and its use for estimating treatment effects," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(2), pages 441-482, April.
    13. Ignaciuk, Ada & Malevolti, Giulia & Scognamillo, Antonio & Sitko, Nicholas J., 2022. "Can food aid relax farmers’ constraints to adopting climate-adaptive agricultural practices? Evidence from Ethiopia, Malawi and the United Republic of Tanzania," ESA Working Papers 324073, Food and Agriculture Organization of the United Nations, Agricultural Development Economics Division (ESA).
    14. Everding, Jakob & Marcus, Jan, 2020. "The effect of unemployment on the smoking behavior of couples," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 29(2), pages 154-170.
    15. Li Liang & Greene Tom, 2013. "A Weighting Analogue to Pair Matching in Propensity Score Analysis," The International Journal of Biostatistics, De Gruyter, vol. 9(2), pages 215-234, July.
    16. McFarland, Michael J. & Geller, Amanda & McFarland, Cheryl, 2019. "Police contact and health among urban adolescents: The role of perceived injustice," Social Science & Medicine, Elsevier, vol. 238(C), pages 1-1.
    17. Fan Li & Ashley L. Buchanan & Stephen R. Cole, 2022. "Generalizing trial evidence to target populations in non‐nested designs: Applications to AIDS clinical trials," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(3), pages 669-697, June.
    18. Ronald Herrera & Ursula Berger & Ondine S. Von Ehrenstein & Iván Díaz & Stella Huber & Daniel Moraga Muñoz & Katja Radon, 2017. "Estimating the Causal Impact of Proximity to Gold and Copper Mines on Respiratory Diseases in Chilean Children: An Application of Targeted Maximum Likelihood Estimation," IJERPH, MDPI, vol. 15(1), pages 1-15, December.
    19. Antonelli Joseph & Cefalu Matthew, 2020. "Averaging causal estimators in high dimensions," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 92-107, January.
    20. Yukun Ma & Pedro H. C. Sant'Anna & Yuya Sasaki & Takuya Ura, 2023. "Doubly Robust Estimators with Weak Overlap," Papers 2304.08974, arXiv.org, revised Apr 2023.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:12:y:2016:i:1:p:333-349:n:14. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.