IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v12y2020i3d10.1007_s12561-019-09262-2.html
   My bibliography  Save this article

Deductive Semiparametric Estimation in Double-Sampling Designs with Application to PEPFAR

Author

Listed:
  • Tianchen Qian

    (Harvard University)

  • Constantine Frangakis

    (Johns Hopkins University)

  • Constantin Yiannoutsos

    (Indiana University)

Abstract

Non-ignorable dropout is common in studies with long follow-up time, and it can bias study results unless handled carefully in the study design and the statistical analysis. A double-sampling design allocates additional resources to pursue a subsample of the dropouts and find out their outcomes, which can address potential biases due to non-ignorable dropout. It is desirable to construct semiparametric estimators for the double-sampling design because of their robustness properties. However, obtaining such semiparametric estimators remains a challenge due to the requirement of the analytic form of the efficient influence function (EIF), the derivation of which can be ad hoc and difficult for the double-sampling design. Recent work has shown how the derivation of EIF can be made deductive and computerizable using the functional derivative representation of the EIF in nonparametric models. This approach, however, requires deriving the mixture of a continuous distribution and a point mass, which can itself be challenging for complicated problems such as the double-sampling design. We propose semiparametric estimators for the survival probability in double-sampling designs by generalizing the deductive and computerizable estimation approach. In particular, we propose to build the semiparametric estimators based on a discretized support structure, which approximates the possibly continuous observed data distribution and circumvents the derivation of the mixture distribution. Our approach is deductive in the sense that it is expected to produce semiparametric locally efficient estimators within finite steps without knowledge of the EIF. We apply the proposed estimators to estimating the mortality rate in a double-sampling design component of the President’s Emergency Plan for AIDS Relief (PEPFAR) program. We evaluate the impact of double-sampling selection criteria on the mortality rate estimates. Simulation studies are conducted to evaluate the robustness of the proposed estimators.

Suggested Citation

  • Tianchen Qian & Constantine Frangakis & Constantin Yiannoutsos, 2020. "Deductive Semiparametric Estimation in Double-Sampling Designs with Application to PEPFAR," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 417-445, December.
  • Handle: RePEc:spr:stabio:v:12:y:2020:i:3:d:10.1007_s12561-019-09262-2
    DOI: 10.1007/s12561-019-09262-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-019-09262-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-019-09262-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Min Zhang & Anastasios A. Tsiatis & Marie Davidian, 2008. "Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates," Biometrics, The International Biometric Society, vol. 64(3), pages 707-715, September.
    2. Constantine E. Frangakis & Tianchen Qian & Zhenke Wu & Iván Díaz, 2015. "Rejoinder to Discussions on: Deductive derivation and turing-computerization of semiparametric efficient estimation," Biometrics, The International Biometric Society, vol. 71(4), pages 881-883, December.
    3. Newey, Whitney K, 1990. "Semiparametric Efficiency Bounds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 5(2), pages 99-135, April-Jun.
    4. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504.
    5. James Robins & Andrea Rotnitzky & Marco Bonetti, 2001. "Discussion of the Frangakis and Rubin Article," Biometrics, The International Biometric Society, vol. 57(2), pages 343-347, June.
    6. Constantine E. Frangakis & Tianchen Qian & Zhenke Wu & Ivan Diaz, 2015. "Deductive derivation and turing-computerization of semiparametric efficient estimation," Biometrics, The International Biometric Society, vol. 71(4), pages 867-874, December.
    7. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
    8. Constantine E. Frangakis & Donald B. Rubin, 2001. "Addressing an Idiosyncrasy in Estimating Survival Curves Using Double Sampling in the Presence of Self-Selected Right Censoring," Biometrics, The International Biometric Society, vol. 57(2), pages 333-342, June.
    9. Ming-Wen An & Constantine E. Frangakis & Beverly S. Musick & Constantin T. Yiannoutsos, 2009. "The Need for Double-Sampling Designs in Survival Studies: An Application to Monitor PEPFAR," Biometrics, The International Biometric Society, vol. 65(1), pages 301-306, March.
    10. Constantine E. Frangakis & Donald B. Rubin, 2001. "Rejoinder to Discussions on Addressing an Idiosyncrasy in Estimating Survival Curves Using Double Sampling in the Presence of Self-Selected Right Censoring," Biometrics, The International Biometric Society, vol. 57(2), pages 351-353, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Agnes N Kiragga & Barbara Castelnuovo & Rachel Musomba & Jonathan Levin & Andrew Kambugu & Yukari C Manabe & Constantin T Yiannoutsos & Noah Kiwanuka, 2013. "Comparison of Methods for Correction of Mortality Estimates for Loss to Follow-Up after ART Initiation: A Case of the Infectious Diseases Institute, Uganda," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-7, December.
    2. Menggang Yu, 2016. "Improving estimation efficiency for semi-competing risks data with partially observed terminal event," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(4), pages 860-874, October.
    3. Richard J. Cook & Jerald F. Lawless, 2020. "Failure time studies with intermittent observation and losses to follow‐up," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1035-1063, December.
    4. Oliver Linton & Pedro Gozalo, 1996. "Conditional Independence Restrictions: Testing and Estimation," Cowles Foundation Discussion Papers 1140, Cowles Foundation for Research in Economics, Yale University.
    5. Waverly Wei & Maya Petersen & Mark J van der Laan & Zeyu Zheng & Chong Wu & Jingshen Wang, 2023. "Efficient targeted learning of heterogeneous treatment effects for multiple subgroups," Biometrics, The International Biometric Society, vol. 79(3), pages 1934-1946, September.
    6. Timothy B. Armstrong & Michal Kolesár, 2021. "Sensitivity analysis using approximate moment condition models," Quantitative Economics, Econometric Society, vol. 12(1), pages 77-108, January.
    7. Lancaster, Tony & Imbens, Guido, 1996. "Case-control studies with contaminated controls," Journal of Econometrics, Elsevier, vol. 71(1-2), pages 145-160.
    8. Park, Byeong U. & Sickles, Robin C. & Simar, Leopold, 2003. "Semiparametric-efficient estimation of AR(1) panel data models," Journal of Econometrics, Elsevier, vol. 117(2), pages 279-309, December.
    9. Imbens, Guido W. & Lancaster, Tony, 1996. "Efficient estimation and stratified sampling," Journal of Econometrics, Elsevier, vol. 74(2), pages 289-318, October.
    10. Robertson, Donald & Sarafidis, Vasilis, 2015. "IV estimation of panels with factor residuals," Journal of Econometrics, Elsevier, vol. 185(2), pages 526-541.
    11. repec:hal:spmain:info:hdl:2441/75dbbb2hc596np6q8flqf6i79k is not listed on IDEAS
    12. Julie Henriques & Mar Pujades-Rodriguez & Megan McGuire & Elisabeth Szumilin & Jean Iwaz & Jean-François Etard & René Ecochard, 2012. "Comparison of Methods to Correct Survival Estimates and Survival Regression Analysis on a Large HIV African Cohort," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-7, February.
    13. Graham, Bryan S. & Hahn, Jinyong & Poirier, Alexandre & Powell, James L., 2018. "A quantile correlated random coefficients panel data model," Journal of Econometrics, Elsevier, vol. 206(2), pages 305-335.
    14. Menggang Yu & Constantin T. Yiannoutsos, 2015. "Marginal and Conditional Distribution Estimation from Double-sampled Semi-competing Risks Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 87-103, March.
    15. Koen Jochmans, 2017. "Two-Way Models for Gravity," The Review of Economics and Statistics, MIT Press, vol. 99(3), pages 478-485, July.
    16. Dennis Kristensen, 2009. "Semiparametric Modelling and Estimation: A Selective Overview," CREATES Research Papers 2009-44, Department of Economics and Business Economics, Aarhus University.
    17. Graham, Bryan S. & Pinto, Cristine Campos de Xavier, 2022. "Semiparametrically efficient estimation of the average linear regression function," Journal of Econometrics, Elsevier, vol. 226(1), pages 115-138.
    18. Stuart G. Baker, 2001. "Discussion of Double Sampling for Survival Analysis," Biometrics, The International Biometric Society, vol. 57(2), pages 348-350, June.
    19. Ai, Chunrong & Chen, Xiaohong, 2012. "The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," Journal of Econometrics, Elsevier, vol. 170(2), pages 442-457.
    20. Severini, Thomas A. & Tripathi, Gautam, 2001. "A simplified approach to computing efficiency bounds in semiparametric models," Journal of Econometrics, Elsevier, vol. 102(1), pages 23-66, May.
    21. St'ephane Bonhomme & Martin Weidner, 2018. "Minimizing Sensitivity to Model Misspecification," Papers 1807.02161, arXiv.org, revised Oct 2021.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:12:y:2020:i:3:d:10.1007_s12561-019-09262-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.