
Proxy Pattern-Mixture Analysis for a Binary Variable Subject to Nonresponse

Author

Listed:
  • Andridge, Rebecca R.

    (The Ohio State University College of Public Health Division of Biostatistics, 242 Cunz Hall, 1841 Neil Ave., Columbus, OH 43210, U.S.A.)

  • Little, Roderick J.A.

    (University of Michigan, Department of Biostatistics, M4071 SPH II, 1415 Washington Heights, Ann Arbor, MI 48109, U.S.A.)

Abstract

Given increasing survey nonresponse, good measures of the potential impact of nonresponse on survey estimates are particularly important. Existing measures, such as the R-indicator, make the strong assumption that data are missing at random, meaning that missingness depends only on variables that are observed for both respondents and nonrespondents. We consider assessment of the impact of nonresponse for a binary survey variable Y subject to nonresponse when data may be missing not at random, meaning that missingness may depend on Y itself. Our work is motivated by missing categorical income data in the 2015 Ohio Medicaid Assessment Survey (OMAS), where whether or not income is missing may be related to the income value itself, with low-income earners more reluctant to respond. We assume there is a set of covariates observed for both respondents and nonrespondents; for item nonresponse (as in OMAS) this is often a rich set of variables, but it may be limited in cases of unit nonresponse. To reduce dimensionality and for simplicity, we summarize these covariates in a single continuous proxy variable X, available for both respondents and nonrespondents, that has the highest correlation with Y, estimated from a probit regression analysis of respondent data. We extend the previously proposed proxy pattern-mixture (PPM) analysis for continuous outcomes to the binary outcome using a latent variable approach for modeling the joint distribution of Y and X. Our method does not assume data are missing at random but includes that assumption as a special case, thus creating a convenient framework for sensitivity analyses. Maximum likelihood, Bayesian, and multiple imputation versions of PPM analysis are described, and robustness of these methods to model assumptions is discussed. Properties are demonstrated through simulation and with the 2015 OMAS data.
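
The proxy construction described in the abstract (a probit regression of Y on the covariates, fitted to respondents only, whose linear predictor is then evaluated for every sampled case) can be illustrated with a minimal sketch. This is not the authors' code. The data are simulated, the covariates `z1` and `z2`, the coefficient values, and the helper names `solve`, `fit_probit`, and `proxy` are all hypothetical, and the probit fit is a hand-rolled Fisher scoring routine using only the Python standard library. The sketch stops at the proxy X and does not attempt the pattern-mixture step itself.

```python
import random
from statistics import NormalDist, mean

STD_NORMAL = NormalDist()  # standard normal: supplies cdf (Phi) and pdf


def solve(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / m[r][r]
    return x


def fit_probit(rows, y, n_iter=25):
    """Fisher scoring for P(Y=1 | z) = Phi(b0 + b'z); an intercept is added here."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    beta = [0.0] * k
    for _ in range(n_iter):
        score = [0.0] * k
        info = [[0.0] * k for _ in range(k)]
        for xi, yi in zip(design, y):
            eta = sum(b * v for b, v in zip(beta, xi))
            # Clip fitted probabilities away from 0 and 1 for numerical safety.
            p = min(max(STD_NORMAL.cdf(eta), 1e-10), 1 - 1e-10)
            d = STD_NORMAL.pdf(eta)
            w = d / (p * (1 - p))
            for a in range(k):
                score[a] += xi[a] * (yi - p) * w
                for c in range(k):
                    info[a][c] += xi[a] * xi[c] * d * w
        step = solve(info, score)
        beta = [b + s for b, s in zip(beta, step)]
    return beta


def proxy(beta, rows):
    """Linear predictor of the fitted probit, used as the continuous proxy X."""
    return [beta[0] + sum(b * v for b, v in zip(beta[1:], r)) for r in rows]


# Simulated data (assumption): two covariates observed for every sampled case.
random.seed(0)
n = 2000
covariates = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n)]
y = [1 if 0.5 + 1.0 * z1 - 0.5 * z2 + random.gauss(0, 1) > 0 else 0
     for z1, z2 in covariates]
# Response depends on z1 only, so this toy example is missing at random.
responded = [random.random() < STD_NORMAL.cdf(0.5 + 0.5 * z1)
             for z1, _ in covariates]

# Fit on respondents only, then evaluate the proxy for everyone.
resp_rows = [z for z, r in zip(covariates, responded) if r]
resp_y = [v for v, r in zip(y, responded) if r]
beta_hat = fit_probit(resp_rows, resp_y)
x_proxy = proxy(beta_hat, covariates)

mean_resp = mean([x for x, r in zip(x_proxy, responded) if r])
mean_nonresp = mean([x for x, r in zip(x_proxy, responded) if not r])
print("estimated probit coefficients:", [round(b, 3) for b in beta_hat])
print("mean proxy, respondents:", round(mean_resp, 3))
print("mean proxy, nonrespondents:", round(mean_nonresp, 3))
```

In this simulated setting the proxy mean differs between respondents and nonrespondents because response depends on `z1`. That difference is the kind of observed information a proxy-based nonresponse analysis draws on. How strongly missingness depends on Y itself cannot be read off the observed data, which is why the abstract frames the method as a tool for sensitivity analysis.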

Suggested Citation

  • Andridge, Rebecca R. & Little, Roderick J.A., 2020. "Proxy Pattern-Mixture Analysis for a Binary Variable Subject to Nonresponse," Journal of Official Statistics, Sciendo, vol. 36(3), pages 703-728, September.
  • Handle: RePEc:vrs:offsta:v:36:y:2020:i:3:p:703-728:n:12
    DOI: 10.2478/jos-2020-0035

    Download full text from publisher

    File URL: https://doi.org/10.2478/jos-2020-0035
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ana Beatriz Galvão & James Mitchell, 2024. "Communicating Data Uncertainty: Multiwave Experimental Evidence for UK GDP," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 56(1), pages 81-114, February.
    2. Sènakpon Fidèle A. Dedehouanou & Luca Tiberti & Hilaire G. Houeninvo & Djohodo Inès Monwanou, 2019. "Working while studying: Employment premium or penalty for youth in Benin?," Working Papers PMMA 2019-03, PEP-PMMA.
    3. Sandra Müllbacher & Wolfgang Nagl, 2017. "Labour supply in Austria: an assessment of recent developments and the effects of a tax reform," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 44(3), pages 465-486, August.
    4. Insoo Cho & Peter F. Orazem, 2021. "How endogenous risk preferences and sample selection affect analysis of firm survival," Small Business Economics, Springer, vol. 56(4), pages 1309-1332, April.
    5. Walter Beckert, 2015. "Choice in the Presence of Experts," Birkbeck Working Papers in Economics and Finance 1503, Birkbeck, Department of Economics, Mathematics & Statistics.
    6. Miyoshi, Koyo, 2008. "Male-female wage differentials in Japan," Japan and the World Economy, Elsevier, vol. 20(4), pages 479-496, December.
    7. Cameron, Trudy Ann & Shaw, W. Douglass & Ragland, Shannon E. & Callaway, J. Mac & Keefe, Sally, 1996. "Using Actual And Contingent Behavior Data With Differing Levels Of Time Aggregation To Model Recreation Demand," Journal of Agricultural and Resource Economics, Western Agricultural Economics Association, vol. 21(1), pages 1-20, July.
    8. Annika Meng, 2010. "Long-term Care Responsibility and its Opportunity Costs," Ruhr Economic Papers 0168, Rheinisch-Westfälisches Institut für Wirtschaftsforschung, Ruhr-Universität Bochum, Universität Dortmund, Universität Duisburg-Essen.
    9. Hans A. Holter & Dirk Krueger & Serhiy Stepanchuk, 2019. "How do tax progressivity and household heterogeneity affect Laffer curves?," Quantitative Economics, Econometric Society, vol. 10(4), pages 1317-1356, November.
    10. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    11. Chen, Yuanyuan & Feng, Shuaizhang & Han, Yujie, 2020. "The effect of primary school type on the high school opportunities of migrant children in China," Journal of Comparative Economics, Elsevier, vol. 48(2), pages 325-338.
    12. Michael Raper, 1999. "Self-selection bias and cost-of-living estimates," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 23(1), pages 64-77, March.
    13. Bettina Peters & Rebecca Riley & Iulia Siedschlag & Priit Vahter & John McQuinn, 2014. "Innovation and Productivity in Services: Evidence from Germany, Ireland and the United Kingdom," JRC Working Papers on Corporate R&D and Innovation 2014-04, Joint Research Centre.
    14. Matthew Gentry & Tong Li & Jingfeng Lu, 2015. "Identification and estimation in first-price auctions with risk-averse bidders and selective entry," CeMMAP working papers 16/15, Institute for Fiscal Studies.
    15. Bernadette Power & Gavin C Reid, 2003. "Turbulence, Flexibility and Performance of the Long-lived Small Firm," Tinbergen Institute Discussion Papers 03-039/3, Tinbergen Institute.
    16. Aizenman, Joshua & Ito, Hiro & Pasricha, Gurnain Kaur, 2022. "Central bank swap arrangements in the COVID-19 crisis," Journal of International Money and Finance, Elsevier, vol. 122(C).
    17. Asaduzzaman, M. & Anik, Asif Reza, 2017. "Determinants of Adoption of Rice Yield Gap Minimisation Technology in Bangladesh," Bangladesh Development Studies, Bangladesh Institute of Development Studies (BIDS), vol. 40(1-2), pages 73-96, March-Jun.
    18. Fernandes, Marcelo & Mergulhão, João, 2016. "Anticipatory effects in the FTSE 100 index revisions," Journal of Empirical Finance, Elsevier, vol. 37(C), pages 79-90.
    19. Trottmann, Maria & Zweifel, Peter & Beck, Konstantin, 2012. "Supply-side and demand-side cost sharing in deregulated social health insurance: Which is more effective?," Journal of Health Economics, Elsevier, vol. 31(1), pages 231-242.
    20. Banal-Estañol, Albert & Duso, Tomaso & Seldeslachts, Jo & Szücs, Florian, 2022. "R&D spillovers through RJV cooperation," Research Policy, Elsevier, vol. 51(4).
