IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.14616.html

Estimating Program Participation with Partial Validation

Author

Listed:
  • Augustine Denteh
  • Pierre E. Nguimkeu

Abstract

This paper considers the estimation of binary choice models when survey responses are possibly misclassified but one of the response category can be validated. Partial validation may occur when survey questions about participation include follow-up questions on that particular response category. In this case, we show that the initial two-sided misclassification problem can be transformed into a one-sided one, based on the partially validated responses. Using the updated responses naively for estimation does not solve or mitigate the misclassification bias, and we derive the ensuing asymptotic bias under general conditions. We then show how the partially validated responses can be used to construct a model for participation and propose consistent and asymptotically normal estimators that overcome misclassification error. Monte Carlo simulations are provided to demonstrate the finite sample performance of the proposed and selected existing methods. We provide an empirical illustration on the determinants of health insurance coverage in Ghana. We discuss implications for the design of survey questionnaires that allow researchers to overcome misclassification biases without recourse to relatively costly and often imperfect validation data.

Suggested Citation

  • Augustine Denteh & Pierre E. Nguimkeu, 2025. "Estimating Program Participation with Partial Validation," Papers 2512.14616, arXiv.org.
  • Handle: RePEc:arx:papers:2512.14616
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.14616
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. H. S. Farber, 1981. "Worker Preferences for Union Representation," Working papers 290, Massachusetts Institute of Technology (MIT), Department of Economics.
    2. Bollinger, Christopher R, 1998. "Measurement Error in the Current Population Survey: A Nonparametric Look," Journal of Labor Economics, University of Chicago Press, vol. 16(3), pages 576-594, July.
    3. Meng, Chun-Lo & Schmidt, Peter, 1985. "On the Cost of Partial Observability in the Bivariate Probit Model," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 26(1), pages 71-85, February.
    4. Brent Kreider & Steven C. Hill, 2009. "Partially Identifying Treatment Effects with an Application to Covering the Uninsured," Journal of Human Resources, University of Wisconsin Press, vol. 44(2).
    5. Ansolabehere, Stephen & Hersh, Eitan, 2012. "Validation: What Big Data Reveal About Survey Misreporting and the Real Electorate," Political Analysis, Cambridge University Press, vol. 20(4), pages 437-459.
    6. Black, Dan & Sanders, Seth & Taylor, Lowell, 2003. "Measurement of Higher Education in the Census and Current Population Survey," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 545-554, January.
    7. Bollinger, Christopher R & David, Martin H, 2001. "Estimation with Response Error and Nonresponse: Food-Stamp Participation in the SIPP," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(2), pages 129-141, April.
    8. Boyes, William J. & Hoffman, Dennis L. & Low, Stuart A., 1989. "An econometric analysis of the bank credit scoring problem," Journal of Econometrics, Elsevier, vol. 40(1), pages 3-14, January.
    9. Freeman, Richard B, 1984. "Longitudinal Analyses of the Effects of Trade Unions," Journal of Labor Economics, University of Chicago Press, vol. 2(1), pages 1-26, January.
    10. Augustine Denteh & D'esir'e K'edagni, 2022. "Misclassification in Difference-in-differences Models," Papers 2207.11890, arXiv.org, revised Jul 2022.
    11. Bruce D. Meyer & Wallace K. C. Mok & James X. Sullivan, 2015. "Household Surveys in Crisis," Journal of Economic Perspectives, American Economic Association, vol. 29(4), pages 199-226, Fall.
    12. Pablo A. Celhay & Bruce D. Meyer & Nikolas Mittag, 2021. "Errors in Reporting and Imputation of Government Benefits and Their Implications," NBER Working Papers 29184, National Bureau of Economic Research, Inc.
    13. Duncan, Greg J & Hill, Daniel H, 1985. "An Investigation of the Extent and Consequences of Measurement Error in Labor-Economic Survey Data," Journal of Labor Economics, University of Chicago Press, vol. 3(4), pages 508-532, October.
    14. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
    15. Bound, John & Brown, Charles & Mathiowetz, Nancy, 2001. "Measurement error in survey data," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 59, pages 3705-3843, Elsevier.
    16. Lee, Lung-fei, 1995. "Semiparametric maximum likelihood estimation of polychotomous and sequential choice models," Journal of Econometrics, Elsevier, vol. 65(2), pages 381-428, February.
    17. Bruce D. Meyer & Nikolas Mittag, 2019. "Misreporting of Government Transfers: How Important Are Survey Design and Geography?," Southern Economic Journal, John Wiley & Sons, vol. 86(1), pages 230-253, July.
    18. Bound, John & Krueger, Alan B, 1991. "The Extent of Measurement Error in Longitudinal Earnings Data: Do Two Wrongs Make a Right?," Journal of Labor Economics, University of Chicago Press, vol. 9(1), pages 1-24, January.
    19. Bruce D. Meyer & Derek Wu & Victoria Mooers & Carla Medalia, 2021. "The Use and Misuse of Income Data and Extreme Poverty in the United States," Journal of Labor Economics, University of Chicago Press, vol. 39(S1), pages 5-58.
    20. Bound, John & Brown, Charles & Duncan, Greg J & Rodgers, Willard L, 1994. "Evidence on the Validity of Cross-Sectional and Longitudinal Labor Market Data," Journal of Labor Economics, University of Chicago Press, vol. 12(3), pages 345-368, July.
    21. Edward Kwabena Ameyaw & Raymond Elikplim Kofinti & Francis Appiah, 2017. "National health insurance subscription and maternal healthcare utilisation across mothers’ wealth status in Ghana," Health Economics Review, Springer, vol. 7(1), pages 1-15, December.
    22. Nguimkeu, Pierre & Denteh, Augustine & Tchernis, Rusty, 2019. "On the estimation of treatment effects with endogenous misreporting," Journal of Econometrics, Elsevier, vol. 208(2), pages 487-506.
    23. Jason Abrevaya & Jerry A. Hausman, 1999. "Semiparametric Estimation with Mismeasured Dependent Variables: An Application to Duration Models for Unemployment Spells," Annals of Economics and Statistics, GENES, issue 55-56, pages 243-275.
    24. repec:adr:anecst:y:1999:i:55-56:p:09 is not listed on IDEAS
    25. Klein, Roger W & Spady, Richard H, 1993. "An Efficient Semiparametric Estimator for Binary Response Models," Econometrica, Econometric Society, vol. 61(2), pages 387-421, March.
    26. Stephen O. Abrokwah & Kevin Callison & Donald J. Meyer, 2019. "Social Health Insurance and the Use of Formal and Informal Care in Developing Countries: Evidence from Ghana’s National Health Insurance Scheme," Journal of Development Studies, Taylor & Francis Journals, vol. 55(7), pages 1477-1491, July.
    27. Michael P. Keane & Robert M. Sauer, 2009. "Classification Error in Dynamic Discrete Choice Models: Implications for Female Labor Supply Behavior," Econometrica, Econometric Society, vol. 77(3), pages 975-991, May.
    28. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Misclassification in binary choice models," Journal of Econometrics, Elsevier, vol. 200(2), pages 295-311.
    29. Poirier, Dale J., 1980. "Partial observability in bivariate probit models," Journal of Econometrics, Elsevier, vol. 12(2), pages 209-217, February.
    30. Feinstein, Jonathan S, 1990. "Detection Controlled Estimation," Journal of Law and Economics, University of Chicago Press, vol. 33(1), pages 233-276, April.
    31. Zhichao Jiang & Peng Ding, 2020. "Measurement errors in the binary instrumental variable model," Biometrika, Biometrika Trust, vol. 107(1), pages 238-245.
    32. Agar Brugiavini & Noemi Pace, 2016. "Extending health insurance in Ghana: effects of the National Health Insurance Scheme on maternity care," Health Economics Review, Springer, vol. 6(1), pages 1-10, December.
    33. Card, David, 1996. "The Effect of Unions on the Structure of Wages: A Longitudinal Analysis," Econometrica, Econometric Society, vol. 64(4), pages 957-979, July.
    34. Ali, Mir M. & Mikhail, N. N. & Haq, M. Safiul, 1978. "A class of bivariate distributions including the bivariate logistic," Journal of Multivariate Analysis, Elsevier, vol. 8(3), pages 405-412, September.
    35. Bruce D. Meyer & Nikolas Mittag & Robert M. Goerge, 2022. "Errors in Survey Reporting and Imputation and Their Effects on Estimates of Food Stamp Program Participation," Journal of Human Resources, University of Wisconsin Press, vol. 57(5), pages 1605-1644.
    36. Lewbel, Arthur, 2000. "Semiparametric qualitative response model estimation with unknown heteroscedasticity or instrumental variables," Journal of Econometrics, Elsevier, vol. 97(1), pages 145-177, July.
    37. Scott Thompson, T., 1993. "Some efficiency bounds for semiparametric discrete choice models," Journal of Econometrics, Elsevier, vol. 58(1-2), pages 257-274, July.
    38. Butler, J S, 1996. "Estimating the Correlation in Censored Probit Models," The Review of Economics and Statistics, MIT Press, vol. 78(2), pages 356-358, May.
    39. Rothenberg, Thomas J, 1971. "Identification in Parametric Models," Econometrica, Econometric Society, vol. 39(3), pages 577-591, May.
    40. Hausman, J. A. & Abrevaya, Jason & Scott-Morton, F. M., 1998. "Misclassification of the dependent variable in a discrete-response setting," Journal of Econometrics, Elsevier, vol. 87(2), pages 239-269, September.
    41. Acerenza, Santiago & Ban, Kyunghoon & Kedagni, Desire, 2021. "Marginal Treatment Effects with Misclassified Treatment," ISU General Staff Papers 202106180700001132, Iowa State University, Department of Economics.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ha Trong Nguyen & Huong Thu Le & Luke Connelly & Francis Mitrou, 2023. "Accuracy of self‐reported private health insurance coverage," Health Economics, John Wiley & Sons, Ltd., vol. 32(12), pages 2709-2729, December.
    2. Denni Tommasi & Lina Zhang, 2024. "Identifying program benefits when participation is misreported," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(6), pages 1123-1148, September.
    3. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," IZA Discussion Papers 10943, IZA Network @ LISER.
    4. Akanksha Negi & Digvijay S. Negi, 2025. "Difference‐in‐Differences With a Misclassified Treatment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 40(4), pages 411-423, June.
    5. Bollinger, Christopher R. & Hirsch, Barry & Hokayem, Charles M. & Ziliak, James P., 2018. "Trouble in the Tails? What We Know about Earnings Nonresponse Thirty Years after Lillard, Smith, and Welch," IZA Discussion Papers 11710, IZA Network @ LISER.
    6. Celhay, Pablo & Meyer, Bruce D. & Mittag, Nikolas, 2024. "What leads to measurement errors? Evidence from reports of program participation in three surveys," Journal of Econometrics, Elsevier, vol. 238(2).
    7. Bruce Meyer & Nikolas Mittag, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," Working Papers 2017-075, Human Capital and Economic Opportunity Working Group.
    8. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Misclassification in binary choice models," Journal of Econometrics, Elsevier, vol. 200(2), pages 295-311.
    9. Peter Gottschalk & Minh Huynh, 2010. "Are Earnings Inequality and Mobility Overstated? The Impact of Nonclassical Measurement Error," The Review of Economics and Statistics, MIT Press, vol. 92(2), pages 302-315, May.
    10. Christopher R. Bollinger, 2001. "Response Error and the Union Wage Differential," Southern Economic Journal, John Wiley & Sons, vol. 68(1), pages 60-76, July.
    11. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, IZA Network @ LISER.
    12. Bruce D. Meyer & Nikolas Mittag, 2015. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," NBER Working Papers 21676, National Bureau of Economic Research, Inc.
    13. Adele Bergin, 2013. "Job Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," Economics Department Working Paper Series n240-13.pdf, Department of Economics, National University of Ireland - Maynooth.
    14. Arie Kapteyn & Jelmer Y. Ypma, 2007. "Measurement Error and Misclassification: A Comparison of Survey and Administrative Data," Journal of Labor Economics, University of Chicago Press, vol. 25(3), pages 513-551.
    15. Bruce D. Meyer & Nikolas Mittag, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," NBER Working Papers 25738, National Bureau of Economic Research, Inc.
    16. ChangHwan Kim & Christopher R. Tamborini, 2014. "Response Error in Earnings," Sociological Methods & Research, , vol. 43(1), pages 39-72, February.
    17. Aprajit Mahajan, 2006. "Identification and Estimation of Regression Models with Misclassification," Econometrica, Econometric Society, vol. 74(3), pages 631-665, May.
    18. Liu, Long, 2009. "On hourly wages and weekly earnings in the current population survey," Economics Letters, Elsevier, vol. 105(1), pages 113-116, October.
    19. Lynn, Peter & Jäckle, Annette & Sala, Emanuela & P. Jenkins, Stephen, 2004. "Validation of survey data on income and employment: the ISMIE experience," ISER Working Paper Series 2004-14, Institute for Social and Economic Research.
    20. Tommasi, Denni & Zhang, Lina, 2024. "Bounding program benefits when participation is misreported," Journal of Econometrics, Elsevier, vol. 238(1).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.14616. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.