IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v200y2017i2p295-311.html
   My bibliography  Save this article

Misclassification in binary choice models

Author

Listed:
  • Meyer, Bruce D.
  • Mittag, Nikolas

Abstract

Bias from misclassification of binary dependent variables can be pronounced. We examine what can be learned from such contaminated data. First, we derive the asymptotic bias in parametric models allowing misclassification to be correlated with observables and unobservables. Simulations and validation data show that the bias formulas are accurate in finite samples and in most situations imply attenuation. Second, we examine the bias in a prototypical application. Erroneously restricting the covariance of misclassification and covariates aggravates the bias for all estimators we examine. Estimators that relax this restriction perform well if a model of misclassification or validation data is available.

Suggested Citation

  • Meyer, Bruce D. & Mittag, Nikolas, 2017. "Misclassification in binary choice models," Journal of Econometrics, Elsevier, vol. 200(2), pages 295-311.
  • Handle: RePEc:eee:econom:v:200:y:2017:i:2:p:295-311
    DOI: 10.1016/j.jeconom.2017.06.012
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407617300982
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2017.06.012?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Shuaizhang Feng & Yingyao Hu, 2013. "Misclassification Errors and the Underestimation of the US Unemployment Rate," American Economic Review, American Economic Association, vol. 103(2), pages 1054-1070, April.
    2. Aigner, Dennis J., 1973. "Regression with a binary independent variable subject to errors of observation," Journal of Econometrics, Elsevier, vol. 1(1), pages 49-59, March.
    3. Ruud, Paul A., 1986. "Consistent estimation of limited dependent variable models despite misspecification of distribution," Journal of Econometrics, Elsevier, vol. 32(1), pages 157-187, June.
    4. Kreider, Brent & Pepper, John V., 2011. "Identification of Expected Outcomes in a Data Error Mixing Model With Multiplicative Mean Independence," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 49-60.
    5. Bruce D. Meyer & Wallace K. C. Mok & James X. Sullivan, 2015. "Household Surveys in Crisis," Journal of Economic Perspectives, American Economic Association, vol. 29(4), pages 199-226, Fall.
    6. Arturo Estrella & Frederic S. Mishkin, 1998. "Predicting U.S. Recessions: Financial Variables As Leading Indicators," The Review of Economics and Statistics, MIT Press, vol. 80(1), pages 45-61, February.
    7. Fearon, James D. & Laitin, David D., 2003. "Ethnicity, Insurgency, and Civil War," American Political Science Review, Cambridge University Press, vol. 97(1), pages 75-90, February.
    8. Bound, John & Brown, Charles & Mathiowetz, Nancy, 2001. "Measurement error in survey data," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 59, pages 3705-3843, Elsevier.
    9. Lance Lochner & Enrico Moretti, 2004. "The Effect of Education on Crime: Evidence from Prison Inmates, Arrests, and Self-Reports," American Economic Review, American Economic Association, vol. 94(1), pages 155-189, March.
    10. Horowitz, Joel L & Manski, Charles F, 1995. "Identification and Robustness with Contaminated and Corrupted Data," Econometrica, Econometric Society, vol. 63(2), pages 281-302, March.
    11. Brent Kreider & John Pepper, 2008. "Inferring disability status from corrupt data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(3), pages 329-349.
    12. Bollinger, Christopher R & David, Martin H, 2001. "Estimation with Response Error and Nonresponse: Food-Stamp Participation in the SIPP," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(2), pages 129-141, April.
    13. Bollinger, Christopher R., 1996. "Bounding mean regressions when a binary regressor is mismeasured," Journal of Econometrics, Elsevier, vol. 73(2), pages 387-399, August.
    14. Yatchew, Adonis & Griliches, Zvi, 1985. "Specification Error in Probit Models," The Review of Economics and Statistics, MIT Press, vol. 67(1), pages 134-139, February.
    15. Hu, Yingyao, 2008. "Identification and estimation of nonlinear models with misclassification error using instrumental variables: A general solution," Journal of Econometrics, Elsevier, vol. 144(1), pages 27-61, May.
    16. Lewbel, Arthur, 2000. "Identification Of The Binary Choice Model With Misclassification," Econometric Theory, Cambridge University Press, vol. 16(4), pages 603-609, August.
    17. Hausman, J. A. & Abrevaya, Jason & Scott-Morton, F. M., 1998. "Misclassification of the dependent variable in a discrete-response setting," Journal of Econometrics, Elsevier, vol. 87(2), pages 239-269, September.
    18. Xiaohong Chen & Han Hong & Denis Nekipelov, 2011. "Nonlinear Models of Measurement Errors," Journal of Economic Literature, American Economic Association, vol. 49(4), pages 901-937, December.
    19. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," Review of Economic Studies, Oxford University Press, vol. 61(4), pages 655-680.
    20. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, vol. 144(1), pages 81-117, May.
    21. Wooldridge, Jeffrey M., 2007. "Inverse probability weighted estimation for general missing data problems," Journal of Econometrics, Elsevier, vol. 141(2), pages 1281-1301, December.
    22. Hugo Benítez-Silva & Moshe Buchinsky & Hiu Man Chan & Sofia Cheidvasser & John Rust, 2004. "How large is the bias in self-reported disability?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 19(6), pages 649-670.
    23. Black, Dan & Sanders, Seth & Taylor, Lowell, 2003. "Measurement of Higher Education in the Census and Current Population Survey," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 545-554, January.
    24. Poterba, James M & Summers, Lawrence H, 1995. "Unemployment Benefits and Labor Market Transitions: A Multinomial Logit Model with Errors in Classification," The Review of Economics and Statistics, MIT Press, vol. 77(2), pages 207-216, May.
    25. Stephen V. Cameron & James J. Heckman, 2001. "The Dynamics of Educational Attainment for Black, Hispanic, and White Males," Journal of Political Economy, University of Chicago Press, vol. 109(3), pages 455-499, June.
    26. Bound, John & Brown, Charles & Duncan, Greg J & Rodgers, Willard L, 1994. "Evidence on the Validity of Cross-Sectional and Longitudinal Labor Market Data," Journal of Labor Economics, University of Chicago Press, vol. 12(3), pages 345-368, July.
    27. Zvi Eckstein & Kenneth I. Wolpin, 1999. "Why Youths Drop Out of High School: The Impact of Preferences, Opportunities, and Abilities," Econometrica, Econometric Society, vol. 67(6), pages 1295-1340, November.
    28. Ruud, Paul A, 1983. "Sufficient Conditions for the Consistency of Maximum Likelihood Estimation Despite Misspecifications of Distribution in Multinomial Discrete Choice Models," Econometrica, Econometric Society, vol. 51(1), pages 225-228, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bruce Meyer & Nikolas Mittag, 2013. "Misclassification In Binary Choice Models," Working Papers 13-27, Center for Economic Studies, U.S. Census Bureau.
    2. Takahide Yanagi, 2019. "Inference on local average treatment effects for misclassified treatment," Econometric Reviews, Taylor & Francis Journals, vol. 38(8), pages 938-960, September.
    3. Craig Gundersen & Brent Kreider, 2008. "Food Stamps and Food Insecurity: What Can Be Learned in the Presence of Nonclassical Measurement Error?," Journal of Human Resources, University of Wisconsin Press, vol. 43(2), pages 352-382.
    4. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, vol. 144(1), pages 81-117, May.
    5. Tommasi, Denni & Zhang, Lina, 2024. "Bounding program benefits when participation is misreported," Journal of Econometrics, Elsevier, vol. 238(1).
    6. Acerenza, Santiago & Ban, Kyunghoon & Kedagni, Desire, 2021. "Marginal Treatment Effects with Misclassified Treatment," ISU General Staff Papers 202106180700001132, Iowa State University, Department of Economics.
    7. Orville Mondal & Rui Wang, 2024. "Partial Identification of Binary Choice Models with Misreported Outcomes," Papers 2401.17137, arXiv.org.
    8. Kreider, Brent & Pepper, John V., 2007. "Disability and Employment: Reevaluating the Evidence in Light of Reporting Errors," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 432-441, June.
    9. Battistin, Erich & De Nadai, Michele & Sianesi, Barbara, 2014. "Misreported schooling, multiple measures and returns to educational qualifications," Journal of Econometrics, Elsevier, vol. 181(2), pages 136-150.
    10. Daniel Wilhelm, 2018. "Testing for the presence of measurement error," CeMMAP working papers CWP45/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    11. Christian vom Lehn & Cache Ellsworth & Zachary Kroff, 2022. "Reconciling Occupational Mobility in the Current Population Survey," Journal of Labor Economics, University of Chicago Press, vol. 40(4), pages 1005-1051.
    12. Arthur Lewbel, 2007. "Estimation of Average Treatment Effects with Misclassification," Econometrica, Econometric Society, vol. 75(2), pages 537-551, March.
    13. Hu, Yingyao, 2017. "The econometrics of unobservables: Applications of measurement error models in empirical industrial organization and labor economics," Journal of Econometrics, Elsevier, vol. 200(2), pages 154-168.
    14. Francis DiTraglia & Camilo Garcia-Jimeno, 2015. "On Mis-measured Binary Regressors: New Results And Some Comments on the Literature, Third Version," PIER Working Paper Archive 15-040, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 24 Nov 2015.
    15. Battistin, Erich & De Nadai, Michele & Vuri, Daniela, 2017. "Counting rotten apples: Student achievement and score manipulation in Italian elementary Schools," Journal of Econometrics, Elsevier, vol. 200(2), pages 344-362.
    16. Brent Kreider & John V. Pepper, 2011. "Identification of Expected Outcomes in a Data Error Mixing Model With Multiplicative Mean Independence," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 49-60, January.
    17. Gundersen, Craig & Kreider, Brent, 2009. "Bounding the effects of food insecurity on children's health outcomes," Journal of Health Economics, Elsevier, vol. 28(5), pages 971-983, September.
    18. Francis J. DiTraglia & Camilo Garcia-Jimeno, 2020. "Identifying the effect of a mis-classified, binary, endogenous regressor," Papers 2011.07272, arXiv.org.
    19. DiTraglia, Francis J. & García-Jimeno, Camilo, 2019. "Identifying the effect of a mis-classified, binary, endogenous regressor," Journal of Econometrics, Elsevier, vol. 209(2), pages 376-390.
    20. Akanksha Negi & Digvijay Singh Negi, 2022. "Difference-in-Differences with a Misclassified Treatment," Papers 2208.02412, arXiv.org.

    More about this item

    Keywords

    Measurement error; Binary choice models; Program take-up; Food stamps;
    All these keywords.

    JEL classification:

    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • H53 - Public Economics - - National Government Expenditures and Related Policies - - - Government Expenditures and Welfare Programs

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:200:y:2017:i:2:p:295-311. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.