IDEAS home Printed from https://ideas.repec.org/p/azt/cemmap/16-25.html
   My bibliography  Save this paper

Binary classification with the maximum score model and linear programming

Author

Listed:
  • Joel L. Horowitz
  • Sokbae Lee

Abstract

This paper presents a computationally efficient method for binary classification using Manski's (1975,1985) maximum score model when covariates are discretely distributed and parameters are partially but not point identified. We establish conditions under which it is minimax optimal to allow for either non-classification or random classification and derive finite-sample and asymptotic lower bounds on the probability of correct classification. We also describe an extension of our method to continuous covariates. Our approach avoids the computational difficulty of maximum score estimation by reformulating the problem as two linear programs. Compared to parametric and nonparametric methods, our method balances extrapolation ability with minimal distributional assumptions. Monte Carlo simulations and empirical applications demonstrate its effectiveness and practical relevance.

Suggested Citation

  • Joel L. Horowitz & Sokbae Lee, 2025. "Binary classification with the maximum score model and linear programming," CeMMAP working papers 16/25, Institute for Fiscal Studies.
  • Handle: RePEc:azt:cemmap:16/25
    DOI: 10.47004/wp.cem.2025.1625
    as

    Download full text from publisher

    File URL: https://cemmap.ac.uk/wp-content/uploads/2025/08/CWP1625-Binary-classification-with-the-maximum-score-model-and-linear-programming.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.47004/wp.cem.2025.1625?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Charles F. Manski & Elie Tamer, 2002. "Inference on Regressions with Interval Data on a Regressor or Outcome," Econometrica, Econometric Society, vol. 70(2), pages 519-546, March.
    2. Horowitz, Joel L, 1992. "A Smoothed Maximum Score Estimator for the Binary Response Model," Econometrica, Econometric Society, vol. 60(3), pages 505-531, May.
    3. Emily Breza & Arun G. Chandrasekhar & Davide Viviano, 2025. "Generalizability with ignorance in mind: learning what we do (not) know for archetypes discovery," Papers 2501.13355, arXiv.org, revised Jul 2025.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Le-Yu & Lee, Sokbae, 2018. "Best subset binary prediction," Journal of Econometrics, Elsevier, vol. 206(1), pages 39-56.
    2. repec:cep:stiecm:em/2012/559 is not listed on IDEAS
    3. Chen, Songnian & Khan, Shakeeb & Tang, Xun, 2016. "Informational content of special regressors in heteroskedastic binary response models," Journal of Econometrics, Elsevier, vol. 193(1), pages 162-182.
    4. Jason R. Blevins, 2013. "Non-Standard Rates of Convergence of Criterion-Function-Based Set Estimators," Working Papers 13-02, Ohio State University, Department of Economics.
    5. Tatiana Komarova, 2012. "Binary Choice Models with Discrete Regressors: Identification and Misspecification," STICERD - Econometrics Paper Series 559, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    6. Xun Tang, 2009. "Binary Regressions with Bounded Median Dependence," PIER Working Paper Archive 09-003, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    7. Wan, Yuanyuan & Xu, Haiqing, 2015. "Inference in semiparametric binary response models with interval data," Journal of Econometrics, Elsevier, vol. 184(2), pages 347-360.
    8. Arthur Lewbel, 2019. "The Identification Zoo: Meanings of Identification in Econometrics," Journal of Economic Literature, American Economic Association, vol. 57(4), pages 835-903, December.
    9. Xi Wang & Songnian Chen, 2022. "Partial Identification and Estimation of Semiparametric Ordered Response Models with Interval Regressor Data," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(4), pages 830-849, August.
    10. Chen, Songnian & Zhang, Hanghui, 2015. "Binary quantile regression with local polynomial smoothing," Journal of Econometrics, Elsevier, vol. 189(1), pages 24-40.
    11. repec:cep:stiecm:/2012/559 is not listed on IDEAS
    12. Komarova, Tatiana, 2013. "Binary choice models with discrete regressors: Identification and misspecification," Journal of Econometrics, Elsevier, vol. 177(1), pages 14-33.
    13. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    14. Ariel Pakes & Jack Porter, 2024. "Moment inequalities for multinomial choice with fixed effects," Quantitative Economics, Econometric Society, vol. 15(1), pages 1-25, January.
    15. Kaido, Hiroaki, 2017. "Asymptotically Efficient Estimation Of Weighted Average Derivatives With An Interval Censored Variable," Econometric Theory, Cambridge University Press, vol. 33(5), pages 1218-1241, October.
    16. Patrick Bajari & Jeremy Fox & Stephen Ryan, 2008. "Evaluating wireless carrier consolidation using semiparametric demand estimation," Quantitative Marketing and Economics (QME), Springer, vol. 6(4), pages 299-338, December.
    17. Menzel, Konrad, 2014. "Consistent estimation with many moment inequalities," Journal of Econometrics, Elsevier, vol. 182(2), pages 329-350.
    18. Xiaohong Chen & Oliver Linton & Ingrid Van Keilegom, 2003. "Estimation of Semiparametric Models when the Criterion Function Is Not Smooth," Econometrica, Econometric Society, vol. 71(5), pages 1591-1608, September.
    19. Kemp, Gordon C.R. & Santos Silva, J.M.C., 2012. "Regression towards the mode," Journal of Econometrics, Elsevier, vol. 170(1), pages 92-101.
    20. Thierry Kalisa & Mary Riddel & W. Douglass Shaw, 2016. "Willingness to pay to avoid arsenic-related risks: a special regressor approach," Journal of Environmental Economics and Policy, Taylor & Francis Journals, vol. 5(2), pages 143-162, July.
    21. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    22. Steven T Berry & Giovanni Compiani, 2023. "An Instrumental Variable Approach to Dynamic Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(4), pages 1724-1758.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:azt:cemmap:16/25. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Dermot Watson (email available below). General contact details of provider: https://edirc.repec.org/data/ifsssuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.