Binary Choice with Asymmetric Loss in a Data-Rich Environment: Theory and an Application to Racial Justice

My bibliography Save this paper

Binary Choice with Asymmetric Loss in a Data-Rich Environment: Theory and an Application to Racial Justice

Author

Listed:

Ghysels, Eric
Babii, Andrii
Chen, Xi
Kumar, Rohit

Registered:

Abstract

The importance of asymmetries in prediction problems arising in economics has been recognized for a long time. In this paper, we focus on binary choice problems in a data-rich environment with general loss functions. In contrast to the asymmetric regression problems, the binary choice with general loss functions and high-dimensional datasets is challenging and not well understood. Econometricians have studied binary choice problems for a long time, but the literature does not offer computationally attractive solutions in data-rich environments. In contrast, the machine learning literature has many computationally attractive algorithms that form the basis for much of the automated procedures that are implemented in practice, but it is focused on symmetric loss functions that are independent of individual characteristics. One of the main contributions of our paper is to show that the theoretically valid predictions of binary outcomes with arbitrary loss functions can be achieved via a very simple reweighting of the logistic regression, or other state-of-the-art machine learning techniques, such as boosting or (deep) neural networks. We apply our analysis to racial justice in pretrial detention.

Suggested Citation

Ghysels, Eric & Babii, Andrii & Chen, Xi & Kumar, Rohit, 2020. "Binary Choice with Asymmetric Loss in a Data-Rich Environment: Theory and an Application to Racial Justice," CEPR Discussion Papers 15418, C.E.P.R. Discussion Papers.

Handle: RePEc:cpr:ceprdp:15418

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Hutchinson, James M & Lo, Andrew W & Poggio, Tomaso, 1994. "A Nonparametric Approach to Pricing and Hedging Derivative Securities via Learning Networks," Journal of Finance, American Finance Association, vol. 49(3), pages 851-889, July.
- James M. Hutchinson & Andrew W. Lo & Tomaso Poggio, 1994. "A Nonparametric Approach to Pricing and Hedging Derivative Securities Via Learning Networks," NBER Working Papers 4718, National Bureau of Economic Research, Inc.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "High-Dimensional Methods and Inference on Structural and Treatment Effects," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 29-50, Spring.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers 59/13, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers CWP59/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Max H. Farrell & Tengyuan Liang & Sanjog Misra, 2021. "Deep Neural Networks for Estimation and Inference," Econometrica, Econometric Society, vol. 89(1), pages 181-213, January.
- Max H. Farrell & Tengyuan Liang & Sanjog Misra, 2018. "Deep Neural Networks for Estimation and Inference," Papers 1809.09953, arXiv.org, revised Sep 2019.
Graham Elliott & Allan Timmermann, 2016. "Economic Forecasting," Economics Books, Princeton University Press, edition 1, number 10740.
- Graham Elliott & Allan Timmermann, 2008. "Economic Forecasting," Journal of Economic Literature, American Economic Association, vol. 46(1), pages 3-56, March.
- Timmermann, Allan & Elliott, Graham, 2007. "Economic Forecasting," CEPR Discussion Papers 6158, C.E.P.R. Discussion Papers.
Elliott, Graham & Lieli, Robert P., 2013. "Predicting binary outcomes," Journal of Econometrics, Elsevier, vol. 174(1), pages 15-26.
Xiaohong Chen & Sydney C. Ludvigson, 2009. "Land of addicts? an empirical investigation of habit‐based asset pricing models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(7), pages 1057-1093, November.
- Xiaohong Chen & Sydney C. Ludvigson, 2009. "Land of addicts? an empirical investigation of habit-based asset pricing models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(7), pages 1057-1093.
- Sydney Ludvigson & Xiaohong Chen, 2004. "Land of Addicts? An Empirical Investigation of Habit-Based Asset Pricing Models," 2004 Meeting Papers 692, Society for Economic Dynamics.
Susan Athey & Guido W. Imbens, 2019. "Machine Learning Methods That Economists Should Know About," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 685-725, August.
Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
- Susan Athey & Stefan Wager, 2017. "Policy Learning with Observational Data," Papers 1702.02896, arXiv.org, revised Sep 2020.
Lee, Tae-Hwy & White, Halbert & Granger, Clive W. J., 1993. "Testing for neglected nonlinearity in time series models : A comparison of neural network methods and alternative tests," Journal of Econometrics, Elsevier, vol. 56(3), pages 269-290, April.
- Tom Doan, "undated". "REGRESET: RATS procedure to perform Ramsey RESET test on regression," Statistical Software Components RTS00181, Boston College Department of Economics.
- Tom Doan, "undated". "REGWHITENNTEST: RATS procedure to perform White neural network test on regression," Statistical Software Components RTS00183, Boston College Department of Economics.
Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
- Shihao Gu & Bryan Kelly & Dacheng Xiu, 2018. "Empirical Asset Pricing via Machine Learning," NBER Working Papers 25398, National Bureau of Economic Research, Inc.
- Shihao Gu & Bryan T. Kelly & Dacheng Xiu, 2018. "Empirical Asset Pricing via Machine Learning," Swiss Finance Institute Research Paper Series 18-71, Swiss Finance Institute.
Xiaohong Chen & Xiaotong Shen, 1998. "Sieve Extremum Estimates for Weakly Dependent Data," Econometrica, Econometric Society, vol. 66(2), pages 289-314, March.
repec:cdl:ucsdec:qt6z55v472 is not listed on IDEAS
Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
Athey, Susan & Imbens, Guido W., 2019. "Machine Learning Methods Economists Should Know About," Research Papers 3776, Stanford University, Graduate School of Business.
- Susan Athey & Guido Imbens, 2019. "Machine Learning Methods Economists Should Know About," Papers 1903.10075, arXiv.org.
Graham Elliott & Allan Timmermann, 2016. "Forecasting in Economics and Finance," Annual Review of Economics, Annual Reviews, vol. 8(1), pages 81-110, October.
- Timmermann, Allan & Elliott, Graham, 2016. "Forecasting in Economics and Finance," CEPR Discussion Papers 11354, C.E.P.R. Discussion Papers.
Aigner, D J & Amemiya, Takeshi & Poirier, Dale J, 1976. "On the Estimation of Production Frontiers: Maximum Likelihood Estimation of the Parameters of a Discontinuous Density Function," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 17(2), pages 377-396, June.
Boyes, William J. & Hoffman, Dennis L. & Low, Stuart A., 1989. "An econometric analysis of the bank credit scoring problem," Journal of Econometrics, Elsevier, vol. 40(1), pages 3-14, January.
Bartlett, Peter L. & Jordan, Michael I. & McAuliffe, Jon D., 2006. "Convexity, Classification, and Risk Bounds," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 138-156, March.
Manski, Charles F., 1975. "Maximum score estimation of the stochastic utility model of choice," Journal of Econometrics, Elsevier, vol. 3(3), pages 205-228, August.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Matthew Harding & Gabriel F. R. Vasconcelos, 2022. "Managers versus Machines: Do Algorithms Replicate Human Intuition in Credit Ratings?," Papers 2202.04218, arXiv.org.
Andrii Babii & Eric Ghysels & Jonas Striaukas, 2024. "Econometrics of machine learning methods in economic forecasting," Chapters, in: Michael P. Clements & Ana Beatriz GalvÃ£o (ed.), Handbook of Research Methods and Applications in Macroeconomic Forecasting, chapter 10, pages 246-273, Edward Elgar Publishing.
- Andrii Babii & Eric Ghysels & Jonas Striaukas, 2023. "Econometrics of Machine Learning Methods in Economic Forecasting," Papers 2308.10993, arXiv.org.
Lu, Xuefei & Calabrese, Raffaella, 2023. "The Cohort Shapley value to measure fairness in financing small and medium enterprises in the UK," Finance Research Letters, Elsevier, vol. 58(PC).
Toru Kitagawa & Shosei Sakaguchi & Aleksey Tetenov, 2021. "Constrained Classification and Policy Learning," Papers 2106.12886, arXiv.org, revised Jul 2023.
Lei Bill Wang & Zhenbang Jiao & Fangyi Wang, 2025. "Policy-Oriented Binary Classification: Improving (KD-)CART Final Splits for Subpopulation Targeting," Papers 2502.15072, arXiv.org, revised Oct 2025.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Andrii Babii & Xi Chen & Eric Ghysels & Rohit Kumar, 2020. "Binary Choice under Asymmetric Loss in a Data-Rich Environment: Theory and an Application to Algorithmic Fairness," Papers 2010.08463, arXiv.org, revised Nov 2025.
Su, Jiun-Hua, 2021. "Model selection in utility-maximizing binary prediction," Journal of Econometrics, Elsevier, vol. 223(1), pages 96-124.
Hector F. Calvo-Pardo & Tullio Mancini & Jose Olmo, 2020. "Neural Network Models for Empirical Finance," JRFM, MDPI, vol. 13(11), pages 1-22, October.
Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
- Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2020. "Machine Learning Advances for Time Series Forecasting," Papers 2012.12802, arXiv.org, revised Apr 2021.
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Regional Science and Urban Economics, Elsevier, vol. 94(C).
- Gobillon, Laurent & Combes, Pierre-Philippe & Zylberberg, Yanos, 2020. "Urban economics in a historical perspective: Recovering data with machine learning," CEPR Discussion Papers 15308, C.E.P.R. Discussion Papers.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," PSE-Ecole d'économie de Paris (Postprint) halshs-03673240, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," Post-Print halshs-03673240, HAL.
- Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2021. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," IZA Discussion Papers 14392, Institute of Labor Economics (IZA).
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," PSE Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," SciencePo Working papers Main halshs-03673240, HAL.
Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
Hui Chen & Antoine Didisheim & Simon Scheidegger, 2021. "Deep Structural Estimation: With an Application to Option Pricing," Papers 2102.09209, arXiv.org.
- Hui Chen & Antoine Didisheim & Simon Scheidegger, 2021. "Deep Structural Estimation:With an Application to Option Pricing," Cahiers de Recherches Economiques du Département d'économie 21.14, Université de Lausanne, Faculté des HEC, Département d’économie.
Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Regional Science and Urban Economics, Elsevier, vol. 94(C).
- Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2020. "Urban economics in a historical perspective: Recovering data with machine learning," CEPR Discussion Papers 15308, C.E.P.R. Discussion Papers.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," SciencePo Working papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & yanos Zylberberg, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," PSE-Ecole d'économie de Paris (Postprint) halshs-03673240, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & yanos Zylberberg, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Post-Print halshs-03673240, HAL.
- Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2021. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," IZA Discussion Papers 14392, Institute of Labor Economics (IZA).
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," PSE Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & yanos Zylberberg, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," SciencePo Working papers halshs-03673240, HAL.
Jozef Barunik & Lubos Hanus, 2022. "Learning Probability Distributions in Macroeconomics and Finance," Papers 2204.06848, arXiv.org.
Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
Jiun-Hua Su, 2019. "Model Selection in Utility-Maximizing Binary Prediction," Papers 1903.00716, arXiv.org, revised Jul 2020.
Ay, Jean-Sauveur & Le Gallo, Julie, 2021. "The Signaling Values of Nested Wine Names," Working Papers 321851, American Association of Wine Economists.
- Jean-Sauveur Ay & Julie Le Gallo, 2021. "The signaling value of nested wine names," Post-Print hal-03268014, HAL.
Xiaohong Chen & Demian Pouzo, 2012. "Estimation of Nonparametric Conditional Moment Models With Possibly Nonsmooth Generalized Residuals," Econometrica, Econometric Society, vol. 80(1), pages 277-321, January.
- Xiaohong Chen & Demian Pouzo, 2008. "Estimation of Nonparametric Conditional Moment Models With Possibly Nonsmooth Generalized Residuals," Cowles Foundation Discussion Papers 1650R, Cowles Foundation for Research in Economics, Yale University, revised Jul 2009.
- Xiaohong Chen & Demian Pouzo, 2008. "Estimation of Nonparametric Conditional Moment Models With Possibly Nonsmooth Generalized Residuals," Cowles Foundation Discussion Papers 1650RR, Cowles Foundation for Research in Economics, Yale University, revised Jan 2011.
Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
- Byron Botha & Rulof Burger & Kevin Kotze & Neil Rankin & Daan Steenkamp, 2022. "Big data forecasting of South African inflation," ERSA Working Paper Series, Economic Research Southern Africa, vol. 0.
- Byron Botha & Rulof Burger & Kevin Kotze & Neil Rankin & Daan Steenkamp, 2022. "Big data forecasting of South African inflation," School of Economics Macroeconomic Discussion Paper Series 2022-03, School of Economics, University of Cape Town.
- Byron Botha & Rulof Burger & Kevin Kotz & Neil Rankin & Daan Steenkamp, 2022. "Big data forecasting of South African inflation," Working Papers 11022, South African Reserve Bank.
Chen, Le-Yu & Lee, Sokbae, 2018. "Best subset binary prediction," Journal of Econometrics, Elsevier, vol. 206(1), pages 39-56.
- Le-Yu Chen & Sokbae Lee, 2016. "Best Subset Binary Prediction," Papers 1610.02738, arXiv.org, revised May 2018.
- Le-Yu Chen & Sokbae (Simon) Lee, 2017. "Best subset binary prediction," CeMMAP working papers CWP50/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Le-Yu Chen & Sokbae (Simon) Lee, 2017. "Best subset binary prediction," CeMMAP working papers 50/17, Institute for Fiscal Studies.
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP54/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Rama K. Malladi, 2024. "Benchmark Analysis of Machine Learning Methods to Forecast the U.S. Annual Inflation Rate During a High-Decile Inflation Period," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 335-375, July.
Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2021-05-24 (Big Data)
NEP-CMP-2021-05-24 (Computational Economics)
NEP-DCM-2021-05-24 (Discrete Choice Models)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:15418. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Binary Choice with Asymmetric Loss in a Data-Rich Environment: Theory and an Application to Racial Justice

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data