IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2307.16315.html
   My bibliography  Save this paper

Towards Practical Robustness Auditing for Linear Regression

Author

Listed:
  • Daniel Freund
  • Samuel B. Hopkins

Abstract

We investigate practical algorithms to find or disprove the existence of small subsets of a dataset which, when removed, reverse the sign of a coefficient in an ordinary least squares regression involving that dataset. We empirically study the performance of well-established algorithmic techniques for this task -- mixed integer quadratically constrained optimization for general linear regression problems and exact greedy methods for special cases. We show that these methods largely outperform the state of the art and provide a useful robustness check for regression problems in a few dimensions. However, significant computational bottlenecks remain, especially for the important task of disproving the existence of such small sets of influential samples for regression problems of dimension $3$ or greater. We make some headway on this challenge via a spectral algorithm using ideas drawn from recent innovations in algorithmic robust statistics. We summarize the limitations of known techniques in several challenge datasets to encourage further algorithmic innovation.

Suggested Citation

  • Daniel Freund & Samuel B. Hopkins, 2023. "Towards Practical Robustness Auditing for Linear Regression," Papers 2307.16315, arXiv.org.
  • Handle: RePEc:arx:papers:2307.16315
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2307.16315
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Card, David & Krueger, Alan B, 1994. "Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania," American Economic Review, American Economic Association, vol. 84(4), pages 772-793, September.
    2. Meager, Rachael, 2022. "Aggregating distributional treatment effects: a Bayesian hierarchical analysis of the microcredit literature," LSE Research Online Documents on Economics 115559, London School of Economics and Political Science, LSE Library.
    3. repec:fth:prinin:315 is not listed on IDEAS
    4. Eubank, Nicholas & Fresh, Adriane, 2022. "Enfranchisement and Incarceration after the 1965 Voting Rights Act," American Political Science Review, Cambridge University Press, vol. 116(3), pages 791-806, August.
    5. David Card & Alan Krueger, 1993. "Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania," Working Papers 694, Princeton University, Department of Economics, Industrial Relations Section..
    6. Bruno Crépon & Florencia Devoto & Esther Duflo & William Parienté, 2015. "Estimating the Impact of Microcredit on Those Who Take It Up: Evidence from a Randomized Experiment in Morocco," American Economic Journal: Applied Economics, American Economic Association, vol. 7(1), pages 123-150, January.
    7. Finger, Robert & Möhring, Niklas, 2022. "The adoption of pesticide-free wheat production and farmers' perceptions of its environmental and health effects," Ecological Economics, Elsevier, vol. 198(C).
    8. Abhijit Banerjee & Esther Duflo & Rachel Glennerster & Cynthia Kinnan, 2015. "The Miracle of Microfinance? Evidence from a Randomized Evaluation," American Economic Journal: Applied Economics, American Economic Association, vol. 7(1), pages 22-53, January.
    9. Manuela Angelucci & Dean Karlan & Jonathan Zinman, 2015. "Microcredit Impacts: Evidence from a Randomized Microcredit Program Placement Experiment by Compartamos Banco," American Economic Journal: Applied Economics, American Economic Association, vol. 7(1), pages 151-182, January.
    10. Ankur Moitra & Dhruv Rohatgi, 2022. "Provably Auditing Ordinary Least Squares in Low Dimensions," Papers 2205.14284, arXiv.org, revised Jun 2022.
    11. Rachael Meager, 2022. "Aggregating Distributional Treatment Effects: A Bayesian Hierarchical Analysis of the Microcredit Literature," American Economic Review, American Economic Association, vol. 112(6), pages 1818-1847, June.
    12. Alessandro Tarozzi & Jaikishan Desai & Kristin Johnson, 2015. "The Impacts of Microcredit: Evidence from Ethiopia," American Economic Journal: Applied Economics, American Economic Association, vol. 7(1), pages 54-89, January.
    13. Antoine Falck & Adam Rej & David Thesmar, 2022. "When do systematic strategies decay?," Quantitative Finance, Taylor & Francis Journals, vol. 22(11), pages 1955-1969, November.
    14. Orazio Attanasio & Britta Augsburg & Ralph De Haas & Emla Fitzsimons & Heike Harmgart, 2015. "The Impacts of Microfinance: Evidence from Joint-Liability Lending in Mongolia," American Economic Journal: Applied Economics, American Economic Association, vol. 7(1), pages 90-122, January.
    15. Harrison, David Jr. & Rubinfeld, Daniel L., 1978. "Hedonic housing prices and the demand for clean air," Journal of Environmental Economics and Management, Elsevier, vol. 5(1), pages 81-102, March.
    16. Luis R. Martínez, 2022. "How Much Should We Trust the Dictator’s GDP Growth Estimates?," Journal of Political Economy, University of Chicago Press, vol. 130(10), pages 2731-2769.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bernardus F Nazar Van Doornik & Armando Gomes & David Schoenherr & Janis Skrastins, 2023. "Financial access and labor market outcomes: evidence from credit lotteries," BIS Working Papers 1071, Bank for International Settlements.
    2. Dagmara Celik Katreniak & Alexey Khazanov & Omer Moav & Zvika Neeman & Hosny Zoabi, 2023. "Why Not Borrow, Invest, and Escape Poverty?," Papers 2305.02546, arXiv.org.
    3. Lucia Dalla Pellegrina & Giorgio Di Maio & Paolo Landoni & Emanuele Rusinà, 2021. "Money management and entrepreneurial training in microfinance: impact on beneficiaries and institutions," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 38(3), pages 1049-1085, October.
    4. Emily Breza & Cynthia Kinnan, 2021. "Measuring the Equilibrium Impacts of Credit: Evidence from the Indian Microfinance Crisis," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 136(3), pages 1447-1497.
    5. N'dri, Lasme Mathieu & Kakinaka, Makoto, 2020. "Financial inclusion, mobile money, and individual welfare: The case of Burkina Faso," Telecommunications Policy, Elsevier, vol. 44(3).
    6. Pedro Carneiro & Sokbae Lee & Daniel Wilhelm, 2020. "Optimal data collection for randomized control trials [Microcredit impacts: Evidence from a randomized microcredit program placement experiment by Compartamos Banco]," The Econometrics Journal, Royal Economic Society, vol. 23(1), pages 1-31.
    7. Abhijit Banerjee & Emily Breza & Esther Duflo & Cynthia Kinnan, 2019. "Can Microfinance Unlock a Poverty Trap for Some Entrepreneurs?," NBER Working Papers 26346, National Bureau of Economic Research, Inc.
    8. Daniel Bjorkegren & Joshua Blumenstock & Omowunmi Folajimi-Senjobi & Jacqueline Mauro & Suraj R. Nair, 2022. "Instant Loans Can Lift Subjective Well-Being: A Randomized Evaluation of Digital Credit in Nigeria," Papers 2202.13540, arXiv.org.
    9. Ahlin, Christian & Gulesci, Selim & Madestam, Andreas & Stryjan, Miri, 2020. "Loan contract structure and adverse selection: Survey evidence from Uganda," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 180-195.
    10. Gyorgy Molnar & Attila Havas, 2019. "Escaping from the poverty trap with social innovation: a social microcredit programme in Hungary," CERS-IE WORKING PAPERS 1912, Institute of Economics, Centre for Economic and Regional Studies.
    11. Karlan, Dean & Osman, Adam & Zinman, Jonathan, 2016. "Follow the money not the cash: Comparing methods for identifying consumption and investment responses to a liquidity shock," Journal of Development Economics, Elsevier, vol. 121(C), pages 11-23.
    12. Nakano, Yuko & Magezi, Eustadius F., 2020. "The impact of microcredit on agricultural technology adoption and productivity: Evidence from randomized control trial in Tanzania," World Development, Elsevier, vol. 133(C).
    13. Fumagalli, Laura & Martin, Thomas, 2023. "Child labor among farm households in Mozambique and the role of reciprocal adult labor," World Development, Elsevier, vol. 161(C).
    14. Tamara Broderick & Ryan Giordano & Rachael Meager, 2020. "An Automatic Finite-Sample Robustness Metric: When Can Dropping a Little Data Make a Big Difference?," Papers 2011.14999, arXiv.org, revised Jul 2023.
    15. Bernardus Van Doornik & Armando Gomes & David Schoenherr & Janis Skrastins, 2021. "Financial Access and Labor Market Outcomes: Evidence from Credit Lotteries," Working Papers 2021-56, Princeton University. Economics Department..
    16. Lota Tamini & Ibrahima Bocoum & Ghislain Auger & Kotchikpa Gabriel Lawin & Arahama Traoré, 2019. "Enhanced Microfinance Services and Agricultural Best Management Practices: What Benefits for Smallholders Farmers? An Evidence from Burkina Faso," CIRANO Working Papers 2019s-11, CIRANO.
    17. Susmita Baulia, 2017. "Take-up of joint and individual liability loans: an analysis with laboratory experiments," Discussion Papers 117, Aboa Centre for Economics.
    18. Oriana Bandiera & Robin Burgess & Erika Deserranno & Ricardo Morel & Imran Rasul & Munshi Sulaiman & Jack Thiemel, 2022. "Microfinance and Diversification," Economica, London School of Economics and Political Science, vol. 89(S1), pages 239-275, June.
    19. Mathilde Maîtrot & Miguel Niño-Zarazúa, 2017. "Poverty and wellbeing impacts of microfinance: What do we know?," WIDER Working Paper Series wp-2017-190, World Institute for Development Economic Research (UNU-WIDER).
    20. Victor Chernozhukov & Mert Demirer & Esther Duflo & Ivan Fernandez-Val, 2017. "Generic machine learning inference on heterogenous treatment effects in randomized experiments," CeMMAP working papers 61/17, Institute for Fiscal Studies.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2307.16315. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.