IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/28229.html

The Accuracy of Tax Imputations: Estimating Tax Liabilities and Credits Using Linked Survey and Administrative Data

Author

Listed:
  • Bruce D. Meyer
  • Derek Wu
  • Grace Finley
  • Patrick Langetieg
  • Carla Medalia
  • Mark Payne
  • Alan Plumley

Abstract

This paper calculates accurate estimates of income and payroll taxes using a groundbreaking set of linked survey and administrative tax data that are part of the Comprehensive Income Dataset (CID). We compare our estimates to survey imputations produced by the Census Bureau and those generated using the TAXSIM calculator from the National Bureau of Economic Research. The administrative data include two sets of Internal Revenue Service (IRS) data: (1) a limited set of tax information for the population of individual income tax returns covering selected line items from Forms 1040, W-2, and 1099-R; and (2) an extensive set of population tax records processed by the IRS in 2011, covering nearly every line item on Form 1040 and most lines on a series of third-party information returns. We link these IRS records to the Current Population Survey Annual Social and Economic Supplement (CPS) for reference year 2010. We describe how we form tax units and estimate various types of tax liabilities and credits using these linked data, providing a roadmap for constructing accurate measures of taxes while preserving the survey family as the sharing unit for distributional analyses. We find that aggregate estimates of various tax components using the limited and extensive tax data estimates are close to each other and much closer to public IRS tabulations than either of the imputations using survey data alone. At the individual level, the absolute errors of survey-only imputations of federal income taxes and total taxes are on average 10% and 13%, respectively, of adjusted gross income. In contrast, the limited tax data imputations yield mean absolute errors for federal income taxes and total taxes that are about 2% and 3% of adjusted gross income, respectively. For the Earned Income Tax Credit, the limited tax data imputation is off by less than $20 on average for a typical family (compared to more than $500 using either of the survey-only imputations).

Suggested Citation

  • Bruce D. Meyer & Derek Wu & Grace Finley & Patrick Langetieg & Carla Medalia & Mark Payne & Alan Plumley, 2020. "The Accuracy of Tax Imputations: Estimating Tax Liabilities and Credits Using Linked Survey and Administrative Data," NBER Working Papers 28229, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:28229
    Note: LS PE
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w28229.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jeehoon Han & Bruce D. Meyer & James X. Sullivan, 2020. "Income and Poverty in the COVID-19 Pandemic," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 51(2 (Summer), pages 85-118.
    2. Larrimore, Jeff & Mortenson, Jacob & Splinter, David, 2023. "Earnings business cycles: The Covid recession, recovery, and policy response," Journal of Public Economics, Elsevier, vol. 225(C).
    3. Kevin C. Corinth & Jeff Larrimore, 2024. "Has Intergenerational Progress Stalled? Income Growth Over Five Generations of Americans," Finance and Economics Discussion Series 2024-007, Board of Governors of the Federal Reserve System (U.S.).
    4. Maggie R. Jones & Adam Bee & Amanda Eng & Kendall Houghton & Nikolas Pharris-Ciurej & Sonya R. Porter & Jonathan Rothbaum & John Voorheis, 2024. "Mobility, Opportunity, and Volatility Statistics (MOVS): Infrastructure Files and Public Use Data," Working Papers 24-23, Center for Economic Studies, U.S. Census Bureau.
    5. Iselin, John & Mackay, Taylor & Unrath, Matthew, 2023. "Measuring take-up of the California EITC with state administrative data," Journal of Public Economics, Elsevier, vol. 227(C).

    More about this item

    JEL classification:

    • C42 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Survey Methods
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • H20 - Public Economics - - Taxation, Subsidies, and Revenue - - - General
    • H24 - Public Economics - - Taxation, Subsidies, and Revenue - - - Personal Income and Other Nonbusiness Taxes and Subsidies
    • I32 - Health, Education, and Welfare - - Welfare, Well-Being, and Poverty - - - Measurement and Analysis of Poverty

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:28229. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.