IDEAS home Printed from https://ideas.repec.org/a/mof/journl/ppr22_01_03.html

Evaluating Tax Return Data as a Source of Income Statistics

Author

Listed:
  • Unayama Takashi

    (Professor at Institute of Economic Research, Kyoto University)

  • Sano Shinpei

    (Professor, Graduate School of Economics, Kobe University)

  • Yugami Kazufumi

    (Professor, Graduate School of Economics, Kobe University)

  • Inaba Kazuhiro

    (National Tax College)

Abstract

This study evaluates the usefulness of income tax return data, which is newly available in Japan, as a source of income statistics, focusing on its characteristics, advantages, and limitations. The large scale of the data and its accuracy constitute major advantages, but they exclude individuals who are not required to file tax returns as well as income exempt from taxation, which limit their ability to capture the whole distribution of income. Approximately 23 million individuals file tax returns each year, with about 5 million additional individuals reported as dependents. Together, these groups account for only about 30 percent of the total population aged 15 and over. In particular, most wage earners and pensioners do not file returns, necessitating the use of supplementary information. Among wage earners and pensioners who do file returns, many do so to claim deductions such as the medical expense deduction or the tax credits for mortgage loan interest.Given the nature of these institutions, filers are supposed to be in distinct economic conditions—such as facing illness or having recently purchased a home—which indicates a potential selection bias within this group. By contrast, for self-employed individuals, the tax return data provide comprehensive information. The number of business income filers closely aligns with the self-employed population reported in statistical surveys, and the total reported income is broadly consistent with the national accounts data. Since more than half of self-employed filers submit returns almost every year, the tax return data are highly valuable as a panel dataset for studying the self-employed. This study also evaluates the tax return data as a source about information on top-income individuals. For the super-rich—those in the top 0.1 percent of the population—coverage is almost exhaustive. However, when extending the focus to the top 1 percent, the presence of individuals who do not file returns becomes non-negligible. Moreover, due to the Separate Withholding Taxation system on interest and dividend income, the coverage would be less complete, which likely results in an underestimation of top-income individuals. Future challenges include the integration of tax return data with other datasets on wage earners and pensioners to construct a more comprehensive database of incomes. To understand the entire distribution of income, comparison between tax return data and existing statistical surveys is necessary.

Suggested Citation

  • Unayama Takashi & Sano Shinpei & Yugami Kazufumi & Inaba Kazuhiro, 2026. "Evaluating Tax Return Data as a Source of Income Statistics," Public Policy Review, Policy Research Institute, Ministry of Finance Japan, vol. 22(1), pages 1-37, March.
  • Handle: RePEc:mof:journl:ppr22_01_03
    DOI: 10.57520/prippr.22-1-3
    as

    Download full text from publisher

    File URL: https://doi.org/10.57520/prippr.22-1-3
    Download Restriction: no

    File URL: https://libkey.io/10.57520/prippr.22-1-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution
    • E01 - Macroeconomics and Monetary Economics - - General - - - Measurement and Data on National Income and Product Accounts and Wealth; Environmental Accounts
    • H2 - Public Economics - - Taxation, Subsidies, and Revenue

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:mof:journl:ppr22_01_03. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Policy Research Institute (email available below). General contact details of provider: https://edirc.repec.org/data/prigvjp.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.