IDEAS home Printed from https://ideas.repec.org/a/bla/obuest/v83y2021i1p66-87.html
   My bibliography  Save this article

Imputing Top‐Coded Income Data in Longitudinal Surveys

Author

Listed:
  • Li Tan

Abstract

The incomes of top earners are typically top‐coded in survey data. I show that the accuracy of imputed income values for top earners in longitudinal surveys can be improved significantly by incorporating information from multiple time periods into the imputation process in a simple way. Moreover, I introduce an innovative, nonparametric empirical Bayes imputation method that further improves imputation quality. I show that the empirical Bayes imputation method reduces the RMSE of imputed income values by 19–51% relative to standard approaches in the literature. I also illustrate the benefits of the empirical Bayes method for investigating multi‐year income inequality.

Suggested Citation

  • Li Tan, 2021. "Imputing Top‐Coded Income Data in Longitudinal Surveys," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 83(1), pages 66-87, February.
  • Handle: RePEc:bla:obuest:v:83:y:2021:i:1:p:66-87
    DOI: 10.1111/obes.12400
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/obes.12400
    Download Restriction: no

    File URL: https://libkey.io/10.1111/obes.12400?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Richard Burkhauser & Shuaizhang Feng & Stephen Jenkins & Jeff Larrimore, 2009. "Recent Trends in Top Income Shares in the USA: Reconciling Estimates from March CPS and IRS Tax Return Data," Working Papers 09-26, Center for Economic Studies, U.S. Census Bureau.
    2. John S. Heywood & Patrick L. O’Halloran, 2005. "Racial Earnings Differentials and Performance Pay," Journal of Human Resources, University of Wisconsin Press, vol. 40(2).
    3. Steven Haider & Gary Solon, 2006. "Life-Cycle Variation in the Association between Current and Lifetime Earnings," American Economic Review, American Economic Association, vol. 96(4), pages 1308-1320, September.
    4. Bhuller, Manudeep & Mogstad, Magne & Salvanes, Kjell G., 2011. "Life-Cycle Bias and the Returns to Schooling in Current and Lifetime Earnings," IZA Discussion Papers 5788, Institute of Labor Economics (IZA).
    5. Orazio Attanasio & Luigi Pistaferri, 2014. "Consumption Inequality over the Last Half Century: Some Evidence Using the New PSID Consumption Measure," American Economic Review, American Economic Association, vol. 104(5), pages 122-126, May.
    6. Shin, Donggyun & Solon, Gary, 2011. "Trends in men's earnings volatility: What does the Panel Study of Income Dynamics show?," Journal of Public Economics, Elsevier, vol. 95(7-8), pages 973-982, August.
    7. Keisuke Hirano, 2002. "Semiparametric Bayesian Inference in Autoregressive Panel Data Models," Econometrica, Econometric Society, vol. 70(2), pages 781-799, March.
    8. Christopher R. Bollinger & Barry T. Hirsch, 2006. "Match Bias from Earnings Imputation in the Current Population Survey: The Case of Imperfect Matching," Journal of Labor Economics, University of Chicago Press, vol. 24(3), pages 483-520, July.
    9. Stephen P. Jenkins & Richard V. Burkhauser & Shuaizhang Feng & Jeff Larrimore, 2011. "Measuring inequality using censored data: a multiple‐imputation approach to estimation and inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 174(1), pages 63-81, January.
    10. Philip Armour & Richard V. Burkhauser & Jeff Larrimore, 2016. "Using The Pareto Distribution To Improve Estimates Of Topcoded Earnings," Economic Inquiry, Western Economic Association International, vol. 54(2), pages 1263-1273, April.
    11. Matteo Richiardi & Ambra Poggi, 2014. "Imputing Individual Effects in Dynamic Microsimulation Models. An application to household formation and labour market participation in Italy," International Journal of Microsimulation, International Microsimulation Association, vol. 7(2), pages 3-39.
    12. Peter H. Friesen & Danny Miller, 1983. "Annual Inequality and Lifetime Inequality," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 98(1), pages 139-155.
    13. Thomas Lemieux, 2006. "Increasing Residual Wage Inequality: Composition Effects, Noisy Data, or Rising Demand for Skill?," American Economic Review, American Economic Association, vol. 96(3), pages 461-498, June.
    14. Feng, Shuaizhang & Burkhauser, Richard V. & Butler, J.S., 2006. "Levels and Long-Term Trends in Earnings Inequality: Overcoming Current Population Survey Censoring Problems Using the GB2 Distribution," Journal of Business & Economic Statistics, American Statistical Association, vol. 24, pages 57-62, January.
    15. Shane T. Jensen & Stephen H. Shore, 2015. "Changes in the Distribution of Earnings Volatility," Journal of Human Resources, University of Wisconsin Press, vol. 50(3), pages 811-836.
    16. NICHOLS Austin, 2008. "Trends in Income Inequality, Volatility and Mobility Risk," IRISS Working Paper Series 2008-10, IRISS at CEPS/INSTEAD.
    17. David H. Autor & Lawrence F. Katz & Melissa S. Kearney, 2008. "Trends in U.S. Wage Inequality: Revising the Revisionists," The Review of Economics and Statistics, MIT Press, vol. 90(2), pages 300-323, May.
    18. Barry T. Hirsch & Edward J. Schumacher, 2004. "Match Bias in Wage Gap Estimates Due to Earnings Imputation," Journal of Labor Economics, University of Chicago Press, vol. 22(3), pages 689-722, July.
    19. Mary C. Daly & Robert G. Valletta, 2006. "Inequality and Poverty in United States: The Effects of Rising Dispersion of Men's Earnings and Changing Family Behaviour," Economica, London School of Economics and Political Science, vol. 73(289), pages 75-98, February.
    20. Richard V. Burkhauser & Shuaizhang Feng & Stephen P. Jenkins & Jeff Larrimore, 2012. "Recent Trends in Top Income Shares in the United States: Reconciling Estimates from March CPS and IRS Tax Return Data," The Review of Economics and Statistics, MIT Press, vol. 94(2), pages 371-388, May.
    21. Jiaying Gu & Roger Koenker, 2017. "Unobserved Heterogeneity in Income Dynamics: An Empirical Bayes Perspective," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(1), pages 1-16, January.
    22. Robert F. Bordley & James B. McDonald & Anand Mantrala, 1997. "Something New, Something Old: Parametric Models for the Size of Distribution of Income," Journal of Income Distribution, Ad libros publications inc., vol. 6(1), pages 5-5, June.
    23. Molly Dahl & Thomas DeLeire & Jonathan A. Schwabish, 2011. "Estimates of Year-to-Year Volatility in Earnings and in Household Incomes from Administrative, Survey, and Matched Data," Journal of Human Resources, University of Wisconsin Press, vol. 46(4), pages 750-774.
    24. Stephen P. Jenkins, 2009. "Distributionally‐Sensitive Inequality Indices And The Gb2 Income Distribution," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 55(2), pages 392-398, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Clementi,Fabio & Fabiani,Michele & Molini,Vasco & Schettino,Francesco, 2022. "Is Inequality Systematically Underestimated in Sub-Saharan Africa ? A Proposal toOvercome the Problem," Policy Research Working Paper Series 10058, The World Bank.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bradley Hardy & James P. Ziliak, 2014. "Decomposing Trends In Income Volatility: The “Wild Ride” At The Top And Bottom," Economic Inquiry, Western Economic Association International, vol. 52(1), pages 459-476, January.
    2. Richard Burkhauser & Shuaizhang Feng & Stephen Jenkins & Jeff Larrimore, 2011. "Estimating trends in US income inequality using the Current Population Survey: the importance of controlling for censoring," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 9(3), pages 393-415, September.
    3. João Nicolau & Pedro Raposo & Paulo M. M. Rodrigues, 2023. "Measuring wage inequality under right censoring," Economic Inquiry, Western Economic Association International, vol. 61(2), pages 377-401, April.
    4. Blundell, Richard & Joyce, Robert & Norris Keiller, Agnes & Ziliak, James P., 2018. "Income inequality and the labour market in Britain and the US," Journal of Public Economics, Elsevier, vol. 162(C), pages 48-62.
    5. Philip Armour & Richard V. Burkhauser & Jeff Larrimore, 2016. "Using The Pareto Distribution To Improve Estimates Of Topcoded Earnings," Economic Inquiry, Western Economic Association International, vol. 54(2), pages 1263-1273, April.
    6. Nora Lustig, 2020. "The ``missing rich'' in household surveys: causes and correction approaches," Working Papers 520, ECINEQ, Society for the Study of Economic Inequality.
    7. Vanesa Jorda & José María Sarabia & Markus Jäntti, 2021. "Inequality measurement with grouped data: Parametric and non‐parametric methods," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 964-984, July.
    8. Nora Lustig, 2019. "The “Missing Rich” in Household Surveys: Causes and Correction Approaches," Commitment to Equity (CEQ) Working Paper Series 75, Tulane University, Department of Economics.
    9. Ellis Scharfenaker, Markus P.A. Schneider, 2019. "Labor Market Segmentation and the Distribution of Income: New Evidence from Internal Census Bureau Data," Working Paper Series, Department of Economics, University of Utah 2019_08, University of Utah, Department of Economics.
    10. Vanesa Jorda & Jos Mar a Sarabia & Markus J ntti, 2020. "Estimation of Income Inequality from Grouped Data," LIS Working papers 804, LIS Cross-National Data Center in Luxembourg.
    11. Vladimir Hlasny & Paolo Verme, 2022. "The Impact of Top Incomes Biases on the Measurement of Inequality in the United States," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(4), pages 749-788, August.
    12. Vladimir Hlasny, 2021. "Parametric representation of the top of income distributions: Options, historical evidence, and model selection," Journal of Economic Surveys, Wiley Blackwell, vol. 35(4), pages 1217-1256, September.
    13. Markus P. A. Schneider, 2013. "Race & Gender Differences in the Experience of Earnings Inequality in the US from 1995 to 2010," Working Papers 1303, New School for Social Research, Department of Economics.
    14. Markus Schneider, 2013. "Illustrating the Implications of How Inequality is Measured: Decomposing Earnings Inequality by Race and Gender," Journal of Labor Research, Springer, vol. 34(4), pages 476-514, December.
    15. Botosaru, Irene, 2023. "Time-varying unobserved heterogeneity in earnings shocks," Journal of Econometrics, Elsevier, vol. 235(2), pages 1378-1393.
    16. Ivan Kitov & Oleg Kitov, 2015. "Gender income disparity in the USA: analysis and dynamic modelling," Papers 1510.02752, arXiv.org.
    17. Bollinger, Christopher R. & Hirsch, Barry & Hokayem, Charles M. & Ziliak, James P., 2018. "Trouble in the Tails? What We Know about Earnings Nonresponse Thirty Years after Lillard, Smith, and Welch," IZA Discussion Papers 11710, Institute of Labor Economics (IZA).
    18. James P. Ziliak & Charles Hokayem & Christopher R. Bollinger, 2022. "Trends in Earnings Volatility Using Linked Administrative and Survey Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 12-19, December.
    19. Luis Ayala & Ana Pérez & Mercedes Prieto-Alaiz, 2022. "The impact of different data sources on the level and structure of income inequality," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 13(3), pages 583-611, September.
    20. Bradley Hardy, 2014. "Childhood Income Volatility and Adult Outcomes," Demography, Springer;Population Association of America (PAA), vol. 51(5), pages 1641-1665, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:obuest:v:83:y:2021:i:1:p:66-87. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/sfeixuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.