IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-05084494.html

Dealing with Censored Earnings in Register Data

Author

Listed:
  • Mattis Beckmannshagen

    (DIW Berlin - Deutsches Institut für Wirtschaftsforschung)

  • Johannes König

    (DIW Berlin - Deutsche Institut für Wirtschaftsforschung = German Institute for Economic Research)

  • Isabella Retter

    (DIW Berlin - Deutsche Institut für Wirtschaftsforschung = German Institute for Economic Research)

  • Christian Schluter

    (AMSE - Aix-Marseille Sciences Economiques - EHESS - École des hautes études en sciences sociales - AMU - Aix Marseille Université - ECM - École Centrale de Marseille - CNRS - Centre National de la Recherche Scientifique, University of Southampton)

  • Carsten Schröder

    (DIW Berlin - Deutsches Institut für Wirtschaftsforschung)

  • Yogam Tchokni

    (DIW Berlin - Deutsche Institut für Wirtschaftsforschung = German Institute for Economic Research)

Abstract

Earnings are often top-coded (right-censored) in administrative registers. The censoring threshold in the case of Germany is the limit value for social security contributions, leading to a substantial fraction of censoring: For example, about 12 % of male workers in West Germany are affected, rising to above 30 % for highly educated prime-aged workers. This missing right tail of the earnings distribution constitutes a major problem for researchers studying earnings inequality and top incomes. We overcome this challenge by taking a distributional approach and semi-parametrically modelling the right tail as being Pareto-like. Non-censored earnings survey data matched to administrative records, derived from the SOEP-RV project, let us operate in a laboratory-like setting in which the targets are known. Our approach outperforms alternative imputation methods based on Tobit regressions.

Suggested Citation

  • Mattis Beckmannshagen & Johannes König & Isabella Retter & Christian Schluter & Carsten Schröder & Yogam Tchokni, 2025. "Dealing with Censored Earnings in Register Data," Post-Print hal-05084494, HAL.
  • Handle: RePEc:hal:journl:hal-05084494
    DOI: 10.1515/jbnst-2024-0037
    Note: View the original document on HAL open archive server: https://hal.science/hal-05084494v1
    as

    Download full text from publisher

    File URL: https://hal.science/hal-05084494v1/document
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jbnst-2024-0037?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Mattis Beckmannshagen & Carsten Schröder, 2022. "Changes in Working Hours Are Driving Earnings Inequality," DIW Weekly Report, DIW Berlin, German Institute for Economic Research, vol. 12(32/33/34), pages 195-201.
    2. David Card & Jörg Heining & Patrick Kline, 2013. "Workplace Heterogeneity and the Rise of West German Wage Inequality," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 128(3), pages 967-1015.
    3. Xavier Gabaix, 2016. "Power Laws in Economics: An Introduction," Journal of Economic Perspectives, American Economic Association, vol. 30(1), pages 185-206, Winter.
    4. Drechsler, Jörg & Ludsteck, Johannes, 2025. "Imputation strategies for rightcensored wages in longitudinal datasets," Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 59, pages 1-019.
    5. Christian Schluter & Mark Trede, 2019. "Size distributions reconsidered," Econometric Reviews, Taylor & Francis Journals, vol. 38(6), pages 695-710, July.
    6. repec:iab:iabjlr:v:54:p:art.10 is not listed on IDEAS
    7. Wildauer, Rafael & Kapeller, Jakob, 2022. "Tracing the invisible rich: A new approach to modelling Pareto tails in survey data," Labour Economics, Elsevier, vol. 75(C).
    8. Stephen P. Jenkins, 2017. "Pareto Models, Top Incomes and Recent Trends in UK Income Inequality," Economica, London School of Economics and Political Science, vol. 84(334), pages 261-289, April.
    9. Karlsson, Martin & Wang, Yulong & Ziebarth, Nicolas R., 2024. "Getting the right tail right: Modeling tails of health expenditure distributions," Journal of Health Economics, Elsevier, vol. 97(C).
    10. Christian Schluter, 2021. "On Zipf’s law and the bias of Zipf regressions," Empirical Economics, Springer, vol. 61(2), pages 529-548, August.
    11. Xavier Gabaix & Rustam Ibragimov, 2011. "Rank - 1 / 2: A Simple Way to Improve the OLS Estimation of Tail Exponents," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 24-39, January.
    12. Charlotte Bartels & Maria Metzing, 2019. "An integrated approach for a top-corrected income distribution," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 17(2), pages 125-143, June.
    13. Christian Dustmann & Johannes Ludsteck & Uta Schönberg, 2009. "Revisiting the German Wage Structure," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(2), pages 843-881.
    14. Christian Schluter, 2018. "Top Incomes, Heavy Tails, and Rank-Size Regressions," Econometrics, MDPI, vol. 6(1), pages 1-16, March.
    15. Beckmannshagen, Mattis & Schröder, Carsten, 2022. "Earnings inequality and working hours mismatch," Labour Economics, Elsevier, vol. 76(C).
    16. Wolfgang Dauth & Johann Eppelsheimer, 2020. "Preparing the sample of integrated labour market biographies (SIAB) for scientific analysis: a guide," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 54(1), pages 1-14, December.
    17. Joerg Drechsler & Johannes Ludsteck, 2025. "Imputation strategies for rightcensored wages in longitudinal datasets," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 59(1), pages 1-14, December.
    18. Gabaix, Xavier & Ibragimov, Rustam, 2011. "Rank − 1 / 2: A Simple Way to Improve the OLS Estimation of Tail Exponents," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 24-39.
    19. Christian Schluter & Mark Trede, 2024. "Spatial earnings inequality," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 22(3), pages 531-550, September.
    20. Johannes König & Christian Schluter & Carsten Schröder & Isabella Retter & Mattis Beckmannshagen, 2025. "The beyondpareto command for optimal extreme-value index estimation," Stata Journal, StataCorp LLC, vol. 25(1), pages 169-188, March.
    21. Timm Bönke & Giacomo Corneo & Holger Lüthen, 2015. "Lifetime Earnings Inequality in Germany," Journal of Labor Economics, University of Chicago Press, vol. 33(1), pages 171-208.
    22. repec:iab:iabjlr:v:54:i:1:p:art.10 is not listed on IDEAS
    23. Jorg Drechsler & Johannes Ludsteck, 2025. "Imputation Strategies for Rightcensored Wages in Longitudinal Datasets," Papers 2502.12967, arXiv.org.
    24. Dauth, Wolfgang & Eppelsheimer, Johann, 2020. "Preparing the sample of integrated labour market biographies (SIAB) for scientific analysis," Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 54, pages 1-010.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sona Badalyan, 2025. "Crowded Career Ladders? Intra-Firm Spillovers of Raised Retirement Age," CERGE-EI Working Papers wp810, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    2. Stüber, Heiko & Grabka, Markus M. & Schnitzlein, Daniel D., 2023. "A tale of two data sets: comparing German administrative and survey data using wage inequality as an example," Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 57, pages 1-008.
    3. Moritz Drechsel‐Grau & Andreas Peichl & Kai D. Schmid & Johannes F. Schmieder & Hannes Walz & Stefanie Wolter, 2022. "Inequality and income dynamics in Germany," Quantitative Economics, Econometric Society, vol. 13(4), pages 1593-1635, November.
    4. Ines Heck & Anna Hornykewycz & Jakob Kapeller & Rafael Wildauer, 2024. "Vermögensverteilung in Österreich: eine Analyse auf Basis des HFCS 2021/22," Working Paper Reihe der AK Wien - Materialien zu Wirtschaft und Gesellschaft 255, Kammer für Arbeiter und Angestellte für Wien, Abteilung Wirtschaftswissenschaft und Statistik.
    5. Charlotte Bartels & Carsten Schroeder, 2020. "The role of rental income, real estate and rents for inequality in Germany," Working Papers 7, Forum New Economy.
    6. Wildauer, Rafael & Heck, Ines & Kapeller, Jakob, 2023. "Was Pareto right? Is the distribution of wealth thick-tailed?," Greenwich Papers in Political Economy 38597, University of Greenwich, Greenwich Political Economy Research Centre.
    7. Chen, Zhimin & Ibragimov, Rustam, 2019. "One country, two systems? The heavy-tailedness of Chinese A- and H- share markets," Emerging Markets Review, Elsevier, vol. 38(C), pages 115-141.
    8. Judith Niehues & Maximilian Stockhausen & Andreas Peichl & Charlotte Bartels & Mario Bossler & Bernd Fitzenberger & Arnim Seidlitz & Moritz Kuhn & Till Baldenius & Sebastian Kohl & Moritz Schularick &, 2020. "Ungleichheit unter der Lupe – neue politische Antworten auf ein bekanntes Thema," ifo Schnelldienst, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, vol. 73(02), pages 03-26, February.
    9. Charlotte Bartels & Carsten Schroeder, 2020. "Income, consumption and wealth inequality in Germany: Three concepts, three stories?," Basic Papers 2, Forum New Economy.
    10. Julia Baarck & Moritz Bode & Andreas Peichl, 2025. "Rising Inequality, Declining Mobility: The Evolution of Intergenerational Mobility in Germany," Rationality and Competition Discussion Paper Series 550, CRC TRR 190 Rationality and Competition.
    11. Gabrielczak, Piotr & Serwach, Tomasz, . "Rozkład wielkości firm w Polsce - czy ma zastosowanie prawo potęgowe?," Gospodarka Narodowa-The Polish Journal of Economics, Szkoła Główna Handlowa w Warszawie / SGH Warsaw School of Economics, vol. 2021(2).
    12. Frank Cowell & Emmanuel Flachaire, 2021. "Inequality Measurement: Methods and Data," Post-Print hal-03589066, HAL.
    13. Timm Bönke & Holger Lüthen, 2020. "Die Ungleichheit von Lebenserwerbseinkommen," Wirtschaftsdienst, Springer;ZBW - Leibniz Information Centre for Economics, vol. 100(4), pages 241-245, April.
    14. Joerg Drechsler & Johannes Ludsteck, 2025. "Imputation strategies for rightcensored wages in longitudinal datasets," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 59(1), pages 1-14, December.
    15. Giesecke, Matthias & Bönke, Timm & Lüthen, Holger, 2011. "The Dynamics of Earnings in Germany: Evidence from Social Security Records," VfS Annual Conference 2011 (Frankfurt, Main): The Order of the World Economy - Lessons from the Crisis 48692, Verein für Socialpolitik / German Economic Association.
    16. Heiko Stüber & Wolfgang Dauth & Johann Eppelsheimer, 2023. "A guide to preparing the sample of integrated labour market biographies (SIAB, version 7519 v1) for scientific analysis," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 57(1), pages 1-11, December.
    17. Regina T. Riphahn & Irakli Sauer, 2024. "Earnings Assimilation of Post-Reunification East German Migrants in West Germany," CESifo Working Paper Series 11233, CESifo.
    18. Stefan Bach & Charlotte Bartels & Theresa Neef, 2025. "The Distribution of National Income in Germany, 1992-2019," SOEPpapers on Multidisciplinary Panel Data Research 1227, DIW Berlin, The German Socio-Economic Panel (SOEP).
    19. Julia Baarck & Moritz Bode & Andreas Peichl, 2025. "Rising Inequality, Declining Mobility: The Evolution of Intergenerational Mobility in Germany," CEBI working paper series 25-15, University of Copenhagen. Department of Economics. The Center for Economic Behavior and Inequality (CEBI).
    20. Jacopo Bassetto & Giuseppe Ippedico, 2024. "Tax incentives and return migration," Discussion Papers 2024-05, University of Nottingham, GEP.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • R12 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Size and Spatial Distributions of Regional Economic Activity; Interregional Trade (economic geography)

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-05084494. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.