Missing Income Data in the German SOEP: Incidence, Imputation and its Impact on the Income Distribution
AbstractThis paper deals with the question of selectivity of missing data on income questions in large panel surveys due to item-non-response and with imputation as one alternative strategy to cope with this issue. In contrast to cross-section surveys, the imputation of missing values in panel data can profit from longitudinal information which is available for the very same observation units from other points in time. The "row-and-column imputation procedure" developed by Little & Su (1989) considers longitudinal as well as cross-sectional information in the imputation process. This procedure is applied to the German Socio-Economic Panel study (SOEP) when deriving annual income variables, complemented by purely cross-sectional techniques. Based on the SOEP, our empirical work starts with a description of the overall incidence of imputation and its relevance given by imputed income as a percentage share of the total income mass: e.g. while 21 % of all observations have at least one missing income component of their pre-tax post-transfer income, 9 % of the overall income mass is imputed. However, this picture varies considerably for more recent sub-samples of the panel survey. Secondly, we analyze the respective impact of imputation on the personal distribution of income as well as on results of income mobility. When comparing income inequality measures based only on truly observed information to those derived from all (i.e., observed and imputed) observations, we find an increase in inequality due to imputation and this effect appears to be relevant in both tails of the distribution, although somewhat more prominent among higher incomes. Longitudinal analyses show firstly a positive correlation of item-non-response on income data over time, but also provide evidence of item-non-response as being a predictor of subsequent unit-non-response. Applying various income mobility indicators there is a robust picture about income mobility being understated using truly observed information only. Finally, multivariate models show that survey-related factors (number of interviews, interview mode) as well as indicators for variability in income receipt (due to increased complexity of household structure and income composition) are significantly correlated with item-non-response. In conclusion, our empirical results based on the German SOEP indicate the selectivity of item-non-response on income questions in social surveys and push the necessity for adequate imputation.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by DIW Berlin, German Institute for Economic Research in its series Discussion Papers of DIW Berlin with number 376.
Length: 35 p.
Date of creation: 2003
Date of revision:
Item-Non-Response; Imputation; Income Inequality;
Find related papers by JEL classification:
- C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
- D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution
- I32 - Health, Education, and Welfare - - Welfare, Well-Being, and Poverty - - - Measurement and Analysis of Poverty
This paper has been announced in the following NEP Reports:
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Fields, Gary S & Ok, Efe A, 1999. "Measuring Movement of Incomes," Economica, London School of Economics and Political Science, vol. 66(264), pages 455-71, November.
- Jürgen Schupp & Gert G. Wagner, 2002. "Maintenance of and Innovation in Long-Term Panel Studies: The Case of the German Socio-Economic Panel (GSOEP)," Discussion Papers of DIW Berlin 276, DIW Berlin, German Institute for Economic Research.
- Regina Riphahn & Oliver Serfling, 2005.
"Item non-response on income and wealth questions,"
Springer, vol. 30(2), pages 521-538, 09.
- Daniel H. Hill & Robert J. Willis, 2001. "Reducing Panel Attrition: A Search for Effective Policy Instruments," Journal of Human Resources, University of Wisconsin Press, vol. 36(3), pages 416-438.
- Landau, Katja & Klasen, Stephan & Zucchini, Walter, 2012.
"Measuring Vulnerability to Poverty Using Long-Term Panel Data,"
Annual Conference 2012 (Goettingen): New Approaches and Challenges for the Labor Market of the 21st Century
66057, Verein für Socialpolitik / German Economic Association.
- Katja Landau & Stephan Klasen & Walter Zucchini, 2012. "Measuring Vulnerability to Poverty Using Long-Term Panel Data," SOEPpapers on Multidisciplinary Panel Data Research 481, DIW Berlin, The German Socio-Economic Panel (SOEP).
- Katja Landau & Stephan Klasen & Walter Zucchini, 2012. "Measuring Vulnerability to Poverty Using Long-Term Panel Data," Courant Research Centre: Poverty, Equity and Growth - Discussion Papers 118, Courant Research Centre PEG.
- Susanne Rässler & Regina Riphahn, 2006. "Survey item nonresponse and its treatment," AStA Advances in Statistical Analysis, Springer, vol. 90(1), pages 217-232, March.
- Viktor Steiner & Peter Haan & Katharina Wrohlich, 2005. "Dokumentation des Steuer-Transfer-Mikrosimulationsmodells STSM 1999 - 2002," Data Documentation 9, DIW Berlin, German Institute for Economic Research.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Bibliothek).
If references are entirely missing, you can add them using this form.