IDEAS home Printed from https://ideas.repec.org/p/zbw/fubsbe/201527.html
   My bibliography  Save this paper

Kernel density estimation for heaped data

Author

Listed:
  • Groß, Marcus
  • Rendtel, Ulrich

Abstract

In self-reported data usually a phenomenon called 'heaping' occurs, i.e. survey participants round the values of their income, weight or height to some degree. Additionally, respondents may be more prone to round off or up due to social desirability. By ignoring the heaping process a severe bias in terms of spikes and bumps is introduced when applying kernel density methods naively to the rounded data. A generalized Stochastic Expectation Maximization (SEM) approach accounting for heaping with potentially asymmetric rounding behaviour in univariate kernel density estimation is presented in this work. The introduced methods are applied to survey data of the German Socio-Economic Panel and exhibit very good performance simulations.

Suggested Citation

  • Groß, Marcus & Rendtel, Ulrich, 2015. "Kernel density estimation for heaped data," Discussion Papers 2015/27, Free University Berlin, School of Business & Economics.
  • Handle: RePEc:zbw:fubsbe:201527
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/117333/1/834585685.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jan Marcus & Rainer Siegers & Markus M. Grabka, 2013. "Preparation of Data from the New SOEP Consumption Module: Editing, Imputation, and Smoothing," Data Documentation 70, DIW Berlin, German Institute for Economic Research.
    2. Pudney, Stephen, 2008. "Heaping and leaping: survey response behaviour and the dynamics of self-reported consumption expenditure," ISER Working Paper Series 2008-09, Institute for Social and Economic Research.
    3. Erich Battistin & Raffaele Miniaci & Guglielmo Weber, 2003. "What Do We Learn from Recall Consumption Data?," Journal of Human Resources, University of Wisconsin Press, vol. 38(2).
    4. Groß, Marcus & Rendtel, Ulrich & Schmid, Timo & Schmon, Sebastian & Tzavidis, Nikos, 2015. "Estimating the density of ethnic minorities and aged people in Berlin: Multivariate kernel density estimation applied to sensitive geo-referenced administrative data protected via measurement error," Discussion Papers 2015/7, Free University Berlin, School of Business & Economics.
    5. Zhang, Xibin & King, Maxwell L. & Hyndman, Rob J., 2006. "A Bayesian approach to bandwidth selection for multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 3009-3031, July.
    6. Steffen Otterbach & Alfonso Sousa-Poza, 2010. "How Accurate are German Work-time Data? A Comparison of Time-diary Reports and Stylized Estimates," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 97(3), pages 325-339, July.
    7. repec:mpr:mprres:6195 is not listed on IDEAS
    8. Gert G. Wagner & Joachim R. Frick & Jürgen Schupp, 2007. "The German Socio-Economic Panel Study (SOEP) – Scope, Evolution and Enhancements," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 127(1), pages 139-169.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael C. Knaus & Steffen Otterbach, 2019. "Work Hour Mismatch And Job Mobility: Adjustment Channels And Resolution Rates," Economic Inquiry, Western Economic Association International, vol. 57(1), pages 227-242, January.
    2. Loeffler, Max & Peichl, Andreas & Pestel, Nico & Siegloch, Sebastian & Sommer, Eric, 2014. "Documentation IZA?MOD v3.0: The IZA Policy Simulation Model," IZA Discussion Papers 8553, Institute of Labor Economics (IZA).
    3. Stefan Bach & Martin Beznoska & Viktor Steiner, 2016. "An Integrated Micro Data Base for Tax Analysis in Germany," Data Documentation 86, DIW Berlin, German Institute for Economic Research.
    4. Marcus Groß & Ulrich Rendtel & Timo Schmid & Sebastian Schmon & Nikos Tzavidis, 2017. "Estimating the density of ethnic minorities and aged people in Berlin: multivariate kernel density estimation applied to sensitive georeferenced administrative data protected via measurement error," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(1), pages 161-183, January.
    5. Thomas F. Crossley & Joachim K. Winter, 2014. "Asking Households about Expenditures: What Have We Learned?," NBER Chapters, in: Improving the Measurement of Consumer Expenditures, pages 23-50, National Bureau of Economic Research, Inc.
    6. Nico Pestel & Eric Sommer, 2017. "Shifting Taxes from Labor to Consumption: More Employment and more Inequality?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 63(3), pages 542-563, September.
    7. Bettina Sonnenberg & Michaela Riediger & Cornelia Wrzus & Gert G. Wagner, 2011. "Measuring Time Use in Surveys: How Valid Are Time Use Questions in Surveys? Concordance of Survey and Experience Sampling Measures," SOEPpapers on Multidisciplinary Panel Data Research 390, DIW Berlin, The German Socio-Economic Panel (SOEP).
    8. Byung-hill Jun & Hosin Song, 2019. "Tests for Detecting Probability Mass Points," Korean Economic Review, Korean Economic Association, vol. 35, pages 205-248.
    9. Cabane, Charlotte & Hille, Adrian & Lechner, Michael, 2015. "Mozart or Pelé? The effects of teenagers’ participation in music and sports," Economics Working Paper Series 1509, University of St. Gallen, School of Economics and Political Science.
    10. Heineck, Guido & Süssmuth, Bernd, 2013. "A different look at Lenin’s legacy: Social capital and risk taking in the Two Germanies," Journal of Comparative Economics, Elsevier, vol. 41(3), pages 789-803.
    11. Kemptner, Daniel & Tolan, Songül, 2018. "The role of time preferences in educational decision making," Economics of Education Review, Elsevier, vol. 67(C), pages 25-39.
    12. Eibich, Peter & Siedler, Thomas, 2020. "Retirement, intergenerational time transfers, and fertility," European Economic Review, Elsevier, vol. 124(C).
    13. Geyer, Johannes & Haan, Peter & Wrohlich, Katharina, 2015. "The effects of family policy on maternal labor supply: Combining evidence from a structural model and a quasi-experimental approach," Labour Economics, Elsevier, vol. 36(C), pages 84-98.
    14. Johannes Abeler & Armin Falk & Fabian Kosse, 2021. "Malleability of Preferences for Honesty," CESifo Working Paper Series 9033, CESifo.
    15. Luis Aguiar & Jörg Claussen & Christian Peukert, 2018. "Catch Me If You Can: Effectiveness and Consequences of Online Copyright Enforcement," Information Systems Research, INFORMS, vol. 29(3), pages 656-678, September.
    16. Fossen, Frank M. & König, Johannes, 2015. "Public health insurance and entry into self-employment," VfS Annual Conference 2015 (Muenster): Economic Development - Theory and Policy 112934, Verein für Socialpolitik / German Economic Association.
    17. Marco Caliendo & Frank M. Fossen & Alexander Kritikos & Miriam Wetter, 2015. "The Gender Gap in Entrepreneurship: Not just a Matter of Personality," CESifo Economic Studies, CESifo Group, vol. 61(1), pages 202-238.
    18. Shvartsman, Elena & Beckmann, Michael, 2015. "Stressed by your job: What is the role of personnel policy?," Working papers 2015/15, Faculty of Business and Economics - University of Basel.
    19. Stefanie Sperlich & Frauke-Marie Adler & Johannes Beller & Batoul Safieddine & Juliane Tetzlaff & Fabian Tetzlaff & Siegfried Geyer, 2022. "Getting Better or Getting Worse? A Population-Based Study on Trends in Self-Rated Health among Single Mothers in Germany between 1994 and 2018," IJERPH, MDPI, vol. 19(5), pages 1-13, February.
    20. Sarah Flèche & Richard Layard, 2017. "Do More of Those in Misery Suffer from Poverty, Unemployment or Mental Illness?," Kyklos, Wiley Blackwell, vol. 70(1), pages 27-41, February.

    More about this item

    Keywords

    Heaping; Survey Data; Measurement error; Self-reported data; Kernel density estimation; Rounded data;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:fubsbe:201527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/fwfubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.