IDEAS home Printed from https://ideas.repec.org/a/spr/astaws/v14y2020i3d10.1007_s11943-020-00275-8.html
   My bibliography  Save this article

Treatment of sample under-representation and skewed heavy-tailed distributions in survey-based microsimulation: An analysis of redistribution effects in compulsory health care insurance in Switzerland
[Methoden zur Behandlung von Unterrepräsentation bei schiefen Verteilungen für die stichprobenbasierte Mikrosimulation: Eine Analyse zu den Umverteilungseffekten in der obligatorischen Krankenversicherung der Schweiz]

Author

Listed:
  • Tobias Schoch

    (University of Applied Sciences Northwestern Switzerland)

  • André Müller

    (Ecoplan AG – Research in Economics and Policy Consultancy)

Abstract

The credibility of microsimulation modeling with the research community and policymakers depends on high-quality baseline surveys. Quality problems with the baseline survey tend to impair the quality of microsimulation built on top of the survey data. We address two potential issues that both relate to skewed and heavy-tailed distributions. First, we find that ultra-high-income households are under-represented in the baseline household survey. Moreover, the sample estimate of average income underestimates the known population average. Although the Deville–Särndal calibration method corrects the under-representation, it cannot achieve alignment of estimated average income in the right tail of the distribution with known population values without distorting the empirical income distribution. To overcome the problem, we introduce a Pareto tail model. With the help of the tail model, we can adjust the sample income distribution in the tail to meet the alignment targets. Our method can be a useful tool for microsimulation modelers working with survey income data. The second contribution refers to the treatment of an outlier-prone variable that has been added to the survey by record linkage (our empirical example is health care cost). The nature of the baseline survey is not affected by record linkage, that is, the baseline survey still covers only a small part of the population. Hence, the sampling weights are relatively large. An outlying observation together with a high sampling weight can heavily influence or even ruin an estimate of a population characteristic. Thus, we argue that it is beneficial—in terms of mean square error—to use robust estimation and alignment methods, because robust methods are less affected by the presence of outliers.

Suggested Citation

  • Tobias Schoch & André Müller, 2020. "Treatment of sample under-representation and skewed heavy-tailed distributions in survey-based microsimulation: An analysis of redistribution effects in compulsory health care insurance in Switzerland," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 14(3), pages 267-304, December.
  • Handle: RePEc:spr:astaws:v:14:y:2020:i:3:d:10.1007_s11943-020-00275-8
    DOI: 10.1007/s11943-020-00275-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11943-020-00275-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11943-020-00275-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. François Bourguignon & Amedeo Spadaro, 2006. "Microsimulation as a tool for evaluating redistribution policies," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 4(1), pages 77-106, April.
    2. Reto Foellmi & Isabel Z. Martínez, 2017. "Volatile Top Income Shares in Switzerland? Reassessing the Evolution between 1981 and 2010," The Review of Economics and Statistics, MIT Press, vol. 99(5), pages 793-809, December.
    3. Mary E. Thompson, 2019. "Combining Data from New and Traditional Sources in Population Surveys," International Statistical Review, International Statistical Institute, vol. 87(S1), pages 79-89, May.
    4. Anthony Atkinson & Thomas Piketty, 2007. "Top incomes over the twentieth century: A contrast between continental european and english-speaking countries," Post-Print halshs-00754859, HAL.
    5. Michal Myck & Mateusz Najsztub, 2015. "Data and Model Cross-validation to Improve Accuracy of Microsimulation Results: Estimates for the Polish Household Budget Survey," International Journal of Microsimulation, International Microsimulation Association, vol. 8(1), pages 33-66.
    6. Anthony B Atkinson & François Bourguignon, 2014. "Handbook of Income Distribution," PSE-Ecole d'économie de Paris (Postprint) halshs-02923231, HAL.
    7. Frank Cowell & Maria-Pia Victoria-Feser, 2007. "Robust stochastic dominance: A semi-parametric approach," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 5(1), pages 21-37, April.
    8. John Creedy & Ivan Tuckwell, 2004. "Reweighting Household Surveys for Tax Microsimulation Modelling: An Application to the New Zealand Household Economic Survey," Australian Journal of Labour Economics (AJLE), Bankwest Curtin Economics Centre (BCEC), Curtin Business School, vol. 7(1), pages 71-88, March.
    9. Atkinson, A. B. & Piketty, Thomas (ed.), 2007. "Top Incomes Over the Twentieth Century: A Contrast Between Continental European and English-Speaking Countries," OUP Catalogue, Oxford University Press, number 9780199286881.
    10. Deborah J Schofield & Melanie J B Zeppel & Owen Tan & Sharyn Lymer & Michelle M Cunich & Rupendra N Shrestha, 2018. "A Brief, Global History of Microsimulation Models in Health: Past Applications, Lessons Learned and Future Directions," International Journal of Microsimulation, International Microsimulation Association, vol. 11(1), pages 97-142.
    11. Anders Klevmarken, 2022. "Statistical Inference in Micro-simulation Models: Incorporating External Information," International Journal of Microsimulation, International Microsimulation Association, vol. 15(1), pages 111-120.
    12. Figari, Francesco & Paulus, Alari & Sutherland, Holly, 2014. "Microsimulation and policy analysis," ISER Working Paper Series 2014-23, Institute for Social and Economic Research.
    13. Andreas Alfons & Matthias Templ & Peter Filzmoser, 2013. "Robust estimation of economic indicators from survey samples based on Pareto tail modelling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(2), pages 271-286, March.
    14. Anton Korinek & Johan Mistiaen & Martin Ravallion, 2006. "Survey nonresponse and the distribution of income," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 4(1), pages 33-55, April.
    15. Luca Crivelli & Massimo Filippini & Ilaria Mosca, 2006. "Federalism and regional health care expenditures: an empirical analysis for the Swiss cantons," Health Economics, John Wiley & Sons, Ltd., vol. 15(5), pages 535-541, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arthur Charpentier & Emmanuel Flachaire, 2022. "Pareto models for top incomes and wealth," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 20(1), pages 1-25, March.
    2. Brzeziński, Michał & Myck, Michal & Najsztub, Mateusz, 2019. "Reevaluating Distributional Consequences of the Transition to Market Economy in Poland: New Results from Combined Household Survey and Tax Return Data," IZA Discussion Papers 12734, Institute of Labor Economics (IZA).
    3. Nora Lustig, 2019. "The “Missing Rich” in Household Surveys: Causes and Correction Approaches," Commitment to Equity (CEQ) Working Paper Series 75, Tulane University, Department of Economics.
    4. Nora Lustig, 2018. "Measuring the Distribution of Household Income, Consumption and Wealth: State of Play and Measurement Challenges," Working Papers 1801, Tulane University, Department of Economics.
    5. Frank A. Cowell & Philippe Kerm, 2015. "Wealth Inequality: A Survey," Journal of Economic Surveys, Wiley Blackwell, vol. 29(4), pages 671-710, September.
    6. Stephen P. Jenkins, 2017. "Pareto Models, Top Incomes and Recent Trends in UK Income Inequality," Economica, London School of Economics and Political Science, vol. 84(334), pages 261-289, April.
    7. Nora Lustig, 2020. "The ``missing rich'' in household surveys: causes and correction approaches," Working Papers 520, ECINEQ, Society for the Study of Economic Inequality.
    8. Ursina Kuhn, 2020. "Augmented wealth in Switzerland: the influence of pension wealth on wealth inequality," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 156(1), pages 1-16, December.
    9. Thomas Blanchet & Ignacio Flores & Marc Morgan, 2022. "The weight of the rich: improving surveys using tax data," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 20(1), pages 119-150, March.
    10. François Bourguignon, 2018. "Simple adjustments of observed distributions for missing income and missing people," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 16(2), pages 171-188, June.
    11. Ohlsson, Henry & Roine, Jesper & Waldenström, Daniel, 2014. "Inherited Wealth over the Path of Development: Sweden, 1810–2010," Working Paper Series 1033, Research Institute of Industrial Economics.
    12. Christoph Lakner & Branko Milanovic, 2016. "Global Income Distribution: From the Fall of the Berlin Wall to the Great Recession," The World Bank Economic Review, World Bank, vol. 30(2), pages 203-232.
    13. Channing Arndt & Kristi Mahrt, 2017. "Is inequality underestimated in Mozambique? Accounting for underreported consumption," WIDER Working Paper Series 153, World Institute for Development Economic Research (UNU-WIDER).
    14. Diego Winkelried & Bruno Escobar, 2022. "Declining inequality in Latin America? Robustness checks for Peru," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 20(1), pages 223-243, March.
    15. Frank A. Cowell, 2008. "Income Distribution and Inequality," Chapters, in: John B. Davis & Wilfred Dolfsma (ed.), The Elgar Companion to Social Economics, chapter 13, Edward Elgar Publishing.
    16. Prydz, Espen Beer & Jolliffe, Dean & Serajuddin, Umar, 2021. "Mind the Gap," GLO Discussion Paper Series 944, Global Labor Organization (GLO).
    17. Stephen P. Jenkins & John Micklewright, 2007. "New Directions in the Analysis of Inequality and Poverty," Discussion Papers of DIW Berlin 700, DIW Berlin, German Institute for Economic Research.
    18. Lidia Ceriani & Paolo Verme, 2022. "Population Changes and the Measurement of Inequality," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 162(2), pages 549-575, July.
    19. Xavier Jara & Nicolás Oliva, 2018. "Top income adjustments and tax reforms in Ecuador," WIDER Working Paper Series wp-2018-165, World Institute for Development Economic Research (UNU-WIDER).
    20. Nishant Yonzan & Branko Milanovic & Salvatore Morelli & Janet Gornick, 2022. "Drawing a Line: Comparing the Estimation of Top Incomes between Tax Data and Household Survey Data," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 20(1), pages 67-95, March.

    More about this item

    Keywords

    Simulation; Pareto distribution; Representative outliers; Nonresponse; Calibration; Imputation;
    All these keywords.

    JEL classification:

    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
    • C54 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Quantitative Policy Modeling
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques
    • C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
    • I13 - Health, Education, and Welfare - - Health - - - Health Insurance, Public and Private
    • I18 - Health, Education, and Welfare - - Health - - - Government Policy; Regulation; Public Health

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:astaws:v:14:y:2020:i:3:d:10.1007_s11943-020-00275-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.