IDEAS home Printed from https://ideas.repec.org/a/ses/arsjes/2016-i-3.html
   My bibliography  Save this article

A Simple Method for Predicting Distributions by Means of Covariates with Examples from Poverty and Health Economics

Author

Listed:
  • Jing Dai
  • Stefan Sperlich
  • Walter Zucchini

Abstract

We present an integration based procedure for predicting the distribution f of an indicator of interest in situations where, in addition to the sample data, one has access to covariates that are available for the entire population. The proposed method, based on similar ideas that have been used in the literature on policy evaluation, provides an alternative to existing simulation and imputation methods. It is very simple to apply, flexible, requires no additional assumptions, and does not involve the inclusion of artificial random terms. It therefore yields reproducible estimates and allows for valid inference. It also provides a tool for future predictions, scenarios and ex-ante impact evaluation. We illustrate our procedure by predicting income distributions in a case with sample selection, and both current and future doctor visits. We find our approach outperforms other commonly used procedures substantially.

Suggested Citation

  • Jing Dai & Stefan Sperlich & Walter Zucchini, 2016. "A Simple Method for Predicting Distributions by Means of Covariates with Examples from Poverty and Health Economics," Swiss Journal of Economics and Statistics (SJES), Swiss Society of Economics and Statistics (SSES), vol. 152(I), pages 49-80, March.
  • Handle: RePEc:ses:arsjes:2016-i-3
    as

    Download full text from publisher

    File URL: http://www.sjes.ch/papers/2016-I-3.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. DiNardo, John & Fortin, Nicole M & Lemieux, Thomas, 1996. "Labor Market Institutions and the Distribution of Wages, 1973-1992: A Semiparametric Approach," Econometrica, Econometric Society, vol. 64(5), pages 1001-1044, September.
    2. Horton N. J. & Lipsitz S. R., 2001. "Multiple Imputation in Practice: Comparison of Software Packages for Regression Models With Missing Variables," The American Statistician, American Statistical Association, vol. 55, pages 244-254, August.
    3. Alessandro Tarozzi & Angus Deaton, 2009. "Using Census and Survey Data to Estimate Poverty and Inequality for Small Areas," The Review of Economics and Statistics, MIT Press, vol. 91(4), pages 773-792, November.
    4. Philippe Van Kerm, 2013. "Generalized measures of wage differentials," Empirical Economics, Springer, vol. 45(1), pages 465-482, August.
    5. Melly, Blaise, 2005. "Decomposition of differences in distribution using quantile regression," Labour Economics, Elsevier, vol. 12(4), pages 577-590, August.
    6. Härdle, Wolfgang & Huet, Sylvie & Mammen, Enno & Sperlich, Stefan, 2004. "Bootstrap Inference In Semiparametric Generalized Additive Models," Econometric Theory, Cambridge University Press, vol. 20(2), pages 265-300, April.
    7. Juhn, Chinhui & Murphy, Kevin M & Pierce, Brooks, 1993. "Wage Inequality and the Rise in Returns to Skill," Journal of Political Economy, University of Chicago Press, vol. 101(3), pages 410-442, June.
    8. Stephen G. Donald & Yu‐Chin Hsu & Garry F. Barrett, 2012. "Incorporating covariates in the measurement of welfare and inequality: methods and applications," Econometrics Journal, Royal Economic Society, vol. 15(1), pages 1-30, February.
    9. Martin Biewen & Stephen Jenkins, 2005. "A framework for the decomposition of poverty differences with an application to poverty differences between countries," Empirical Economics, Springer, vol. 30(2), pages 331-358, September.
    10. Rothe, Christoph, 2010. "Nonparametric estimation of distributional policy effects," Journal of Econometrics, Elsevier, vol. 155(1), pages 56-70, March.
    11. Leonardo Gasparini & Mariana Marchionni & Federico Gutierrez, 2004. "Simulating Income Distribution Changes in Bolivia: a Microeconometric Approach," CEDLAS, Working Papers 0012, CEDLAS, Universidad Nacional de La Plata.
    12. Ravallion, Martin, 2001. "Growth, Inequality and Poverty: Looking Beyond Averages," World Development, Elsevier, vol. 29(11), pages 1803-1815, November.
    13. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    14. Sahn, David E. & Stifel, David C., 2000. "Poverty Comparisons Over Time and Across Countries in Africa," World Development, Elsevier, vol. 28(12), pages 2123-2155, December.
    15. Deon Filmer & Lant Pritchett, 2001. "Estimating Wealth Effects Without Expenditure Data—Or Tears: An Application To Educational Enrollments In States Of India," Demography, Springer;Population Association of America (PAA), vol. 38(1), pages 115-132, February.
    16. María José Lombardía & Stefan Sperlich, 2008. "Semiparametric inference in generalized mixed effects models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 913-930, November.
    17. Angela Noufaily & M. C. Jones, 2013. "Parametric quantile regression based on the generalized gamma distribution," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(5), pages 723-740, November.
    18. A.B. Atkinson & F. Bourguignon (ed.), 2000. "Handbook of Income Distribution," Handbook of Income Distribution, Elsevier, edition 1, volume 1, number 1.
    19. Hentschel, Jesko, et al, 2000. "Combining Census and Survey Data to Trace the Spatial Dimensions of Poverty: A Case Study of Ecuador," The World Bank Economic Review, World Bank, vol. 14(1), pages 147-165, January.
    20. Stefan Sperlich, 2014. "On the choice of regularization parameters in specification testing: a critical discussion," Empirical Economics, Springer, vol. 47(2), pages 427-450, September.
    21. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Inés Barbeito & Ricardo Cao & Stefan Sperlich, 2023. "Bandwidth selection for statistical matching and prediction," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 418-446, March.
    2. José María Sarabia & Faustino Prieto & Vanesa Jordá & Stefan Sperlich, 2020. "A Note on Combining Machine Learning with Statistical Modeling for Financial Data Analysis," Risks, MDPI, vol. 8(2), pages 1-14, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jing Dai & Stefan Sperlich & Walter Zucchini, 2011. "Estimating and Predicting Household Expenditures and Income Distributions," MAGKS Papers on Economics 201147, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    2. Denisa M. Sologon & Philippe Kerm & Jinjing Li & Cathal O’Donoghue, 2021. "Accounting for differences in income inequality across countries: tax-benefit policy, labour market structure, returns and demographics," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 19(1), pages 13-43, March.
    3. Philippe Van Kerm & Seunghee Yu & Chung Choe, 2016. "Decomposing quantile wage gaps: a conditional likelihood approach," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(4), pages 507-527, August.
    4. Sonja C. Kassenboehmer & Mathias G. Sinning, 2014. "Distributional Changes in the Gender Wage Gap," ILR Review, Cornell University, ILR School, vol. 67(2), pages 335-361, April.
    5. Ekaterina Selezneva & Philippe Van Kerm, 2016. "A distribution-sensitive examination of the gender wage gap in Germany," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 14(1), pages 21-40, March.
    6. SOLOGON Denisa & ALMEIDA Vanda & VAN KERM Philippe, 2019. "Accounting for the distributional effects of the 2007-2008 crisis and the Economic Adjustment Program in Portugal," LISER Working Paper Series 2019-05, Luxembourg Institute of Socio-Economic Research (LISER).
    7. Leonardo Bonilla Mejía, 2011. "Diferencias regionales en la distribución del ingreso en Colombia," Revista Sociedad y Economía, Universidad del Valle, CIDSE, December.
    8. Nerijus Černiauskas & Denisa M. Sologon & Cathal O’Donoghue & Linas Tarasonis, 2022. "Income Inequality and Redistribution in Lithuania: The Role of Policy, Labor Market, Income, and Demographics," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 68(S1), pages 131-166, April.
    9. Dang,Hai-Anh H. & Lanjouw,Peter F. & Serajuddin,Umar & Dang,Hai-Anh H. & Lanjouw,Peter F. & Serajuddin,Umar, 2014. "Updating poverty estimates at frequent intervals in the absence of consumption data : methods and illustration with reference to a middle-income country," Policy Research Working Paper Series 7043, The World Bank.
    10. Ekaterina Selezneva & Philippe Van Kerm, 2013. "Inequality-adjusted gender wage differentials in Germany," Working Papers 334, Leibniz Institut für Ost- und Südosteuropaforschung (Institute for East and Southeast European Studies).
    11. Sloczynski, Tymon, 2018. "Average Gaps and Oaxaca's Blinder Decompositions: A Cautionary Tale about Regression Estimates of Racial Differences in Labor Market Outcomes," IZA Discussion Papers 12041, Institute of Labor Economics (IZA).
    12. Fortin, Nicole & Lemieux, Thomas & Firpo, Sergio, 2011. "Decomposition Methods in Economics," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 4, chapter 1, pages 1-102, Elsevier.
    13. SOLOGON Denisa & VAN KERM Philippe & LI Jinjing & O'DONOGHUE Cathal, 2018. "Accounting for Differences in Income Inequality across Countries: Ireland and the United Kingdom," LISER Working Paper Series 2018-01, Luxembourg Institute of Socio-Economic Research (LISER).
    14. Makarski, Krzysztof & Tyrowicz, Joanna, 2023. "Preference for redistribution during structural change with labor mobility frictions," European Journal of Political Economy, Elsevier, vol. 77(C).
    15. VAN KERM Philippe & YU Seunghee & CHOE Chung, 2014. "Wage differentials between native, immigrant and cross-border workers: Evidence and model comparisons," LISER Working Paper Series 2014-05, Luxembourg Institute of Socio-Economic Research (LISER).
    16. Ghosh, Pallab Kumar, 2014. "The contribution of human capital variables to changes in the wage distribution function," Labour Economics, Elsevier, vol. 28(C), pages 58-69.
    17. Brantly Callaway & Weige Huang, 2020. "Distributional Effects of a Continuous Treatment with an Application on Intergenerational Mobility," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 82(4), pages 808-842, August.
    18. Sergio P. Firpo & Nicole M. Fortin & Thomas Lemieux, 2018. "Decomposing Wage Distributions Using Recentered Influence Function Regressions," Econometrics, MDPI, vol. 6(2), pages 1-40, May.
    19. Nerijus Cerniauskas & Denisa M. Sologon & Cathal O’Donoghue & Linas Tarasonis, 2020. "Changes in income inequality in Lithuania: the role of policy, labour market structure, returns and demographics," Bank of Lithuania Working Paper Series 71, Bank of Lithuania.
    20. Essama-Nssah, B., 2012. "Identification of sources of variation in poverty outcomes," Policy Research Working Paper Series 5954, The World Bank.

    More about this item

    Keywords

    predicting distributions; missing values; household expenditures; income distribution; health economics; impact evaluation;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
    • C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics
    • I32 - Health, Education, and Welfare - - Welfare, Well-Being, and Poverty - - - Measurement and Analysis of Poverty
    • I15 - Health, Education, and Welfare - - Health - - - Health and Economic Development

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ses:arsjes:2016-i-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kurt Schmidheiny (email available below). General contact details of provider: https://edirc.repec.org/data/sgvssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.