IDEAS home Printed from https://ideas.repec.org/p/hal/wpaper/halshs-01115988.html
   My bibliography  Save this paper

Log-Transform Kernel Density Estimation of Income Distribution

Author

Listed:
  • Arthur Charpentier

    (UQAM - Université du Québec à Montréal = University of Québec in Montréal)

  • Emmanuel Flachaire

    (GREQAM - Groupement de Recherche en Économie Quantitative d'Aix-Marseille - EHESS - École des hautes études en sciences sociales - AMU - Aix Marseille Université - ECM - École Centrale de Marseille - CNRS - Centre National de la Recherche Scientifique)

Abstract

Standard kernel density estimation methods are very often used in practice to estimate density function. It works well in numerous cases. However, it is known not to work so well with skewed, multimodal and heavy-tailed distributions. Such features are usual with income distributions, defined over the positive support. In this paper, we show that a preliminary logarithmic transformation of the data, combined with standard kernel density estimation methods, can provide a much better fit of the density estimation.

Suggested Citation

  • Arthur Charpentier & Emmanuel Flachaire, 2014. "Log-Transform Kernel Density Estimation of Income Distribution," Working Papers halshs-01115988, HAL.
  • Handle: RePEc:hal:wpaper:halshs-01115988
    Note: View the original document on HAL open archive server: https://shs.hal.science/halshs-01115988
    as

    Download full text from publisher

    File URL: https://shs.hal.science/halshs-01115988/document
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Pagan,Adrian & Ullah,Aman, 1999. "Nonparametric Econometrics," Cambridge Books, Cambridge University Press, number 9780521355643, September.
    2. Russell Davidson, 2012. "Statistical inference in the presence of heavy tails," Econometrics Journal, Royal Economic Society, vol. 15(1), pages 31-53, February.
    3. Hagmann, M. & Scaillet, O., 2007. "Local multiplicative bias correction for asymmetric kernel density estimators," Journal of Econometrics, Elsevier, vol. 141(1), pages 213-249, November.
    4. Davidson, Russell & Flachaire, Emmanuel, 2007. "Asymptotic and bootstrap inference for inequality and poverty measures," Journal of Econometrics, Elsevier, vol. 141(1), pages 141-166, November.
    5. Marron, J.S. & Schmitz, H.-P., 1992. "Simultaneous Density Estimation of Several Income Distributions," Econometric Theory, Cambridge University Press, vol. 8(4), pages 476-488, December.
    6. Cowell, Frank A. & Flachaire, Emmanuel, 2007. "Income distribution and inequality measurement: The problem of extreme values," Journal of Econometrics, Elsevier, vol. 141(2), pages 1044-1072, December.
    7. BOUEZMARNI, Taoufik & ROMBOUTS, Jeroen VK, 2010. "Nonparametric density estimation for multivariate bounded data," LIDAM Reprints CORE 2301, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    8. Song Chen, 2000. "Probability Density Function Estimation Using Gamma Kernels," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(3), pages 471-480, September.
    9. Bouezmarni, Taoufik & Scaillet, Olivier, 2005. "Consistency Of Asymmetric Kernel Density Estimators And Smoothed Histograms With Application To Income Data," Econometric Theory, Cambridge University Press, vol. 21(2), pages 390-412, April.
    10. Chen, Song Xi, 1999. "Beta kernel estimators for density functions," Computational Statistics & Data Analysis, Elsevier, vol. 31(2), pages 131-145, August.
    11. Frank A. Cowell, 2008. "Income Distribution and Inequality," Chapters, in: John B. Davis & Wilfred Dolfsma (ed.), The Elgar Companion to Social Economics, chapter 13, Edward Elgar Publishing.
    12. Abadir, Karim M. & Lawford, Steve, 2004. "Optimal asymmetric kernels," Economics Letters, Elsevier, vol. 83(1), pages 61-68, April.
    13. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    14. Arthur Charpentier & Abder Oulidi, 2010. "Beta kernel quantile estimators of heavy-tailed loss distributions," Post-Print halshs-00425566, HAL.
    15. Ahamada, Ibrahim & Flachaire, Emmanuel, 2010. "Non-Parametric Econometrics," OUP Catalogue, Oxford University Press, number 9780199578009.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Edwin Fourrier-Nicolaï & Michel Lubrano, 2021. "Bayesian Inference for Parametric Growth Incidence Curves," Research on Economic Inequality, in: Research on Economic Inequality: Poverty, Inequality and Shocks, volume 29, pages 31-55, Emerald Group Publishing Limited.
    2. Surya Bhushan, 2021. "Labour Productivity Dynamics in Indian Agriculture: 2000–2016," The Indian Journal of Labour Economics, Springer;The Indian Society of Labour Economics (ISLE), vol. 64(2), pages 371-388, June.
    3. Pierre Lafaye de Micheaux & Frédéric Ouimet, 2021. "A Study of Seven Asymmetric Kernels for the Estimation of Cumulative Distribution Functions," Mathematics, MDPI, vol. 9(20), pages 1-35, October.
    4. Ouimet, Frédéric & Tolosana-Delgado, Raimon, 2022. "Asymptotic properties of Dirichlet kernel density estimators," Journal of Multivariate Analysis, Elsevier, vol. 187(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bouezmarni, Taoufik & Rombouts, Jeroen V.K., 2010. "Nonparametric density estimation for positive time series," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 245-261, February.
    2. Ouimet, Frédéric & Tolosana-Delgado, Raimon, 2022. "Asymptotic properties of Dirichlet kernel density estimators," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    3. Hagmann, M. & Scaillet, O., 2007. "Local multiplicative bias correction for asymmetric kernel density estimators," Journal of Econometrics, Elsevier, vol. 141(1), pages 213-249, November.
    4. Malec, Peter & Schienle, Melanie, 2014. "Nonparametric kernel density estimation near the boundary," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 57-76.
    5. Frank A. Cowell & Emmanuel Flachaire, 2014. "Statistical Methods for Distributional Analysis," Working Papers halshs-01115996, HAL.
    6. Marcelo Fernandes & Eduardo Mendes & Olivier Scaillet, 2015. "Testing for symmetry and conditional symmetry using asymmetric kernels," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(4), pages 649-671, August.
    7. Frank Cowell & Emmanuel Flachaire & Sanghamitra Bandyopadhyay, 2013. "Reference distributions and inequality measurement," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 11(4), pages 421-437, December.
    8. Jean-Marie Dufour & Emmanuel Flachaire & Lynda Khalaf, 2019. "Permutation Tests for Comparing Inequality Measures," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 37(3), pages 457-470, July.
    9. Ouimet, Frédéric, 2022. "A symmetric matrix-variate normal local approximation for the Wishart distribution and some applications," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    10. Stéphane Guerrier & Samuel Orso & Maria-Pia Victoria-Feser, 2018. "Parametric Inference for Index Functionals," Econometrics, MDPI, vol. 6(2), pages 1-11, April.
    11. Brzezinski, Michal, 2013. "Asymptotic and bootstrap inference for top income shares," Economics Letters, Elsevier, vol. 120(1), pages 10-13.
    12. Renault, Olivier & Scaillet, Olivier, 2004. "On the way to recovery: A nonparametric bias free estimation of recovery rate densities," Journal of Banking & Finance, Elsevier, vol. 28(12), pages 2915-2931, December.
    13. Mohammadi, Faezeh & Izadi, Muhyiddin & Lai, Chin-Diew, 2016. "On testing whether burn-in is required under the long-run average cost," Statistics & Probability Letters, Elsevier, vol. 110(C), pages 217-224.
    14. Pierre Lafaye de Micheaux & Frédéric Ouimet, 2021. "A Study of Seven Asymmetric Kernels for the Estimation of Cumulative Distribution Functions," Mathematics, MDPI, vol. 9(20), pages 1-35, October.
    15. Mahdi Salehi & Andriette Bekker & Mohammad Arashi, 2023. "A Semi-parametric Density Estimation with Application in Clustering," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 52-78, April.
    16. Marchant, Carolina & Bertin, Karine & Leiva, Víctor & Saulo, Helton, 2013. "Generalized Birnbaum–Saunders kernel density estimators and an analysis of financial data," Computational Statistics & Data Analysis, Elsevier, vol. 63(C), pages 1-15.
    17. Hirukawa, Masayuki, 2010. "Nonparametric multiplicative bias correction for kernel-type density estimation on the unit interval," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 473-495, February.
    18. Fernandes, Marcelo & Grammig, Joachim, 2005. "Nonparametric specification tests for conditional duration models," Journal of Econometrics, Elsevier, vol. 127(1), pages 35-68, July.
    19. Nikolay Gospodinov & Masayuki Hirukawa, 2008. "Time Series Nonparametric Regression Using Asymmetric Kernels with an Application to Estimation of Scalar Diffusion Processes," CIRJE F-Series CIRJE-F-573, CIRJE, Faculty of Economics, University of Tokyo.
    20. Vladimir Hlasny & Paolo Verme, 2022. "The Impact of Top Incomes Biases on the Measurement of Inequality in the United States," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(4), pages 749-788, August.

    More about this item

    Keywords

    nonparametric density estimation; heavy-tail; income distribution; data transformation; lognormal kernel;
    All these keywords.

    JEL classification:

    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:wpaper:halshs-01115988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.