IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v27y2012i4p757-777.html
   My bibliography  Save this article

Density estimation and comparison with a penalized mixture approach

Author

Listed:
  • Christian Schellhase
  • Göran Kauermann

Abstract

The paper presents smooth estimation of densities utilizing penalized splines. The idea is to represent the unknown density by a convex mixture of basis densities, where the weights are estimated in a penalized form. The proposed method extends the work of Komárek and Lesaffre (Comput Stat Data Anal 52(7):3441–3458, 2008 ) and allows for general density estimation. Simulations show a convincing performance in comparison to existing density estimation routines. The idea is extended to allow the density to depend on some (factorial) covariate. Assuming a binary group indicator, for instance, we can test on equality of the densities in the groups. This provides a smooth alternative to the classical Kolmogorov-Smirnov test or an Analysis of Variance and it shows stable and powerful behaviour. Copyright Springer-Verlag 2012

Suggested Citation

  • Christian Schellhase & Göran Kauermann, 2012. "Density estimation and comparison with a penalized mixture approach," Computational Statistics, Springer, vol. 27(4), pages 757-777, December.
  • Handle: RePEc:spr:compst:v:27:y:2012:i:4:p:757-777
    DOI: 10.1007/s00180-011-0289-6
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00180-011-0289-6
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00180-011-0289-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Gilles Celeux & Gilda Soromenho, 1996. "An entropy criterion for assessing the number of clusters in a mixture model," Journal of Classification, Springer;The Classification Society, vol. 13(2), pages 195-212, September.
    2. Göran Kauermann & Jean D. Opsomer, 2011. "Data-driven selection of the spline dimension in penalized spline regression," Biometrika, Biometrika Trust, vol. 98(1), pages 225-230.
    3. Wendimagegn Ghidey & Emmanuel Lesaffre & Paul Eilers, 2004. "Smooth Random Effects Distribution in a Linear Mixed Model," Biometrics, The International Biometric Society, vol. 60(4), pages 945-953, December.
    4. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506, January.
    5. Philip T. Reiss & R. Todd Ogden, 2009. "Smoothing parameter selection for a class of semiparametric linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 505-523, April.
    6. M. P. Wand, 2003. "Smoothing and mixed models," Computational Statistics, Springer, vol. 18(2), pages 223-249, July.
    7. Benaglia, Tatiana & Chauveau, Didier & Hunter, David R. & Young, Derek S., 2009. "mixtools: An R Package for Analyzing Mixture Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i06).
    8. Simon N. Wood, 2011. "Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(1), pages 3-36, January.
    9. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    10. Göran Kauermann & Tatyana Krivobokova & Ludwig Fahrmeir, 2009. "Some asymptotic results on generalized penalized spline smoothing," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 487-503, April.
    11. Ja‐Yong Koo & Charles Kooperberg & Jinho Park, 1999. "Logspline Density Estimation under Censoring and Truncation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 26(1), pages 87-105, March.
    12. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167, January.
    13. Yingxing Li & David Ruppert, 2008. "On the asymptotics of penalized splines," Biometrika, Biometrika Trust, vol. 95(2), pages 415-436.
    14. Gerda Claeskens & Tatyana Krivobokova & Jean D. Opsomer, 2009. "Asymptotic properties of penalized spline estimators," Biometrika, Biometrika Trust, vol. 96(3), pages 529-544.
    15. Komárek, Arnost & Lesaffre, Emmanuel, 2008. "Generalized linear mixed model with a penalized Gaussian mixture as a random effects distribution," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3441-3458, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shang, Han Lin, 2013. "Bayesian bandwidth estimation for a nonparametric functional regression model with unknown error density," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 185-198.
    2. Roland Langrock & Théo Michelot & Alexander Sohn & Thomas Kneib, 2015. "Semiparametric stochastic volatility modelling using penalized splines," Computational Statistics, Springer, vol. 30(2), pages 517-537, June.
    3. Roland Langrock & Timo Adam & Vianey Leos‐Barajas & Sina Mews & David L. Miller & Yannis P. Papastamatiou, 2018. "Spline‐based nonparametric inference in general state‐switching models," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 72(3), pages 179-200, August.
    4. Morteza Amini & Afarin Bayat & Reza Salehian, 2023. "hhsmm: an R package for hidden hybrid Markov/semi-Markov models," Computational Statistics, Springer, vol. 38(3), pages 1283-1335, September.
    5. Dalla Valle, Luciana & De Giuli, Maria Elena & Tarantola, Claudia & Manelli, Claudio, 2016. "Default probability estimation via pair copula constructions," European Journal of Operational Research, Elsevier, vol. 249(1), pages 298-311.
    6. Christian Schellhase & Torben Kuhlenkasper, 2017. "Semi-parametric estimation of income mobility with D‑vines using bivariate penalised splines," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 11(2), pages 107-134, October.
    7. Roland Langrock & Thomas Kneib & Alexander Sohn & Stacy L. DeRuiter, 2015. "Nonparametric inference in hidden Markov models using P-splines," Biometrics, The International Biometric Society, vol. 71(2), pages 520-528, June.
    8. Jaspers, Stijn & Aerts, Marc & Verbeke, Geert & Beloeil, Pierre-Alexandre, 2014. "A new semi-parametric mixture model for interval censored data, with applications in the field of antimicrobial resistance," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 30-42.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon N. Wood & Zheyuan Li & Gavin Shaddick & Nicole H. Augustin, 2017. "Generalized Additive Models for Gigadata: Modeling the U.K. Black Smoke Network Daily Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1199-1210, July.
    2. Takuma Yoshida, 2016. "Asymptotics and smoothing parameter selection for penalized spline regression with various loss functions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 70(4), pages 278-303, November.
    3. Smith, Michael S. & Kauermann, Göran, 2011. "Bicycle commuting in Melbourne during the 2000s energy crisis: A semiparametric analysis of intraday volumes," Transportation Research Part B: Methodological, Elsevier, vol. 45(10), pages 1846-1862.
    4. Lee, Wang-Sheng, 2014. "Big and Tall: Is there a Height Premium or Obesity Penalty in the Labor Market?," IZA Discussion Papers 8606, Institute of Labor Economics (IZA).
    5. Michael Wegener & Göran Kauermann, 2017. "Forecasting in nonlinear univariate time series using penalized splines," Statistical Papers, Springer, vol. 58(3), pages 557-576, September.
    6. Sonja Greven & Ciprian Crainiceanu, 2013. "On likelihood ratio testing for penalized splines," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 387-402, October.
    7. Lee, Wang-Sheng, 2014. "Is the BMI a Relic of the Past?," IZA Discussion Papers 8637, Institute of Labor Economics (IZA).
    8. I. Gijbels & I. Prosdocimi & G. Claeskens, 2010. "Nonparametric estimation of mean and dispersion functions in extended generalized linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 19(3), pages 580-608, November.
    9. Andrada Ivanescu & Ana-Maria Staicu & Fabian Scheipl & Sonja Greven, 2015. "Penalized function-on-function regression," Computational Statistics, Springer, vol. 30(2), pages 539-568, June.
    10. Wu, Ximing & Sickles, Robin, 2018. "Semiparametric estimation under shape constraints," Econometrics and Statistics, Elsevier, vol. 6(C), pages 74-89.
    11. Simon N. Wood, 2020. "Inference and computation with generalized additive models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 307-339, June.
    12. repec:wyi:journl:002174 is not listed on IDEAS
    13. Göran Kauermann & Timo Teuber & Peter Flaschel, 2012. "Exploring US Business Cycles with Bivariate Loops Using Penalized Spline Regression," Computational Economics, Springer;Society for Computational Economics, vol. 39(4), pages 409-427, April.
    14. Kauermann Goeran & Krivobokova Tatyana & Semmler Willi, 2011. "Filtering Time Series with Penalized Splines," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 15(2), pages 1-28, March.
    15. Holland, Ashley D., 2017. "Penalized spline estimation in the partially linear model," Journal of Multivariate Analysis, Elsevier, vol. 153(C), pages 211-235.
    16. Longhi, Christian & Musolesi, Antonio & Baumont, Catherine, 2014. "Modeling structural change in the European metropolitan areas during the process of economic integration," Economic Modelling, Elsevier, vol. 37(C), pages 395-407.
    17. Simon N. Wood & Natalya Pya & Benjamin Säfken, 2016. "Smoothing Parameter and Model Selection for General Smooth Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1548-1563, October.
    18. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    19. Blöchl, Andreas, 2014. "Trend Estimation with Penalized Splines as Mixed Models for Series with Structural Breaks," Discussion Papers in Economics 18446, University of Munich, Department of Economics.
    20. Kuhlenkasper, Torben & Steinhardt, Max Friedrich, 2017. "Who leaves and when? Selective outmigration of immigrants from Germany," Economic Systems, Elsevier, vol. 41(4), pages 610-621.
    21. Basile, Roberto & Durbán, María & Mínguez, Román & María Montero, Jose & Mur, Jesús, 2014. "Modeling regional economic dynamics: Spatial dependence, spatial heterogeneity and nonlinearities," Journal of Economic Dynamics and Control, Elsevier, vol. 48(C), pages 229-245.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:27:y:2012:i:4:p:757-777. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.