IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0259111.html
   My bibliography  Save this article

Semiparametric maximum likelihood probability density estimation

Author

Listed:
  • Frank Kwasniok

Abstract

A comprehensive methodology for semiparametric probability density estimation is introduced and explored. The probability density is modelled by sequences of mostly regular or steep exponential families generated by flexible sets of basis functions, possibly including boundary terms. Parameters are estimated by global maximum likelihood without any roughness penalty. A statistically orthogonal formulation of the inference problem and a numerically stable and fast convex optimization algorithm for its solution are presented. Automatic model selection over the type and number of basis functions is performed with the Bayesian information criterion. The methodology can naturally be applied to densities supported on bounded, infinite or semi-infinite domains without boundary bias. Relationships to the truncated moment problem and the moment-constrained maximum entropy principle are discussed and a new theorem on the existence of solutions is contributed. The new technique compares very favourably to kernel density estimation, the diffusion estimator, finite mixture models and local likelihood density estimation across a diverse range of simulation and observation data sets. The semiparametric estimator combines a very small mean integrated squared error with a high degree of smoothness which allows for a robust and reliable detection of the modality of the probability density in terms of the number of modes and bumps.

Suggested Citation

  • Frank Kwasniok, 2021. "Semiparametric maximum likelihood probability density estimation," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-33, November.
  • Handle: RePEc:plo:pone00:0259111
    DOI: 10.1371/journal.pone.0259111
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0259111
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0259111&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0259111?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sundberg,Rolf, 2019. "Statistical Modelling by Exponential Families," Cambridge Books, Cambridge University Press, number 9781108701112.
    2. Wu, Ximing, 2003. "Calculation of maximum entropy densities with application to income distribution," Journal of Econometrics, Elsevier, vol. 115(2), pages 347-354, August.
    3. Kooperberg, Charles & Stone, Charles J., 1991. "A study of logspline density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 12(3), pages 327-347, November.
    4. Sundberg,Rolf, 2019. "Statistical Modelling by Exponential Families," Cambridge Books, Cambridge University Press, number 9781108476591.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Park, Jaewoo & Jin, Ick Hoon & Schweinberger, Michael, 2022. "Bayesian model selection for high-dimensional Ising models, with applications to educational data," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    2. Whiteley, Nick, 2021. "Dimension-free Wasserstein contraction of nonlinear filters," Stochastic Processes and their Applications, Elsevier, vol. 135(C), pages 31-50.
    3. Vassilios Bazinas & Bent Nielsen, 2022. "Causal Transmission in Reduced-Form Models," Econometrics, MDPI, vol. 10(2), pages 1-25, March.
    4. Michael Schweinberger, 2022. "Discussion to: Bayesian graphical models for modern biological applications by Y. Ni, V. Baladandayuthapani, M. Vannucci and F.C. Stingo," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(2), pages 253-260, June.
    5. Aris Spanos, 2022. "Frequentist Model-based Statistical Induction and the Replication Crisis," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 20(1), pages 133-159, September.
    6. Jesse B. Tack & David Ubilava, 2015. "Climate and agricultural risk: measuring the effect of ENSO on U.S. crop insurance," Agricultural Economics, International Association of Agricultural Economists, vol. 46(2), pages 245-257, March.
    7. Carol Alexander & José María Sarabia, 2012. "Quantile Uncertainty and Value‐at‐Risk Model Risk," Risk Analysis, John Wiley & Sons, vol. 32(8), pages 1293-1308, August.
    8. Tanaka, Ken'ichiro & Toda, Alexis Akira, 2015. "Discretizing Distributions with Exact Moments: Error Estimate and Convergence Analysis," University of California at San Diego, Economics Working Paper Series qt7g23r5kh, Department of Economics, UC San Diego.
    9. Wu, Ximing & Perloff, Jeffrey M., 2004. "China's Income Distribution Over Time: Reasons for Rising Inequality," Institute for Research on Labor and Employment, Working Paper Series qt9jw2v939, Institute of Industrial Relations, UC Berkeley.
    10. Matthew Wiser & Gloria Yeomans-Maldonado & Sudipta Sarangi, 2014. "Having More Fun with Organized Kissing," Southern Economic Journal, John Wiley & Sons, vol. 80(3), pages 855-865, January.
    11. Wu, Ximing & Perloff, Jeffrey M., 2007. "Information-Theoretic Deconvolution Approximation of Treatment Effect Distribution," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt6bm6n30x, Department of Agricultural & Resource Economics, UC Berkeley.
    12. Kirkby, J. Lars & Leitao, Álvaro & Nguyen, Duy, 2021. "Nonparametric density estimation and bandwidth selection with B-spline bases: A novel Galerkin method," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    13. Chang, Meng-Shiuh & Wu, Ximing, 2015. "Transformation-based nonparametric estimation of multivariate densities," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 71-88.
    14. Domenico Di Gangi & Fabrizio Lillo & Davide Pirino, 2015. "Assessing systemic risk due to fire sales spillover through maximum entropy network reconstruction," Papers 1509.00607, arXiv.org, revised Jul 2018.
    15. Beatty, Timothy K. M. & LaFrance, Jeffrey T, 2005. "U.S. Demand for Food and Nutrition in the 20th Century," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt41f3z7cd, Department of Agricultural & Resource Economics, UC Berkeley.
    16. Carol Alexander & Jose Maria Sarabia, 2010. "Endogenizing Model Risk to Quantile Estimates," ICMA Centre Discussion Papers in Finance icma-dp2010-07, Henley Business School, University of Reading.
    17. Lee Tae-Hwy & Ullah Aman & Mao Millie Yi, 2021. "Maximum Entropy Analysis of Consumption-based Capital Asset Pricing Model and Volatility," Journal of Econometric Methods, De Gruyter, vol. 10(1), pages 1-19, January.
    18. Jesse Tack & Ardian Harri & Keith Coble, 2012. "More than Mean Effects: Modeling the Effect of Climate on the Higher Order Moments of Crop Yields," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 94(5), pages 1037-1054.
    19. Yiguo Sun & Thanasis Stengos, 2008. "The absolute health income hypothesis revisited: a semiparametric quantile regression approach," Empirical Economics, Springer, vol. 35(2), pages 395-412, September.
    20. Sung Y. Park & Anil K. Bera, 2018. "Information theoretic approaches to income density estimation with an application to the U.S. income data," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 16(4), pages 461-486, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0259111. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.