IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v101y2016icp148-160.html

A fast and objective multidimensional kernel density estimation method: fastKDE

Author

Listed:
  • O’Brien, Travis A.
  • Kashinath, Karthik
  • Cavanaugh, Nicholas R.
  • Collins, William D.
  • O’Brien, John P.

Abstract

Numerous facets of scientific research implicitly or explicitly call for the estimation of probability densities. Histograms and kernel density estimates (KDEs) are two commonly used techniques for estimating such information, with the KDE generally providing a higher fidelity representation of the probability density function (PDF). Both methods require specification of either a bin width or a kernel bandwidth. While techniques exist for choosing the kernel bandwidth optimally and objectively, they are computationally intensive, since they require repeated calculation of the KDE. A solution for objectively and optimally choosing both the kernel shape and width has recently been developed by Bernacchia and Pigolotti (2011). While this solution theoretically applies to multidimensional KDEs, it has not been clear how to practically do so.

Suggested Citation

  • O’Brien, Travis A. & Kashinath, Karthik & Cavanaugh, Nicholas R. & Collins, William D. & O’Brien, John P., 2016. "A fast and objective multidimensional kernel density estimation method: fastKDE," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 148-160.
  • Handle: RePEc:eee:csdana:v:101:y:2016:i:c:p:148-160
    DOI: 10.1016/j.csda.2016.02.014
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947316300408
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.02.014?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Ernst Wit & Edwin van den Heuvel & Jan-Willem Romeijn, 2012. "‘All models are wrong...’: an introduction to model uncertainty," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 66(3), pages 217-236, August.
    2. Tarn Duong & Martin L. Hazelton, 2005. "Cross‐validation Bandwidth Matrices for Multivariate Kernel Density Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 32(3), pages 485-506, September.
    3. Joerg Luedicke & Alberto Bernacchia, 2014. "Self-consistent density estimation," Stata Journal, StataCorp LLC, vol. 14(2), pages 237-258, June.
    4. Nils-Bastian Heidenreich & Anja Schindler & Stefan Sperlich, 2013. "Bandwidth selection for kernel density estimation: a review of fully automatic selectors," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 403-433, October.
    5. Alberto Bernacchia & Simone Pigolotti, 2011. "Self‐consistent method for density estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 407-422, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Zhi-Feng & Liu, You-Yuan & Chen, Xiao-Rui & Zhang, Shu-Rui & Luo, Xing-Fu & Li, Ling-Ling & Yang, Yi-Zhou & You, Guo-Dong, 2024. "A novel deep learning-based evolutionary model with potential attention and memory decay-enhancement strategy for short-term wind power point-interval forecasting," Applied Energy, Elsevier, vol. 360(C).
    2. Pavel Loskot, 2021. "A Generative Model for Correlated Graph Signals," Mathematics, MDPI, vol. 9(23), pages 1-12, November.
    3. Mourad Zribi & Ibrahim Sadok & Bassel Marhaba, 2025. "Kernel-Diffeomorphism Bayesian Bootstrap Filter to reduce speckle noise on SAR images," Computational Statistics, Springer, vol. 40(7), pages 3613-3643, September.
    4. Federico Palacios-González & Rosa M. García-Fernández, 2020. "A faster algorithm to estimate multiresolution densities," Computational Statistics, Springer, vol. 35(3), pages 1207-1230, September.
    5. Yumi Oh & Peng Lyu & Sunwoo Ko & Jeongik Min & Juwhan Song, 2024. "Enhancing Broiler Weight Estimation through Gaussian Kernel Density Estimation Modeling," Agriculture, MDPI, vol. 14(6), pages 1-20, May.
    6. Zhiyuan Zhang & Huiyan Qin, 2025. "Spatiotemporal Differentiation of Carbon Emission Efficiency and Influencing Factors in the Five Major Maize Producing Areas of China," Agriculture, MDPI, vol. 15(15), pages 1-23, July.
    7. Amir Khosheghbal & Peter J. Haas & Chaitra Gopalappa, 2025. "Mechanistic modeling of social conditions in disease-prediction simulations via copulas and probabilistic graphical models: HIV case study," Health Care Management Science, Springer, vol. 28(1), pages 28-49, March.
    8. DMSLB Dissanayake & Takehiro Morimoto & Yuji Murayama & Manjula Ranagalage & Hepi H. Handayani, 2018. "Impact of Urban Surface Characteristics and Socio-Economic Variables on the Spatial Variation of Land Surface Temperature in Lagos City, Nigeria," Sustainability, MDPI, vol. 11(1), pages 1-23, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yan, Hanhuan & Han, Liyan, 2019. "Empirical distributions of stock returns: Mixed normal or kernel density?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 514(C), pages 473-486.
    2. Scholz, Michael & Nielsen, Jens Perch & Sperlich, Stefan, 2015. "Nonparametric prediction of stock returns based on yearly data: The long-term view," Insurance: Mathematics and Economics, Elsevier, vol. 65(C), pages 143-155.
    3. Hu, Shuowen & Poskitt, D.S. & Zhang, Xibin, 2012. "Bayesian adaptive bandwidth kernel density estimation of irregular multivariate distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 732-740.
    4. Billings, Stephen B. & Johnson, Erik B., 2012. "A non-parametric test for industrial specialization," Journal of Urban Economics, Elsevier, vol. 71(3), pages 312-331.
    5. Inés Barbeito & Ricardo Cao & Stefan Sperlich, 2023. "Bandwidth selection for statistical matching and prediction," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 418-446, March.
    6. Tingting Cheng & Jiti Gao & Xibin Zhang, 2014. "Semiparametric Localized Bandwidth Selection in Kernel Density Estimation," Monash Econometrics and Business Statistics Working Papers 14/14, Monash University, Department of Econometrics and Business Statistics.
    7. Boris Branisa & Adriana Cardozo, 2009. "Regional Growth Convergence in Colombia Using Social Indicators," Ibero America Institute for Econ. Research (IAI) Discussion Papers 195, Ibero-America Institute for Economic Research.
    8. Pulkkinen, Seppo, 2015. "Ridge-based method for finding curvilinear structures from noisy data," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 89-109.
    9. Surya T. Tokdar & Ryan Martin, 2021. "Bayesian Test of Normality Versus a Dirichlet Process Mixture Alternative," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 66-96, May.
    10. Tingting Cheng & Jiti Gao & Xibin Zhang, 2019. "Nonparametric localized bandwidth selection for Kernel density estimation," Econometric Reviews, Taylor & Francis Journals, vol. 38(7), pages 733-762, August.
    11. Francisco Richter & Bart Haegeman & Rampal S. Etienne & Ernst C. Wit, 2020. "Introducing a general class of species diversification models for phylogenetic trees," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 74(3), pages 261-274, August.
    12. Christoph Lambio & Tillman Schmitz & Richard Elson & Jeffrey Butler & Alexandra Roth & Silke Feller & Nicolai Savaskan & Tobia Lakes, 2023. "Exploring the Spatial Relative Risk of COVID-19 in Berlin-Neukölln," IJERPH, MDPI, vol. 20(10), pages 1-22, May.
    13. Marina Johnson & Abdullah Albizri & Serhat Simsek, 2022. "Artificial intelligence in healthcare operations to enhance treatment outcomes: a framework to predict lung cancer prognosis," Annals of Operations Research, Springer, vol. 308(1), pages 275-305, January.
    14. Wenchen Liu & Yincai Tang & Ancha Xu, 2021. "Zero-and-one-inflated Poisson regression model," Statistical Papers, Springer, vol. 62(2), pages 915-934, April.
    15. Max Köhler & Anja Schindler & Stefan Sperlich, 2014. "A Review and Comparison of Bandwidth Selection Methods for Kernel Regression," International Statistical Review, International Statistical Institute, vol. 82(2), pages 243-274, August.
    16. Muhammad Ali Musarat & Wesam Salah Alaloul & Muhammad Babar Ali Rabbani & Mujahid Ali & Muhammad Altaf & Roman Fediuk & Nikolai Vatin & Sergey Klyuev & Hamna Bukhari & Alishba Sadiq & Waqas Rafiq & Wa, 2021. "Kabul River Flow Prediction Using Automated ARIMA Forecasting: A Machine Learning Approach," Sustainability, MDPI, vol. 13(19), pages 1-26, September.
    17. Peiyuan Zhang & Jiaming Li & Wenzhong Zhang, 2022. "Characteristics of High-Technology Industry Migration within Metropolitan Areas—A Case Study of Beijing Metropolitan Area," Sustainability, MDPI, vol. 14(19), pages 1-19, October.
    18. M. Hiabu & E. Mammen & M. D. Martìnez-Miranda & J. P. Nielsen, 2016. "In-sample forecasting with local linear survival densities," Biometrika, Biometrika Trust, vol. 103(4), pages 843-859.
    19. Ge Song & Jiahui Yuan & Charlie Cheng-Jie Ji, 2021. "Application of Gibbs Sampling in Modelling the Utilization Rate of Raw Materials for Drug Coating," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 10(2), pages 1-28, March.
    20. Michael Govorov & Giedrė Beconytė & Gennady Gienko, 2023. "Trivariate Kernel Density Estimation of Spatiotemporal Crime Events with Case Study for Lithuania," Sustainability, MDPI, vol. 15(11), pages 1-17, May.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:101:y:2016:i:c:p:148-160. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.