IDEAS home Printed from https://ideas.repec.org/p/zbw/caseps/200416.html
   My bibliography  Save this paper

Multivariate Density Estimation and Visualization

Author

Listed:
  • Scott, David W.

Abstract

This chapter examines the use of flexible methods to approximate an unknown density function, and techniques appropriate for visualization of densities in up to four dimensions. The statistical analysis of data is a multilayered endeavor. Data must be carefully examined and cleaned to avoid spurious findings. A preliminary examination of data by graphical means is useful for this purpose. Graphical exploration of data was popularized by Tukey (1977) in his book on exploratory data analysis (EDA). Modern data mining packages also include an array of graphical tools such as the histogram, which is the simplest example of a density estimator. Exploring data is particularly challenging when the sample size is massive or if the number of variables exceeds a handful. In either situation, the use of nonparametric density estimation can aid in the fundamental goal of understanding the important features hidden in the data. In the following sections, the algorithms and theory of nonparametric density estimation will be described, as well as descriptions of the visualization of multivariate data and density estimates. For simplicity, the discussion will assume the data and functions are continuous. Extensions to discrete and mixed data are straightforward.

Suggested Citation

  • Scott, David W., 2004. "Multivariate Density Estimation and Visualization," Papers 2004,16, Humboldt University of Berlin, Center for Applied Statistics and Economics (CASE).
  • Handle: RePEc:zbw:caseps:200416
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/22190/1/16_ds.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. B. W. Silverman, 1982. "Kernel Density Estimation Using the Fast Fourier Transform," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 31(1), pages 93-99, March.
    2. A. Azzalini & A.W. Bowman, 1990. "A Look at Some Data on the Old Faithful Geyser," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 39(3), pages 357-365, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ouafae Benrabah & Elias Ould Saïd & Abdelkader Tatachak, 2015. "A kernel mode estimate under random left truncation and time series model: asymptotic normality," Statistical Papers, Springer, vol. 56(3), pages 887-910, August.
    2. Sayed A. Mostafa & Ibrahim A. Ahmad, 2019. "Kernel density estimation from complex surveys in the presence of complete auxiliary information," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(3), pages 295-338, April.
    3. Dietmar Pfeifer & Olena Ragulina, 2018. "Generating VaR scenarios with product beta distributions," Papers 1808.02457, arXiv.org, revised Jan 2019.
    4. Dietmar Pfeifer & Andreas Mandle & Olena Ragulina & C^ome Girschig, 2018. "New copulas based on general partitions-of-unity (part III) - the continuous case (extended version)," Papers 1803.00957, arXiv.org, revised May 2019.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gramacki, Artur & Gramacki, Jarosław, 2017. "FFT-based fast bandwidth selector for multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 27-45.
    2. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    3. Hirukawa, Masayuki, 2010. "Nonparametric multiplicative bias correction for kernel-type density estimation on the unit interval," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 473-495, February.
    4. Mark S. Handcock & Adrian E. Raftery & Jeremy M. Tantrum, 2007. "Model‐based clustering for social networks," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(2), pages 301-354, March.
    5. Holmström, Lasse, 2000. "The Accuracy and the Computational Complexity of a Multivariate Binned Kernel Density Estimator," Journal of Multivariate Analysis, Elsevier, vol. 72(2), pages 264-309, February.
    6. Fred Huffer & Cheolyong Park, 2000. "A test for multivariate structure," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(5), pages 633-650.
    7. Adriano Z. Zambom & Ronaldo Dias, 2013. "A Review of Kernel Density Estimation with Applications to Econometrics," International Econometric Review (IER), Econometric Research Association, vol. 5(1), pages 20-42, April.
    8. Branislav Panić & Marko Nagode & Jernej Klemenc & Simon Oman, 2022. "On Methods for Merging Mixture Model Components Suitable for Unsupervised Image Segmentation Tasks," Mathematics, MDPI, vol. 10(22), pages 1-22, November.
    9. Pinkse, Joris, 1998. "A consistent nonparametric test for serial independence," Journal of Econometrics, Elsevier, vol. 84(2), pages 205-231, June.
    10. Goutis, Constantinos, 1996. "Nonparametric estimation of a mixing density via the kernel method," DES - Working Papers. Statistics and Econometrics. WS 10437, Universidad Carlos III de Madrid. Departamento de Estadística.
    11. Yana Melnykov & Xuwen Zhu & Volodymyr Melnykov, 2021. "Transformation mixture modeling for skewed data groups with heavy tails and scatter," Computational Statistics, Springer, vol. 36(1), pages 61-78, March.
    12. Hardle, W. & Marron, J. S., 1995. "Fast and simple scatterplot smoothing," Computational Statistics & Data Analysis, Elsevier, vol. 20(1), pages 1-17, July.
    13. repec:cte:wsrepe:ws1450804 is not listed on IDEAS
    14. Rizwan Muhammad & Yaolong Zhao & Fan Liu, 2019. "Spatiotemporal Analysis to Observe Gender Based Check-In Behavior by Using Social Media Big Data: A Case Study of Guangzhou, China," Sustainability, MDPI, vol. 11(10), pages 1-30, May.
    15. José E. Chacón, 2020. "The Modal Age of Statistics," International Statistical Review, International Statistical Institute, vol. 88(1), pages 122-141, April.
    16. Jernigan, Robert W. & Baran, Robert H., 2003. "Testing lumpability in Markov chains," Statistics & Probability Letters, Elsevier, vol. 64(1), pages 17-23, August.
    17. José E. Chacón, 2019. "Mixture model modal clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(2), pages 379-404, June.
    18. Meintanis, S. & Ushakov, N. G., 2004. "Binned goodness-of-fit tests based on the empirical characteristic function," Statistics & Probability Letters, Elsevier, vol. 69(3), pages 305-314, September.
    19. O’Hagan, Adrian & Murphy, Thomas Brendan & Gormley, Isobel Claire & McNicholas, Paul D. & Karlis, Dimitris, 2016. "Clustering with the multivariate normal inverse Gaussian distribution," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 18-30.
    20. R. N. Rattihalli & S. B. Patil, 2021. "Data Dependent Asymmetric Kernels for Estimating the Density Function," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 155-186, February.
    21. Bashtannyk, David M. & Hyndman, Rob J., 2001. "Bandwidth selection for kernel conditional density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 36(3), pages 279-298, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:caseps:200416. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/cahubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.