Principal points of a multivariate mixture distribution
AbstractA set of n-principal points of a distribution is defined as a set of n points that optimally represent the distribution in terms of mean squared distance. It provides an optimal n-point-approximation of the distribution. However, it is in general difficult to find a set of principal points of a multivariate distribution. Tarpey et al. [T. Tarpey, L. Li, B. Flury, Principal points and self-consistent points of elliptical distributions, Ann. Statist. 23 (1995) 103-112] established a theorem which states that any set of n-principal points of an elliptically symmetric distribution is in the linear subspace spanned by some principal eigenvectors of the covariance matrix. This theorem, called a "principal subspace theorem", is a strong tool for the calculation of principal points. In practice, we often come across distributions consisting of several subgroups. Hence it is of interest to know whether the principal subspace theorem remains valid even under such complex distributions. In this paper, we define a multivariate location mixture model. A theorem is established that clarifies a linear subspace in which n-principal points exist.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoArticle provided by Elsevier in its journal Journal of Multivariate Analysis.
Volume (Year): 102 (2011)
Issue (Month): 2 (February)
Contact details of provider:
Web page: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Tarpey, T., 1995. "Principal Points and Self-Consistent Points of Symmetrical Multivariate Distributions," Journal of Multivariate Analysis, Elsevier, vol. 53(1), pages 39-51, April.
- Tarpey, Thaddeus, 1994. "Two principal points of symmetric, strongly unimodal distributions," Statistics & Probability Letters, Elsevier, vol. 20(4), pages 253-257, July.
- Su, Yingcai, 1997. "On the Asymptotics of Quantizers in Two Dimensions," Journal of Multivariate Analysis, Elsevier, vol. 61(1), pages 67-85, April.
- Tarpey, Thaddeus, 2007. "Linear Transformations and the k-Means Clustering Algorithm: Applications to Clustering Curves," The American Statistician, American Statistical Association, vol. 61, pages 34-40, February.
- Li, Luning & Flury, Bernard, 1995. "Uniqueness of principal points for univariate distributions," Statistics & Probability Letters, Elsevier, vol. 25(4), pages 323-327, December.
- Thaddeus Tarpey, 1997. "Estimating principal points of univariate distributions," Journal of Applied Statistics, Taylor and Francis Journals, vol. 24(5), pages 499-512.
- Bali, Juan Lucas & Boente, Graciela, 2009. "Principal points and elliptical distributions from the multivariate setting to the functional case," Statistics & Probability Letters, Elsevier, vol. 79(17), pages 1858-1865, September.
- Yamamoto, Wataru & Shinozaki, Nobuo, 2000. "On uniqueness of two principal points for univariate location mixtures," Statistics & Probability Letters, Elsevier, vol. 46(1), pages 33-42, January.
- Thaddeus Tarpey, 2007. "A parametric k-means algorithm," Computational Statistics, Springer, vol. 22(1), pages 71-89, April.
If references are entirely missing, you can add them using this form.