Greedy clustering of count data through a mixture of multinomial PCA
DOI: 10.1007/s00180-020-01008-9
References listed on IDEAS
- Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
- Grün, Bettina & Hornik, Kurt, 2011. "topicmodels: An R Package for Fitting Topic Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i13).
- Bouveyron, C. & Girard, S. & Schmid, C., 2007. "High-dimensional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 502-519, September.
- Bergé, Laurent R. & Bouveyron, Charles & Corneli, Marco & Latouche, Pierre, 2019. "The latent topic block model for the co-clustering of textual interaction data," Computational Statistics & Data Analysis, Elsevier, vol. 137(C), pages 247-270.
- Isabella Zwiener & Barbara Frisch & Harald Binder, 2014. "Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-13, January.
- Carl Eckart & Gale Young, 1936. "The approximation of one matrix by another of lower rank," Psychometrika, Springer;The Psychometric Society, vol. 1(3), pages 211-218, September.
- David M. Blei & Alp Kucukelbir & Jon D. McAuliffe, 2017. "Variational Inference: A Review for Statisticians," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 859-877, April.
- James A Fordyce & Zachariah Gompert & Matthew L Forister & Chris C Nice, 2011. "A Hierarchical Bayesian Approach to Ecological Count Data: A Flexible Tool for Ecologists," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-7, November.
- Léna Carel & Pierre Alquier, 2017. "Simultaneous Dimension Reduction and Clustering via the NMF-EM Algorithm," Working Papers 2017-38, Center for Research in Economics and Statistics.
- Ding, Chris & Li, Tao & Peng, Wei, 2008. "On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing," Computational Statistics & Data Analysis, Elsevier, vol. 52(8), pages 3913-3927, April.
- Celeux, Gilles & Govaert, Gerard, 1992. "A classification EM algorithm for clustering and two stochastic versions," Computational Statistics & Data Analysis, Elsevier, vol. 14(3), pages 315-332, October.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
- Triss Ashton & Nicholas Evangelopoulos & Victor Prybutok, 2014. "Extending monitoring methods to textual data: a research agenda," Quality & Quantity: International Journal of Methodology, Springer, vol. 48(4), pages 2277-2294, July.
- Bastian Schaefermeier & Gerd Stumme & Tom Hanika, 2021. "Topic space trajectories," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5759-5795, July.
- Yoshi Fujiwara & Rubaiyat Islam, 2021. "Bitcoin's Crypto Flow Network," Papers 2106.11446, arXiv.org, revised Jul 2021.
- Bouveyron, Charles & Brunet, Camille, 2012. "Theoretical and practical considerations on the convergence properties of the Fisher-EM algorithm," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 29-41.
- van Loon, Austin, 2022. "Three Families of Automated Text Analysis," SocArXiv htnej, Center for Open Science.
- Paul Hofmarcher & Sourav Adhikari & Bettina Grun, 2022. "Gaining Insights on U.S. Senate Speeches Using a Time Varying Text Based Ideal Point Model," Papers 2206.10877, arXiv.org.
- Travis R Meyer & Daniel Balagué & Miguel Camacho-Collados & Hao Li & Katie Khuu & P Jeffrey Brantingham & Andrea L Bertozzi, 2019. "A year in Madrid as described through the analysis of geotagged Twitter data," Environment and Planning B, vol. 46(9), pages 1724-1740, November.
- Andreas Falke & Harald Hruschka, 2022. "Analyzing browsing across websites by machine learning methods," Journal of Business Economics, Springer, vol. 92(5), pages 829-852, July.
- Zhang, Zhong-Yuan & Gai, Yujie & Wang, Yu-Fei & Cheng, Hui-Min & Liu, Xin, 2018. "On equivalence of likelihood maximization of stochastic block model and constrained nonnegative matrix factorization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 503(C), pages 687-697.
- Anastasios Bellas & Charles Bouveyron & Marie Cottrell & Jérôme Lacaille, 2013. "Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 281-300, September.
- Zhang, Lingsong & Lu, Shu & Marron, J.S., 2015. "Nested nonnegative cone analysis," Computational Statistics & Data Analysis, Elsevier, vol. 88(C), pages 100-110.
- Zura Kakushadze & Willie Yu, 2020. "Machine Learning Treasury Yields," Bulletin of Applied Economics, Risk Market Journals, vol. 7(1), pages 1-65.
- Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
- Imran Ali & Devika Kannan, 2022. "Mapping research on healthcare operations and supply chain management: a topic modelling-based literature review," Annals of Operations Research, Springer, vol. 315(1), pages 29-55, August.
- Nikulin, Vladimir & Huang, Tian-Hsiang & Ng, Shu-Kay & Rathnayake, Suren I. & McLachlan, Geoffrey J., 2011. "A very fast algorithm for matrix factorization," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 773-782, July.
- Sun, Lijun & Axhausen, Kay W., 2016. "Understanding urban mobility patterns with a probabilistic tensor factorization framework," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 511-524.
- Ma, Xiaoke & Wang, Bingbo & Yu, Liang, 2018. "Semi-supervised spectral algorithms for community detection in complex networks based on equivalence of clustering methods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 786-802.
- Mohammadamin Edrisi & Xiru Huang & Huw A. Ogilvie & Luay Nakhleh, 2023. "Accurate integration of single-cell DNA and RNA for analyzing intratumor heterogeneity using MaCroDNA," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
- Zura Kakushadze & Willie Yu, 2020. "Machine Learning Treasury Yields," Papers 2003.05095, arXiv.org.
- Alexandre L. M. Levada, 2021. "PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(4), pages 829-868, December.
More about this item
Keywords
Clustering; Mixture models; Count data; Dimension reduction; Topic modeling; Variational inference
Handle: RePEc:spr:compst:v:36:y:2021:i:1:d:10.1007_s00180-020-01008-9