Mixtures of Dirichlet-Multinomial distributions for supervised and unsupervised classification of short text data
Author
Abstract
Suggested Citation
DOI: 10.1007/s11634-020-00399-3
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Ian Holmes & Keith Harris & Christopher Quince, 2012. "Dirichlet Multinomial Mixtures: Generative Models for Microbial Metagenomics," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-15, February.
- Feinerer, Ingo & Hornik, Kurt & Meyer, David, 2008. "Text Mining Infrastructure in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i05).
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Zhao, Xin & Zhang, Jingru & Lin, Wei, 2023. "Clustering multivariate count data via Dirichlet-multinomial network fusion," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
- Massimo Bilancia & Andrea Nigri & Samuele Magro, 2025. "Stochastic variational inference for clustering short text data with finite mixtures of Dirichlet-Multinomial distributions," Statistical Papers, Springer, vol. 66(4), pages 1-39, June.
- Massimo Bilancia & Michele Nanni & Fabio Manca & Gianvito Pio, 2023. "Variational Bayes estimation of hierarchical Dirichlet-multinomial mixtures for text clustering," Computational Statistics, Springer, vol. 38(4), pages 2015-2051, December.
- Angela Maria D’Uggento & Albino Biafora & Fabio Manca & Claudia Marin & Massimo Bilancia, 2023. "A text data mining approach to the study of emotions triggered by new advertising formats during the COVID-19 pandemic," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(3), pages 2303-2325, June.
- Marzia Freo & Alessandra Luati, 2024. "Lasso-based variable selection methods in text regression: the case of short texts," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 108(1), pages 69-99, March.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Achal Dhariwal & Polona Rajar & Gabriela Salvadori & Heidi Aarø Åmdal & Dag Berild & Ola Didrik Saugstad & Drude Fugelseth & Gorm Greisen & Ulf Dahle & Kirsti Haaland & Fernanda Cristina Petersen, 2024. "Prolonged hospitalization signature and early antibiotic effects on the nasopharyngeal resistome in preterm infants," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
- Shuyue Huang & Lena Jingen Liang & Hwansuk Chris Choi, 2022. "How We Failed in Context: A Text-Mining Approach to Understanding Hotel Service Failures," Sustainability, MDPI, vol. 14(5), pages 1-18, February.
- Daoud, Adel & Kohl, Sebastian, 2016. "How much do sociologists write about economic topics? Using big data to test some conventional views in economic sociology, 1890 to 2014," MPIfG Discussion Paper 16/7, Max Planck Institute for the Study of Societies.
- David C Molik & DeAndre Tomlinson & Shane Davitt & Eric L Morgan & Matthew Sisk & Benjamin Roche & Natalie Meyers & Michael E Pfrender, 2021. "Combining natural language processing and metabarcoding to reveal pathogen-environment associations," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 15(4), pages 1-21, April.
- Hornik, Kurt & Grün, Bettina, 2014. "movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i10).
- Croce, Annalisa & Toschi, Laura & Ughetto, Elisa & Zanni, Sara, 2024. "Cleantech and policy framework in Europe: A machine learning approach," Energy Policy, Elsevier, vol. 186(C).
- Holand, Øystein & Contiero, Barbara & Næss, Marius W. & Cozzi, Giulio, 2024. "“The Times They Are A-Changin' “ – research trends and perspectives of reindeer pastoralism – A review using text mining and topic modelling," Land Use Policy, Elsevier, vol. 136(C).
- B Ian Hutchins & Xin Yuan & James M Anderson & George M Santangelo, 2016. "Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level," PLOS Biology, Public Library of Science, vol. 14(9), pages 1-25, September.
- Motta Queiroz, Mariza & Roque, Carlos & Moura, Filipe & Marôco, João, 2024. "Understanding the expectations of parents regarding their children's school commuting by public transport using latent Dirichlet Allocation," Transportation Research Part A: Policy and Practice, Elsevier, vol. 181(C).
- repec:diw:diwwpp:dp1835 is not listed on IDEAS
- Sanjeena Subedi & Drew Neish & Stephen Bak & Zeny Feng, 2020. "Cluster analysis of microbiome data by using mixtures of Dirichlet–multinomial regression models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1163-1187, November.
- KOCAK, Necmettin Alpay, 2021. "The Impacts Of Speeches On Nowcasting Gdp: A Case Study On Euro Area Markets," Studii Financiare (Financial Studies), Centre of Financial and Monetary Research "Victor Slavescu", vol. 25(1), pages 6-29, March.
- Lisa Bruttel & Maximilian Andres, 2024. "Communicating Cartel Intentions," CEPA Discussion Papers 77, Center for Economic Policy Analysis.
- Zhao, Xin & Zhang, Jingru & Lin, Wei, 2023. "Clustering multivariate count data via Dirichlet-multinomial network fusion," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
- Olgun Aydin & Cansu Altunbas & Elvan Hayat, 2021. "Using Text Mining Techniques to Understand the Economic Effects of COVID-19 Pandemic," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 4), pages 760-774.
- Abhinav Khare & Qing He & Rajan Batta, 2020. "Predicting gasoline shortage during disasters using social media," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 42(3), pages 693-726, September.
- Lovrić, Marko & Lovrić, Nataša & Mavsar, Robert, 2020. "Mapping forest-based bioeconomy research in Europe," Forest Policy and Economics, Elsevier, vol. 110(C).
- Yaru Song & Hongyu Zhao & Tao Wang, 2020. "An adaptive independence test for microbiome community data," Biometrics, The International Biometric Society, vol. 76(2), pages 414-426, June.
- Cristian Mejia & Yuya Kajikawa, 2021. "The Academic Landscapes of Manufacturing Enterprise Performance and Environmental Sustainability: A Study of Commonalities and Differences," IJERPH, MDPI, vol. 18(7), pages 1-16, March.
- Lehotský, Lukáš & Černoch, Filip & Osička, Jan & Ocelík, Petr, 2019. "When climate change is missing: Media discourse on coal mining in the Czech Republic," Energy Policy, Elsevier, vol. 129(C), pages 774-786.
- Doblinger, Claudia & Surana, Kavita & Li, Deyu & Hultman, Nathan & Anadón, Laura Díaz, 2022. "How do global manufacturing shifts affect long-term clean energy innovation? A study of wind energy suppliers," Research Policy, Elsevier, vol. 51(7).
More about this item
Keywords
Clustering; Gradient descent algorithm; Mixture models; Text data analysis;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:advdac:v:14:y:2020:i:4:d:10.1007_s11634-020-00399-3. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.