IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v18y2018i1p101-117.html
   My bibliography  Save this article

ldagibbs: A command for topic modeling in Stata using latent Dirichlet allocation

Author

Listed:
  • Carlo Schwarz

    (University of Warwick)

Abstract

In this article, I introduce the ldagibbs command, which implements latent Dirichlet allocation in Stata. Latent Dirichlet allocation is the most popular machine-learning topic model. Topic models automatically cluster text documents into a user-chosen number of topics. Latent Dirichlet allocation represents each document as a probability distribution over topics and represents each topic as a probability distribution over words. Therefore, latent Dirichlet allocation provides a way to analyze the content of large unclassified text data and an alternative to predefined document classifications.

Suggested Citation

  • Carlo Schwarz, 2018. "ldagibbs: A command for topic modeling in Stata using latent Dirichlet allocation," Stata Journal, StataCorp LP, vol. 18(1), pages 101-117, March.
  • Handle: RePEc:tsj:stataj:v:18:y:2018:i:1:p:101-117
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj18-1/st0515/
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0515
    File Function: link to article purchase
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. McCannon, Bryan & Zhou, Yang & Hall, Joshua, 2021. "Measuring a Contract’s Breadth: A Text Analysis," Working Papers 11013, George Mason University, Mercatus Center.
    2. Julio Elias & Nicola Lacetera & Mario Macis, 2022. "Is the Price Right? The Role of Morals, Ideology, and Tradeoff Thinking in Explaining Reactions to Price Surges," CESifo Working Paper Series 9712, CESifo.
    3. Leonardo Bursztyn & Aakaash Rao & Christopher Roth & David Yanagizawa-Drott, 2023. "Opinions as Facts," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(4), pages 1832-1864.
    4. Hali Edison & Hector Carcel, 2021. "Text data analysis using Latent Dirichlet Allocation: an application to FOMC transcripts," Applied Economics Letters, Taylor & Francis Journals, vol. 28(1), pages 38-42, January.
    5. Elisa Facchetti & Lorenzo Neri & Marco Ovidi, 2021. "Should you Meet The Parents? The impact of information on non-test score attributes on school choice," DISCE - Working Papers del Dipartimento di Economia e Finanza def113, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    6. Bryan McCannon & Joshua Hall & Yang Zhou, 2023. "Measuring a contract's breadth: A text analysis," American Journal of Economics and Sociology, Wiley Blackwell, vol. 82(1), pages 5-14, January.
    7. Christopher Rauh & Laëtitia Renée, 2023. "How to measure parenting styles?," Review of Economics of the Household, Springer, vol. 21(3), pages 1063-1081, September.
    8. Yuexia Han & Zhuang Chen & Yuxin Hu & Liyan Zhang & Huishan Fu & Renyong Zhang & Wei Zhang, 2023. "A PIE analysis of China’s commercial space development," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-11, December.
    9. Damane Moeti, 2022. "Topic Classification of Central Bank Monetary Policy Statements: Evidence from Latent Dirichlet Allocation in Lesotho," Acta Universitatis Sapientiae, Economics and Business, Sciendo, vol. 10(1), pages 199-227, September.
    10. Ali Sina Önder & Sergey V. Popov & Sascha Schweitzer, 2021. "Leadership in Scholarship: Editors’ Appointments and the Profession’s Narrative," Working Papers in Economics & Finance 2021-05, University of Portsmouth, Portsmouth Business School, Economics and Finance Subject Group.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:18:y:2018:i:1:p:101-117. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.