IDEAS home Printed from https://ideas.repec.org/a/cup/polals/v28y2020i1p87-111_5.html
   My bibliography  Save this article

A Timely Intervention: Tracking the Changing Meanings of Political Concepts with Word Vectors

Author

Listed:
  • Rodman, Emma

Abstract

Word vectorization is an emerging text-as-data method that shows great promise for automating the analysis of semantics—here, the cultural meanings of words—in large volumes of text. Yet successes with this method have largely been confined to massive corpora where the meanings of words are presumed to be fixed. In political science applications, however, many corpora are comparatively small and many interesting questions hinge on the recognition that meaning changes over time. Together, these two facts raise vexing methodological challenges. Can word vectors trace the changing cultural meanings of words in typical small corpora use cases? I test four time-sensitive implementations of word vectors (word2vec) against a gold standard developed from a modest data set of 161 years of newspaper coverage. I find that one implementation method clearly outperforms the others in matching human assessments of how public dialogues around equality in America have changed over time. In addition, I suggest best practices for using word2vec to study small corpora for time series questions, including bootstrap resampling of documents and pretraining of vectors. I close by showing that word2vec allows granular analysis of the changing meaning of words, an advance over other common text-as-data methods for semantic research questions.

Suggested Citation

  • Rodman, Emma, 2020. "A Timely Intervention: Tracking the Changing Meanings of Political Concepts with Word Vectors," Political Analysis, Cambridge University Press, vol. 28(1), pages 87-111, January.
  • Handle: RePEc:cup:polals:v:28:y:2020:i:1:p:87-111_5
    as

    Download full text from publisher

    File URL: https://www.cambridge.org/core/product/identifier/S1047198719000238/type/journal_article
    File Function: link to article abstract page
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stijn Daenekindt & Julian Schaap, 2022. "Using word embedding models to capture changing media discourses: a study on the role of legitimacy, gender and genre in 24,000 music reviews, 1999–2021," Journal of Computational Social Science, Springer, vol. 5(2), pages 1615-1636, November.
    2. Miguel Won & Jorge M. Fernandes, 2022. "Analyzing Twitter networks using graph embeddings: an application to the British case," Journal of Computational Social Science, Springer, vol. 5(1), pages 253-263, May.
    3. Gloria Gennaro & Elliott Ash, 2022. "Emotion and Reason in Political Language," The Economic Journal, Royal Economic Society, vol. 132(643), pages 1037-1059.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cup:polals:v:28:y:2020:i:1:p:87-111_5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kirk Stebbing (email available below). General contact details of provider: https://www.cambridge.org/pan .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.