
Semantic Similarity Measures in Newspaper Text for Detecting and Predicting Disruptive Institutional Events

Author

  • Mayoral, L.
  • Mueller, H.
  • Philipp, M.
  • Rauh, C.
  • Vassallo, R.

Abstract

This article proposes a semantic-similarity approach to detecting and predicting rare events in newspaper text and applies it to institutional disruptions. Using a global news corpus covering more than 170 countries, we measure the similarity of headlines to event-specific prototypes in embedding space and aggregate these signals to identify disruptions to political institutions. We combine these text-based measures with supervised nowcasting and targeted human verification to expand existing datasets on military coups, irregular term-limit extensions, and weakening of the judiciary. The resulting event data are then used to forecast the likelihood of disruptions up to 12 months ahead, providing a high-frequency and scalable tool for monitoring institutional risk. As an illustration of its empirical value, we document that coups are followed by large and persistent declines in economic growth. More broadly, the framework can be adapted to detect and track a wide range of economic and political events and policy actions from news text in real time and in historical archives.
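
The headline-scoring idea in the abstract can be illustrated with a minimal sketch. It assumes that headlines have already been turned into vectors by some text-embedding model (the abstract does not name one), that each event type is represented by a single prototype vector, and that signals are aggregated by taking the highest headline similarity per country-month. The toy vectors, the placeholder country codes, the function names `cosine_similarity` and `aggregate_by_country_month`, and the max-aggregation rule are all illustrative assumptions, not the authors' implementation.

```python
import math
from collections import defaultdict


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def aggregate_by_country_month(keys, scores):
    """Keep the highest headline score per (country, month) key.

    Taking the maximum is an assumed aggregation rule for illustration.
    """
    best = defaultdict(lambda: float("-inf"))
    for key, score in zip(keys, scores):
        best[key] = max(best[key], score)
    return dict(best)


# Toy 3-dimensional "embeddings"; real ones would come from an embedding model.
coup_prototype = [1.0, 0.0, 0.0]
headline_vectors = [
    [0.9, 0.1, 0.0],  # close to the prototype
    [0.0, 1.0, 0.0],  # orthogonal to the prototype
    [0.5, 0.5, 0.0],  # in between
]
# Hypothetical (country, month) labels, one per headline.
keys = [("AAA", "2020-01"), ("AAA", "2020-01"), ("BBB", "2020-01")]

scores = [cosine_similarity(vec, coup_prototype) for vec in headline_vectors]
signal = aggregate_by_country_month(keys, scores)
```

In this sketch, `signal` maps each country-month to its strongest match with the prototype. A downstream step, such as the supervised nowcasting and human verification the abstract mentions, would decide which of these signals correspond to actual events.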

Suggested Citation

  • Mayoral, L. & Mueller, H. & Philipp, M. & Rauh, C. & Vassallo, R., 2026. "Semantic Similarity Measures in Newspaper Text for Detecting and Predicting Disruptive Institutional Events," Cambridge Working Papers in Economics 2609, Faculty of Economics, University of Cambridge.
  • Handle: RePEc:cam:camdae:2609

    Download full text from publisher

    File URL: https://www.econ.cam.ac.uk/sites/default/files/publication-cwpe-pdfs/cwpe2609.pdf
    Download Restriction: no

    More about this item

    JEL classification:

    • C53: Mathematical and Quantitative Methods > Econometric Modeling > Forecasting and Prediction Models; Simulation Methods
    • C55: Mathematical and Quantitative Methods > Econometric Modeling > Large Data Sets: Modeling and Analysis
    • D72: Microeconomics > Analysis of Collective Decision-Making > Political Processes: Rent-seeking, Lobbying, Elections, Legislatures, and Voting Behavior
    • P16: Political Economy and Comparative Economic Systems > Capitalist Economies > Capitalist Institutions; Welfare State
