Author
Listed:
- Dengxi Hong
(College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)
- Zexin Fu
(College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)
- Xin Zhang
(College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)
- Yan Pan
(College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)
Abstract
This study investigates the development and application of the GDELT (Global Database of Events, Language, and Tone) news database. Through experiments, we conducted a quantitative statistical analysis of the GDELT event database to evaluate its practical characteristics. The results indicate that although the database achieves comprehensive coverage across all countries and regions and includes most major global media outlets, the accuracy rate of its key fields is only approximately 55%, with a data redundancy as high as 20%. Based on these findings, while the GDELT data demonstrates good coverage and data integrity, data correction and deduplication are recommended before its use in research contexts and industrial applications. Subsequently, a survey of the existing literature reveals that current studies using GDELT primarily focused on event-related metrics, such as event quantity, tone, and GoldsteinScale, for application in international relations analysis, crisis event prediction, policy effectiveness testing, and public opinion impact analysis. Nevertheless, news constitutes a fundamental channel of information dissemination in media networks, and the propagation of news events through these networks represents a critical area of study for information recommendation, public opinion guidance, and crisis intervention. Existing research has employed the Event, GKG, and Mentions tables to construct cross-national news flow network models. However, the informational correlations across different data table fields have not been fully leveraged in preliminary data selection, leading to substantial computational overhead. To advance research in this field, this study employs chained list queries on the Event and Mentions tables within GDELT. Using social network analysis, we constructed a media co-occurrence network of event reports, through which core hubs and associative relationships within the event dissemination network are identified.
Suggested Citation
Dengxi Hong & Zexin Fu & Xin Zhang & Yan Pan, 2025.
"Research on the Development and Application of the GDELT Event Database,"
Data, MDPI, vol. 10(10), pages 1-13, October.
Handle:
RePEc:gam:jdataj:v:10:y:2025:i:10:p:158-:d:1762908
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:10:y:2025:i:10:p:158-:d:1762908. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.