IDEAS home Printed from https://ideas.repec.org/a/spr/jcsosc/v6y2023i2d10.1007_s42001-023-00215-w.html
   My bibliography  Save this article

Between news and history: identifying networked topics of collective attention on Wikipedia

Author

Listed:
  • Patrick Gildersleve

    (London School of Economics and Political Science)

  • Renaud Lambiotte

    (University of Oxford)

  • Taha Yasseri

    (University College Dublin)

Abstract

The digital information landscape has introduced a new dimension to understanding how we collectively react to new information and preserve it at the societal level. This, together with the emergence of platforms such as Wikipedia, has challenged traditional views on the relationship between current events and historical accounts of events, with an ever-shrinking divide between “news” and “history”. Wikipedia’s place as the Internet’s primary reference work thus poses the question of how it represents both traditional encyclopaedic knowledge and evolving important news stories. In other words, how is information on and attention towards current events integrated into the existing topical structures of Wikipedia? To address this, we develop a temporal community detection approach towards topic detection that takes into account both short term dynamics of attention as well as long term article network structures. We apply this method to a dataset of one year of current events on Wikipedia to identify clusters of Wikipedia articles related to news events, distinct from those that would be found solely from page view time series correlations or static network structure. We are able to resolve the topics that more strongly reflect unfolding current events vs more established knowledge by the relative importance of collective attention dynamics vs link structures. We also offer important developments by identifying and describing the emergent topics on Wikipedia. This work provides a means of distinguishing how these information and attention clusters are related to Wikipedia’s twin faces of encyclopaedic knowledge and current events—crucial to understanding the production and consumption of knowledge in the digital age.

Suggested Citation

  • Patrick Gildersleve & Renaud Lambiotte & Taha Yasseri, 2023. "Between news and history: identifying networked topics of collective attention on Wikipedia," Journal of Computational Social Science, Springer, vol. 6(2), pages 845-875, October.
  • Handle: RePEc:spr:jcsosc:v:6:y:2023:i:2:d:10.1007_s42001-023-00215-w
    DOI: 10.1007/s42001-023-00215-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-023-00215-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42001-023-00215-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ewa S. Callahan & Susan C. Herring, 2011. "Cultural bias in Wikipedia content on famous persons," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(10), pages 1899-1915, October.
    2. Giovanni Luca Ciampaglia & Prashant Shiralkar & Luis M Rocha & Johan Bollen & Filippo Menczer & Alessandro Flammini, 2015. "Computational Fact Checking from Knowledge Networks," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-13, June.
    3. Mark Graham & Bernie Hogan & Ralph K. Straumann & Ahmed Medhat, 2014. "Uneven Geographies of User-Generated Information: Patterns of Increasing Informational Poverty," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 104(4), pages 746-764, July.
    4. Michael Scharkow, 2013. "Thematic content analysis using supervised machine learning: An empirical evaluation using German online news," Quality & Quantity: International Journal of Methodology, Springer, vol. 47(2), pages 761-773, February.
    5. Márton Mestyán & Taha Yasseri & János Kertész, 2013. "Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-8, August.
    6. Grimmer, Justin & Stewart, Brandon M., 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts," Political Analysis, Cambridge University Press, vol. 21(3), pages 267-297, July.
    7. Kai Zhu & Dylan Walker & Lev Muchnik, 2020. "Content Growth and Attention Contagion in Information Networks: Addressing Information Poverty on Wikipedia," Information Systems Research, INFORMS, vol. 31(2), pages 491-509, June.
    8. Cristian Candia & C. Jara-Figueroa & Carlos Rodriguez-Sickert & Albert-László Barabási & César A. Hidalgo, 2019. "The universal decay of collective memory and attention," Nature Human Behaviour, Nature, vol. 3(1), pages 82-91, January.
    9. Ewa S. Callahan & Susan C. Herring, 2011. "Cultural bias in Wikipedia content on famous persons," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 62(10), pages 1899-1915, October.
    10. Don Fallis, 2008. "Toward an epistemology of Wikipedia," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(10), pages 1662-1674, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolas Jullien, 2012. "What We Know About Wikipedia: A Review of the Literature Analyzing the Project(s)," Post-Print hal-00857208, HAL.
    2. Jaehun Joo & Ismatilla Normatov, 2013. "Determinants of collective intelligence quality: comparison between Wiki and Q&A services in English and Korean users," Service Business, Springer;Pan-Pacific Business Association, vol. 7(4), pages 687-711, December.
    3. Xiang Zheng & Jiajing Chen & Erjia Yan & Chaoqun Ni, 2023. "Gender and country biases in Wikipedia citations to scholarly publications," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(2), pages 219-233, February.
    4. Nicole Schwitter, 2023. "Bridging the offline and online: 20 years of offline meeting data of the German-language Wikipedia," Journal of Computational Social Science, Springer, vol. 6(2), pages 1103-1124, October.
    5. Eyal Eckhaus & Zachary Sheaffer, 2018. "Managerial hubris detection: the case of Enron," Risk Management, Palgrave Macmillan, vol. 20(4), pages 304-325, November.
    6. José Gustavo Góngora-Goloubintseff, 2020. "The Falklands/Malvinas war taken to the Wikipedia realm: a multimodal discourse analysis of cross-lingual violations of the Neutral Point of View," Palgrave Communications, Palgrave Macmillan, vol. 6(1), pages 1-9, December.
    7. Anna Kerkhof & Johannes Münster, 2021. "Detecting coverage bias in user-generated content," ECONtribute Discussion Papers Series 057, University of Bonn and University of Cologne, Germany.
    8. Yu Lim Lee & Minji Jung & Robert Jeyakumar Nathan & Jae-Eun Chung, 2020. "Cross-National Study on the Perception of the Korean Wave and Cultural Hybridity in Indonesia and Malaysia Using Discourse on Social Media," Sustainability, MDPI, vol. 12(15), pages 1-33, July.
    9. Kevin Crowston & Nicolas Jullien & Felipe Ortega, 2013. "Is Wikipedia Inefficient? Modelling Effort and Participation in Wikipedia," Post-Print hal-00947731, HAL.
    10. Anna Kerkhof & Johannes Münster, 2021. "Detecting Coverage Bias in User-Generated Content," CESifo Working Paper Series 8844, CESifo.
    11. Anton Oleinik, 2024. "A Bayesian index of association: comparison with other measures and performance," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 277-305, February.
    12. Dwaipayan Roy & Sumit Bhatia & Prateek Jain, 2022. "Information asymmetry in Wikipedia across different languages: A statistical analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(3), pages 347-361, March.
    13. Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
    14. Rauh, Christian, 2015. "Communicating supranational governance? The salience of EU affairs in the German Bundestag, 1991–2013," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 16(1), pages 116-138.
    15. Julia Seiermann, 2018. "Only Words? How Power in Trade Agreement Texts Affects International Trade Flows," UNCTAD Blue Series Papers 80, United Nations Conference on Trade and Development.
    16. Roger D. Magarey & Christina M. Trexler, 2020. "Information: a missing component in understanding and mitigating social epidemics," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-11, December.
    17. Arthur Dyevre & Nicolas Lampach, 2021. "Issue attention on international courts: Evidence from the European Court of Justice," The Review of International Organizations, Springer, vol. 16(4), pages 793-815, October.
    18. Dewenter, Ralf & Dulleck, Uwe & Thomas, Tobias, 2018. "The political coverage index and its application to government capture," Research Papers 6, EcoAustria – Institute for Economic Research.
    19. Pastwa, Anna M. & Shrestha, Prabal & Thewissen, James & Torsin, Wouter, 2021. "Unpacking the black box of ICO white papers: a topic modeling approach," LIDAM Discussion Papers LFIN 2021018, Université catholique de Louvain, Louvain Finance (LFIN).
    20. Maksym Polyakov & Morteza Chalak & Md. Sayed Iftekhar & Ram Pandit & Sorada Tapsuwan & Fan Zhang & Chunbo Ma, 2018. "Authorship, Collaboration, Topics, and Research Gaps in Environmental and Resource Economics 1991–2015," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 71(1), pages 217-239, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:6:y:2023:i:2:d:10.1007_s42001-023-00215-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.