
Follow the leader: Documents on the leading edge of semantic change get more citations

Author

Listed:
  • Sandeep Soni
  • Kristina Lerman
  • Jacob Eisenstein

Abstract

Diachronic word embeddings—vector representations of words over time—offer remarkable insights into the evolution of language and provide a tool for quantifying sociocultural change from text documents. Prior work has used such embeddings to identify shifts in the meaning of individual words. However, simply knowing that a word has changed in meaning is insufficient to identify the instances of word usage that convey the historical meaning or the newer meaning. In this study, we link diachronic word embeddings to documents, by situating those documents as leaders or laggards with respect to ongoing semantic changes. Specifically, we propose a novel method to quantify the degree of semantic progressiveness in each word usage, and then show how these usages can be aggregated to obtain scores for each document. We analyze two large collections of documents, representing legal opinions and scientific articles. Documents that are scored as semantically progressive receive a larger number of citations, indicating that they are especially influential. Our work thus provides a new technique for identifying lexical semantic leaders and demonstrates a new link between progressive use of language and influence in a citation network.
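The two steps named in the abstract, scoring each word usage and then aggregating usages into a document score, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not give the scoring function, so the sketch substitutes a simple cosine-similarity difference between a usage's context vector and the word's later versus earlier embedding, and it averages usage scores per document. The function names, the toy two-dimensional vectors, and the word "cell" are hypothetical and are not taken from the article.

```python
import math
from statistics import mean


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def usage_progressiveness(context_vec, early_vec, late_vec):
    """Assumed score: positive when the usage context sits closer to the
    later embedding of the word than to the earlier one."""
    return cosine(context_vec, late_vec) - cosine(context_vec, early_vec)


def document_score(usages, early, late):
    """Average the usage scores over all (word, context_vector) pairs in a
    document, skipping words that lack an embedding in either period."""
    scores = [
        usage_progressiveness(ctx, early[word], late[word])
        for word, ctx in usages
        if word in early and word in late
    ]
    return mean(scores) if scores else 0.0


# Toy embeddings for one word in two periods (hypothetical values).
early = {"cell": [1.0, 0.0]}
late = {"cell": [0.0, 1.0]}

# One document uses the word in a context near its later meaning,
# the other in a context near its earlier meaning.
leader_doc = [("cell", [0.1, 0.9])]
laggard_doc = [("cell", [0.9, 0.1])]

leader_score = document_score(leader_doc, early, late)
laggard_score = document_score(laggard_doc, early, late)
```

Under these assumptions the first document receives a positive score and the second a negative one. Relating such scores to citation counts would be a separate modeling step, which this sketch does not attempt.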

Suggested Citation

  • Sandeep Soni & Kristina Lerman & Jacob Eisenstein, 2021. "Follow the leader: Documents on the leading edge of semantic change get more citations," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(4), pages 478-492, April.
  • Handle: RePEc:bla:jinfst:v:72:y:2021:i:4:p:478-492
    DOI: 10.1002/asi.24421

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24421
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jay Bhattacharya & Mikko Packalen, 2020. "Stagnation and Scientific Incentives," NBER Working Papers 26752, National Bureau of Economic Research, Inc.
    2. Antonio De Nicola & Gregorio D’Agostino, 2021. "Assessment of gender divide in scientific communities," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 3807-3840, May.
    3. Yanto Chandra, 2018. "Mapping the evolution of entrepreneurship as a field of research (1990–2013): A scientometric analysis," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-24, January.
    4. Jeong, Yujin & Park, Inchae & Yoon, Byungun, 2019. "Identifying emerging Research and Business Development (R&BD) areas based on topic modeling and visualization with intellectual property right data," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 655-672.
    5. Bettina Becker & Martin Theuringer, 2000. "Macroeconomic Determinants of Contingent Protection: The Case of the European Union," IWP Discussion Paper Series 02/2000, Institute for Economic Policy, Cologne, Germany.
    6. Luiz Paulo Fávero & Joseph F. Hair & Rafael de Freitas Souza & Matheus Albergaria & Talles V. Brugni, 2021. "Zero-Inflated Generalized Linear Mixed Models: A Better Way to Understand Data Relationships," Mathematics, MDPI, vol. 9(10), pages 1-28, May.
    7. Kun Sun & Rong Wang, 2022. "The Evolutionary Pattern of Language in English Fiction Over the Last Two Centuries: Insights From Linguistic Concreteness and Imageability," SAGE Open, vol. 12(1), pages 21582440211, January.
    8. Rui Baptista & Joana Mendonça, 2010. "Proximity to knowledge sources and the location of knowledge-based start-ups," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 45(1), pages 5-29, August.
    9. Boncinelli, Fabio & Bartolini, Fabio & Casini, Leonardo, 2018. "Structural factors of labour allocation for farm diversification activities," Land Use Policy, Elsevier, vol. 71(C), pages 204-212.
    10. Steven F. Kreft & Nancy M. Epling, 2007. "Do border crossings contribute to underage motor‐vehicle fatalities? An analysis of Michigan border crossings," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 40(3), pages 765-781, August.
    11. Marian-Gabriel Hâncean & Matjaž Perc & Jürgen Lerner, 2021. "The coauthorship networks of the most productive European researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 201-224, January.
    12. Stefan Feuerriegel & Mateusz Dolata & Gerhard Schwabe, 2020. "Fair AI," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 62(4), pages 379-384, August.
    13. Monika Stachowiak-Kudła & Janusz Kudła, 2023. "Measuring the prestige of administrative courts," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3637-3662, August.
    14. Minchul Lee & Min Song, 2020. "Incorporating citation impact into analysis of research trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(2), pages 1191-1224, August.
    15. Greene, William, 2007. "Functional Form and Heterogeneity in Models for Count Data," Foundations and Trends(R) in Econometrics, now publishers, vol. 1(2), pages 113-218, August.
    16. Christopher J. W. Zorn, 1998. "An Analytic and Empirical Examination of Zero-Inflated and Hurdle Poisson Specifications," Sociological Methods & Research, vol. 26(3), pages 368-400, February.
    17. Jim Millington, 2000. "Migration and Age: The Effect of Age on Sensitivity to Migration Stimuli," Regional Studies, Taylor & Francis Journals, vol. 34(6), pages 521-533.
    18. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    19. Ash, Elliott & Durante, Ruben & Grebenshchikova, Mariia & Schwarz, Carlo, 2022. "Visual Representation and Stereotypes in News Media," CEPR Discussion Papers 16624, C.E.P.R. Discussion Papers.
    20. Boubaker, Sabri & Labégorre, Florence, 2008. "Ownership structure, corporate governance and analyst following: A study of French listed firms," Journal of Banking & Finance, Elsevier, vol. 32(6), pages 961-976, June.
