IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v10y2016i4p1037-1051.html
   My bibliography  Save this article

Clustering citation histories in the Physical Review

Author

Listed:
  • Colavizza, Giovanni
  • Franceschet, Massimo

Abstract

We investigate publications through their citation histories – the history events are the citations given to the article by younger publications and the time of the event is the date of publication of the citing article. We propose a methodology, based on spectral clustering, to group citation histories, and the corresponding publications, into communities and apply multinomial logistic regression to provide the revealed communities with semantics in terms of publication features. We study the case of publications from the full Physical Review archive, covering 120 years of physics in all its domains. We discover two clear archetypes of publications – marathoners and sprinters – that deviate from the average middle-of-the-roads behaviour, and discuss some publication features, like age of references and type of publication, that are correlated with the membership of a publication into a certain community.

Suggested Citation

  • Colavizza, Giovanni & Franceschet, Massimo, 2016. "Clustering citation histories in the Physical Review," Journal of Informetrics, Elsevier, vol. 10(4), pages 1037-1051.
  • Handle: RePEc:eee:infome:v:10:y:2016:i:4:p:1037-1051
    DOI: 10.1016/j.joi.2016.07.009
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157716301560
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2016.07.009?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rodrigo Costas & Thed N. van Leeuwen & Anthony F.J. van Raan, 2010. "Is scientific literature subject to a ‘Sell-By-Date’? A general methodology to analyze the ‘durability’ of scientific documents," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(2), pages 329-339, February.
    2. S. Redner, 1998. "How popular is your paper? An empirical study of the citation distribution," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 4(2), pages 131-134, July.
    3. Hendrik P. van Dalen & K?ne Henkens, 2005. "Signals in science - On the importance of signaling in gaining attention in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 64(2), pages 209-233, August.
    4. Zhenquan Lin & Shanci Hou & Jinshan Wu, 2016. "The correlation between editorial delay and the ratio of highly cited papers in Nature, Science and Physical Review Letters," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1457-1464, June.
    5. Jianjun Sun & Chao Min & Jiang Li, 2016. "A vector for measuring obsolescence of scientific articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 745-757, May.
    6. Anthony F. J. van Raan, 2004. "Sleeping Beauties in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 59(3), pages 467-472, March.
    7. Jonathan M. Levitt & Mike Thelwall, 2009. "The most highly cited Library and Information Science articles: Interdisciplinarity, first authors and citation patterns," Scientometrics, Springer;Akadémiai Kiadó, vol. 78(1), pages 45-67, January.
    8. Parolo, Pietro Della Briotta & Pan, Raj Kumar & Ghosh, Rumi & Huberman, Bernardo A. & Kaski, Kimmo & Fortunato, Santo, 2015. "Attention decay in science," Journal of Informetrics, Elsevier, vol. 9(4), pages 734-745.
    9. Susanne E. Baumgartner & Loet Leydesdorff, 2014. "Group-based trajectory modeling (GBTM) of citations in scholarly literature: Dynamic qualities of “transient” and “sticky knowledge claims”," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(4), pages 797-811, April.
    10. Dag W Aksnes, 2003. "Characteristics of highly cited papers," Research Evaluation, Oxford University Press, vol. 12(3), pages 159-170, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhang, Yi & Lu, Jie & Liu, Feng & Liu, Qian & Porter, Alan & Chen, Hongshu & Zhang, Guangquan, 2018. "Does deep learning help topic extraction? A kernel k-means clustering method with word embedding," Journal of Informetrics, Elsevier, vol. 12(4), pages 1099-1117.
    2. Liang, Guoqiang & Hou, Haiyan & Ding, Ying & Hu, Zhigang, 2020. "Knowledge recency to the birth of Nobel Prize-winning articles: Gender, career stage, and country," Journal of Informetrics, Elsevier, vol. 14(3).
    3. Zhang, Ruizhi & Wang, Jian & Mei, Yajun, 2017. "Search for evergreens in science: A functional data analysis," Journal of Informetrics, Elsevier, vol. 11(3), pages 629-644.
    4. Zhichao Fang & Rodrigo Costas, 2020. "Studying the accumulation velocity of altmetric data tracked by Altmetric.com," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 1077-1101, May.
    5. Jianhua Hou & Xiucai Yang, 2019. "Patent sleeping beauties: evolutionary trajectories and identification methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 187-215, July.
    6. Mayra Z Rodriguez & Cesar H Comin & Dalcimar Casanova & Odemir M Bruno & Diego R Amancio & Luciano da F Costa & Francisco A Rodrigues, 2019. "Clustering algorithms: A comparative approach," PLOS ONE, Public Library of Science, vol. 14(1), pages 1-34, January.
    7. Lutz Bornmann & Adam Y. Ye & Fred Y. Ye, 2017. "Sequence analysis of annually normalized citation counts: an empirical analysis based on the characteristic scores and scales (CSS) method," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1665-1680, December.
    8. Katchanov, Yurij L. & Markova, Yulia V., 2022. "Dynamics of senses of new physics discourse: Co-keywords analysis," Journal of Informetrics, Elsevier, vol. 16(1).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Onodera, Natsuo, 2016. "Properties of an index of citation durability of an article," Journal of Informetrics, Elsevier, vol. 10(4), pages 981-1004.
    2. Jianhua Hou & Xiucai Yang, 2019. "Patent sleeping beauties: evolutionary trajectories and identification methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 187-215, July.
    3. Li, Jiang & Shi, Dongbo & Zhao, Star X. & Ye, Fred Y., 2014. "A study of the “heartbeat spectra” for “sleeping beauties”," Journal of Informetrics, Elsevier, vol. 8(3), pages 493-502.
    4. Hou, Jianhua & Yang, Xiucai, 2020. "Social media-based sleeping beauties: Defining, identifying and features," Journal of Informetrics, Elsevier, vol. 14(2).
    5. You Song & Fangling Situ & Hongjun Zhu & Jinzhi Lei, 2018. "To be the Prince to wake up Sleeping Beauty: the rediscovery of the delayed recognition studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 9-24, October.
    6. Cristina López-Duarte & Marta M. Vidal-Suárez & Belén González-Díaz, 2019. "Cross-national distance and international business: an analysis of the most influential recent models," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 173-208, October.
    7. Rodrigo Costas & Thed N. Leeuwen & Anthony F. J. Raan, 2011. "The “Mendel syndrome” in science: durability of scientific literature and its effects on bibliometric analysis of individual scientists," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(1), pages 177-205, October.
    8. Jiang Li, 2014. "Citation curves of “all-elements-sleeping-beauties”: “flash in the pan” first and then “delayed recognition”," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(2), pages 595-601, August.
    9. Hui Fang, 2019. "A transition stage co-citation criterion for identifying the awakeners of sleeping beauty publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 307-322, October.
    10. Mingyang Wang & Guang Yu & Shuang An & Daren Yu, 2012. "Discovery of factors influencing citation impact based on a soft fuzzy rough set model," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(3), pages 635-644, December.
    11. Jianjun Sun & Chao Min & Jiang Li, 2016. "A vector for measuring obsolescence of scientific articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 745-757, May.
    12. Jiang Li & Fred Y. Ye, 2016. "Distinguishing sleeping beauties in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(2), pages 821-828, August.
    13. Costas, Rodrigo & van Leeuwen, Thed N. & van Raan, Anthony F.J., 2013. "Effects of the durability of scientific literature at the group level: Case study of chemistry research groups in the Netherlands," Research Policy, Elsevier, vol. 42(4), pages 886-894.
    14. Lachance, Christian & Larivière, Vincent, 2014. "On the citation lifecycle of papers with delayed recognition," Journal of Informetrics, Elsevier, vol. 8(4), pages 863-872.
    15. Hui Fang, 2018. "Analysing the variation tendencies of the numbers of yearly citations for sleeping beauties in science by using derivative analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 1051-1070, May.
    16. Yin, Yian & Wang, Dashun, 2017. "The time dimension of science: Connecting the past to the future," Journal of Informetrics, Elsevier, vol. 11(2), pages 608-621.
    17. Zhang, Ruizhi & Wang, Jian & Mei, Yajun, 2017. "Search for evergreens in science: A functional data analysis," Journal of Informetrics, Elsevier, vol. 11(3), pages 629-644.
    18. Jonathan M. Levitt & Mike Thelwall, 2009. "The most highly cited Library and Information Science articles: Interdisciplinarity, first authors and citation patterns," Scientometrics, Springer;Akadémiai Kiadó, vol. 78(1), pages 45-67, January.
    19. Jonathan M. Levitt & Mike Thelwall, 2008. "Patterns of annual citation of highly cited articles and the prediction of their citation ranking: A comparison across subjects," Scientometrics, Springer;Akadémiai Kiadó, vol. 77(1), pages 41-60, October.
    20. Mingyang Wang & Shi Li & Guangsheng Chen, 2017. "Detecting latent referential articles based on their vitality performance in the latest 2 years," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1557-1571, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:10:y:2016:i:4:p:1037-1051. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.