IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v105y2015i3d10.1007_s11192-015-1737-9.html
   My bibliography  Save this article

Modelling citation networks

Author

Listed:
  • S. R. Goldberg

    (Queen Mary University of London)

  • H. Anthony

    (Imperial College London)

  • T. S. Evans

    (Imperial College London)

Abstract

The distribution of the number of academic publications against citation count for papers published in the same year is remarkably similar from year to year. We characterise the shape of such distributions by a ‘width’, $$\sigma ^2$$ σ 2 , associated with fitting a log-normal to each distribution, and find the width to be approximately constant for publications published in different years. This similarity is not surprising, after all, why would papers in a given year be cited more than another year? Nevertheless, we show that simple citation models fail to capture this behaviour. We then provide a simple three parameter citation network model which can reproduce the correct width over time. We use the citation network of papers from the hep-th section of arXiv to test our model. Our final model reproduces the data’s observed ‘width’ when around 20 % of the citations in the model are made to recently published papers in the entire network (‘global information’). The remaining 80 % of citations are made using the references from these papers’ bibliographies (‘local searches’). We note that this is consistent with other studies, though our motivation to achieve the above distribution with time is very different. Finally, we find that, in the citation network model, varying the number of papers referenced by a new publication is important as it alters the parameters in the model which are fitted to the data. This is not addressed in current models and needs further work.

Suggested Citation

  • S. R. Goldberg & H. Anthony & T. S. Evans, 2015. "Modelling citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1577-1604, December.
  • Handle: RePEc:spr:scient:v:105:y:2015:i:3:d:10.1007_s11192-015-1737-9
    DOI: 10.1007/s11192-015-1737-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-015-1737-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-015-1737-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. S. Redner, 1998. "How popular is your paper? An empirical study of the citation distribution," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 4(2), pages 131-134, July.
    2. Michael J. Stringer & Marta Sales-Pardo & Luís A. Nunes Amaral, 2010. "Statistical validation of a global model for the distribution of the ultimate number of citations accrued by papers published in a scientific journal," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(7), pages 1377-1385, July.
    3. Michael J Stringer & Marta Sales-Pardo & Luís A Nunes Amaral, 2008. "Effectiveness of Journal Ranking Schemes as a Tool for Locating Information," PLOS ONE, Public Library of Science, vol. 3(2), pages 1-8, February.
    4. Wu, Yan & Fu, Tom Z.J. & Chiu, Dah Ming, 2014. "Generalized preferential attachment considering aging," Journal of Informetrics, Elsevier, vol. 8(3), pages 650-658.
    5. Sternitzke, Christian & Bartkowski, Adam & Schramm, Reinhard, 2008. "Visualizing patent statistics by means of social network analysis tools," World Patent Information, Elsevier, vol. 30(2), pages 115-131, June.
    6. M. V. Simkin & V. P. Roychowdhury, 2005. "Stochastic modeling of citation slips," Scientometrics, Springer;Akadémiai Kiadó, vol. 62(3), pages 367-384, March.
    7. Michal Brzezinski, 2015. "Power laws in citation distributions: evidence from Scopus," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(1), pages 213-228, April.
    8. T. S. Evans & N. Hopkins & B. S. Kaube, 2012. "Universality of performance indicators based on citation and reference counts," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(2), pages 473-495, November.
    9. Ren, Fu-Xin & Shen, Hua-Wei & Cheng, Xue-Qi, 2012. "Modeling the clustering in citation networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(12), pages 3533-3539.
    10. Young-Ho Eom & Santo Fortunato, 2011. "Characterizing and Modeling Citation Dynamics," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-7, September.
    11. Per O. Seglen, 1992. "The skewness of science," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 43(9), pages 628-638, October.
    12. Lawrence Smolinsky & Aaron Lercher & Andrew McDaniel, 2015. "Testing theories of preferential attachment in random networks of citations," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(10), pages 2132-2145, October.
    13. Thijs Pollman, 2000. "Forgetting and the Ageing of Scientific Publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 47(1), pages 43-54, January.
    14. Hajra, Kamalika Basu & Sen, Parongama, 2005. "Aging in citation networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 346(1), pages 44-48.
    15. van Raan, Anthony F.J., 2001. "Two-step competition process leads to quasi power-law income distributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 298(3), pages 530-536.
    16. Saramäki, Jari & Kaski, Kimmo, 2004. "Scale-free networks generated by random walkers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 341(C), pages 80-86.
    17. Derek De Solla Price, 1976. "A general theory of bibliometric and other cumulative advantage processes," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 27(5), pages 292-306, September.
    18. Ludo Waltman & Nees Jan van Eck & Anthony F. J. van Raan, 2012. "Universality of citation distributions revisited," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(1), pages 72-77, January.
    19. Mikhail V. Simkin & Vwani P. Roychowdhury, 2007. "A mathematical theory of citing," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 58(11), pages 1661-1673, September.
    20. Hajra, Kamalika Basu & Sen, Parongama, 2006. "Modelling aging characteristics in citation networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 368(2), pages 575-582.
    21. Ludo Waltman & Nees Jan van Eck & Anthony F. J. van Raan, 2012. "Universality of citation distributions revisited," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(1), pages 72-77, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zheng Xie & Zonglin Xie & Miao Li & Jianping Li & Dongyun Yi, 2017. "Modeling the coevolution between citations and coauthorship of scientific papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(1), pages 483-507, July.
    2. Yanzhu Hu & Huiyang Zhao & Xinbo Ai, 2016. "Inferring Weighted Directed Association Network from Multivariate Time Series with a Synthetic Method of Partial Symbolic Transfer Entropy Spectrum and Granger Causality," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-25, November.
    3. Maria Cristiana Martini & Elvira Pelle & Francesco Poggi & Andrea Sciandra, 2022. "The role of citation networks to explain academic promotions: an empirical analysis of the Italian national scientific qualification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(10), pages 5633-5659, October.
    4. Monachary Kammari & Durga Bhavani S, 2023. "Time-stamp based network evolution model for citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(6), pages 3723-3741, June.
    5. Tanya Araújo & Elsa Fontainha, 2018. "Are scientific memes inherited differently from gendered authorship?," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 953-972, November.
    6. Jelnov, Pavel & Weiss, Yoram, 2022. "Influence in economics and aging," Labour Economics, Elsevier, vol. 77(C).
    7. Xie, Zheng & Ouyang, Zhenzheng & Liu, Qi & Li, Jianping, 2016. "A geometric graph model for citation networks of exponentially growing scientific papers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 456(C), pages 167-175.
    8. Martin Ho & Henry CW Price & Tim S Evans & Eoin O'Sullivan, 2023. "Order in Innovation," Papers 2302.13076, arXiv.org.
    9. Xie, Zonglin & Xie, Zheng & Li, Jianping & Yang, Qian, 2018. "Exploring the influence of social activity on scientific career," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 500(C), pages 189-198.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Clough, James R. & Evans, Tim S., 2016. "What is the dimension of citation space?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 448(C), pages 235-247.
    2. Andrea Bonaccorsi & Cinzia Daraio & Stefano Fantoni & Viola Folli & Marco Leonetti & Giancarlo Ruocco, 2017. "Do social sciences and humanities behave like life and hard sciences?," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(1), pages 607-653, July.
    3. Yin, Yian & Wang, Dashun, 2017. "The time dimension of science: Connecting the past to the future," Journal of Informetrics, Elsevier, vol. 11(2), pages 608-621.
    4. Thelwall, Mike, 2016. "Are the discretised lognormal and hooked power law distributions plausible for citation data?," Journal of Informetrics, Elsevier, vol. 10(2), pages 454-470.
    5. Giancarlo Ruocco & Cinzia Daraio, 2013. "An empirical approach to compare the performance of heterogeneous academic fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 601-625, December.
    6. Tol, Richard S.J., 2013. "The Matthew effect for cohorts of economists," Journal of Informetrics, Elsevier, vol. 7(2), pages 522-527.
    7. Vîiu, Gabriel-Alexandru, 2018. "The lognormal distribution explains the remarkable pattern documented by characteristic scores and scales in scientometrics," Journal of Informetrics, Elsevier, vol. 12(2), pages 401-415.
    8. Thelwall, Mike, 2016. "The discretised lognormal and hooked power law distributions for complete citation data: Best options for modelling and regression," Journal of Informetrics, Elsevier, vol. 10(2), pages 336-346.
    9. Abramo, Giovanni & Cicero, Tindaro & D’Angelo, Ciriaco Andrea, 2012. "How important is choice of the scaling factor in standardizing citations?," Journal of Informetrics, Elsevier, vol. 6(4), pages 645-654.
    10. Xie, Zheng & Ouyang, Zhenzheng & Liu, Qi & Li, Jianping, 2016. "A geometric graph model for citation networks of exponentially growing scientific papers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 456(C), pages 167-175.
    11. Zhihui Zhang & Ying Cheng & Nian Cai Liu, 2015. "Improving the normalization effect of mean-based method from the perspective of optimization: optimization-based linear methods and their performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 587-607, January.
    12. Michal Brzezinski, 2015. "Power laws in citation distributions: evidence from Scopus," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(1), pages 213-228, April.
    13. Sangwal, Keshra, 2013. "Comparison of different mathematical functions for the analysis of citation distribution of papers of individual authors," Journal of Informetrics, Elsevier, vol. 7(1), pages 36-49.
    14. Brito, Ricardo & Rodríguez-Navarro, Alonso, 2018. "Research assessment by percentile-based double rank analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 315-329.
    15. Ren, Fu-Xin & Shen, Hua-Wei & Cheng, Xue-Qi, 2012. "Modeling the clustering in citation networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(12), pages 3533-3539.
    16. Thelwall, Mike, 2016. "Are there too many uncited articles? Zero inflated variants of the discretised lognormal and hooked power law distributions," Journal of Informetrics, Elsevier, vol. 10(2), pages 622-633.
    17. T. S. Evans & N. Hopkins & B. S. Kaube, 2012. "Universality of performance indicators based on citation and reference counts," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(2), pages 473-495, November.
    18. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2015. "On a formula for the h-index," Journal of Informetrics, Elsevier, vol. 9(4), pages 762-776.
    19. Lu Liu & Benjamin F. Jones & Brian Uzzi & Dashun Wang, 2023. "Data, measurement and empirical methods in the science of science," Nature Human Behaviour, Nature, vol. 7(7), pages 1046-1058, July.
    20. Thelwall, Mike, 2016. "Citation count distributions for large monodisciplinary journals," Journal of Informetrics, Elsevier, vol. 10(3), pages 863-874.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:105:y:2015:i:3:d:10.1007_s11192-015-1737-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.