IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0269845.html
   My bibliography  Save this article

A stochastic generative model for citation networks among academic papers

Author

Listed:
  • Yuichiro Yasui
  • Junji Nakano

Abstract

We propose a stochastic generative model to represent a directed graph constructed by citations among academic papers, where nodes and directed edges represent papers with discrete publication time and citations respectively. The proposed model assumes that a citation between two papers occurs with a probability based on the type of the citing paper, the importance of cited paper, and the difference between their publication times, like the existing models. We consider the out-degrees of citing paper as its type, because, for example, survey paper cites many papers. We approximate the importance of a cited paper by its in-degrees. In our model, we adopt three functions: a logistic function for illustrating the numbers of papers published in discrete time, an inverse Gaussian probability distribution function to express the aging effect based on the difference between publication times, and an exponential distribution (or a generalized Pareto distribution) for describing the out-degree distribution. We consider that our model is a more reasonable and appropriate stochastic model than other existing models and can perform complete simulations without using original data. In this paper, we first use the Web of Science database and see the features used in our model. By using the proposed model, we can generate simulated graphs and demonstrate that they are similar to the original data concerning the in- and out-degree distributions, and node triangle participation. In addition, we analyze two other citation networks derived from physics papers in the arXiv database and verify the effectiveness of the model.

Suggested Citation

  • Yuichiro Yasui & Junji Nakano, 2022. "A stochastic generative model for citation networks among academic papers," PLOS ONE, Public Library of Science, vol. 17(6), pages 1-16, June.
  • Handle: RePEc:plo:pone00:0269845
    DOI: 10.1371/journal.pone.0269845
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0269845
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0269845&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0269845?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. M. V. Simkin & V. P. Roychowdhury, 2005. "Stochastic modeling of citation slips," Scientometrics, Springer;Akadémiai Kiadó, vol. 62(3), pages 367-384, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Ricker, 2017. "Letter to the Editor: About the quality and impact of scientific articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1851-1855, June.
    2. Bruno S. Frey & Katja Rost, 2010. "Do rankings reflect research quality?," Journal of Applied Economics, Universidad del CEMA, vol. 13, pages 1-38, May.
    3. James K. Wetterer, 2006. "Quotation error, citation copying, and ant extinctions in Madeira," Scientometrics, Springer;Akadémiai Kiadó, vol. 67(3), pages 351-372, June.
    4. Tol, Richard S.J., 2013. "The Matthew effect for cohorts of economists," Journal of Informetrics, Elsevier, vol. 7(2), pages 522-527.
    5. Bramoullé, Yann & Currarini, Sergio & Jackson, Matthew O. & Pin, Paolo & Rogers, Brian W., 2012. "Homophily and long-run integration in social networks," Journal of Economic Theory, Elsevier, vol. 147(5), pages 1754-1786.
    6. Xie, Zheng & Ouyang, Zhenzheng & Liu, Qi & Li, Jianping, 2016. "A geometric graph model for citation networks of exponentially growing scientific papers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 456(C), pages 167-175.
    7. Waltman, Ludo & van Eck, Nees Jan & Wouters, Paul, 2013. "Counting publications and citations: Is more always better?," Journal of Informetrics, Elsevier, vol. 7(3), pages 635-641.
    8. S. R. Goldberg & H. Anthony & T. S. Evans, 2015. "Modelling citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1577-1604, December.
    9. Christian Borghesi & Jean-Philippe Bouchaud, 2007. "Of songs and men: a model for multiple choice with herding," Quality & Quantity: International Journal of Methodology, Springer, vol. 41(4), pages 557-568, August.
    10. repec:plo:pone00:0184727 is not listed on IDEAS
    11. Clough, James R. & Evans, Tim S., 2016. "What is the dimension of citation space?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 448(C), pages 235-247.
    12. Pawel Sobkowicz, 2011. "Simulations of opinion changes in scientific communities," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(2), pages 233-250, May.
    13. Martin Ho & Henry CW Price & Tim S Evans & Eoin O'Sullivan, 2023. "Order in Innovation," Papers 2302.13076, arXiv.org.
    14. Brito, Ana C.M. & Silva, Filipi N. & Amancio, Diego R., 2021. "Associations between author-level metrics in subsequent time periods," Journal of Informetrics, Elsevier, vol. 15(4).
    15. Miroslav Nedelchev, 2017. "A Bibliometric Study Of Citations In Corporate Governance," Entrepreneurship, Faculty of Economics, SOUTH-WEST UNIVERSITY "NEOFIT RILSKI", BLAGOEVGRAD, vol. 5(2), pages 95-105.
    16. Wang, Jian, 2014. "Unpacking the Matthew effect in citations," Journal of Informetrics, Elsevier, vol. 8(2), pages 329-339.
    17. Liming Liang & Zhen Zhong & Ronald Rousseau, 2014. "Scientists’ referencing (mis)behavior revealed by the dissemination network of referencing errors," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(3), pages 1973-1986, December.
    18. Lin Zhang & Wolfgang Glänzel, 2017. "A citation-based cross-disciplinary study on literature aging: part I—the synchronous approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1573-1589, June.
    19. Katja Rost & Bruno S. Frey, 2011. "Quantitative and Qualitative Rankings of Scholars," Schmalenbach Business Review (sbr), LMU Munich School of Management, vol. 63(1), pages 63-91, January.
    20. Aaron Cumberledge & Neal Smith & Benjamin W. Riley, 2023. "Unverified history: an analysis of quotation accuracy in leading history journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4677-4687, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0269845. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.