IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0227804.html
   My bibliography  Save this article

Exponential random graph model parameter estimation for very large directed networks

Author

Listed:
  • Alex Stivala
  • Garry Robins
  • Alessandro Lomi

Abstract

Exponential random graph models (ERGMs) are widely used for modeling social networks observed at one point in time. However the computational difficulty of ERGM parameter estimation has limited the practical application of this class of models to relatively small networks, up to a few thousand nodes at most, with usually only a few hundred nodes or fewer. In the case of undirected networks, snowball sampling can be used to find ERGM parameter estimates of larger networks via network samples, and recently published improvements in ERGM network distribution sampling and ERGM estimation algorithms have allowed ERGM parameter estimates of undirected networks with over one hundred thousand nodes to be made. However the implementations of these algorithms to date have been limited in their scalability, and also restricted to undirected networks. Here we describe an implementation of the recently published Equilibrium Expectation (EE) algorithm for ERGM parameter estimation of large directed networks. We test it on some simulated networks, and demonstrate its application to an online social network with over 1.6 million nodes.

Suggested Citation

  • Alex Stivala & Garry Robins & Alessandro Lomi, 2020. "Exponential random graph model parameter estimation for very large directed networks," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-21, January.
  • Handle: RePEc:plo:pone00:0227804
    DOI: 10.1371/journal.pone.0227804
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0227804
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0227804&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0227804?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Babkin, Sergii & Stewart, Jonathan R. & Long, Xiaochen & Schweinberger, Michael, 2020. "Large-scale estimation of random graph models with local dependence," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    2. Jones, Galin L. & Haran, Murali & Caffo, Brian S. & Neath, Ronald, 2006. "Fixed-Width Output Analysis for Markov Chain Monte Carlo," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1537-1547, December.
    3. Krivitsky, Pavel N., 2017. "Using contrastive divergence to seed Monte Carlo MLE for exponential-family random graph models," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 149-161.
    4. Anna D. Broido & Aaron Clauset, 2019. "Scale-free networks are rare," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
    5. Hunter, David R. & Handcock, Mark S. & Butts, Carter T. & Goodreau, Steven M. & Morris, Martina, 2008. "ergm: A Package to Fit, Simulate and Diagnose Exponential-Family Models for Networks," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 24(i03).
    6. Morris, Martina & Handcock, Mark S. & Hunter, David R., 2008. "Specification of Exponential-Family Random Graph Models: Terms and Computational Aspects," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 24(i04).
    7. Handcock, Mark S. & Hunter, David R. & Butts, Carter T. & Goodreau, Steven M. & Morris, Martina, 2008. "statnet: Software Tools for the Representation, Visualization, Analysis and Simulation of Network Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 24(i01).
    8. Dootika Vats & James M Flegal & Galin L Jones, 2019. "Multivariate output analysis for Markov chain Monte Carlo," Biometrika, Biometrika Trust, vol. 106(2), pages 321-337.
    9. Gillespie, Colin S., 2015. "Fitting Heavy Tailed Distributions: The poweRlaw Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i02).
    10. Hunter, David R. & Goodreau, Steven M. & Handcock, Mark S., 2008. "Goodness of Fit of Social Network Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 248-258, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Duncan A. Clark & Mark S. Handcock, 2022. "Comparing the real‐world performance of exponential‐family random graph models and latent order logistic models for social network analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(2), pages 566-587, April.
    2. Juan Li & Keyin Liu & Zixin Yang & Yi Qu, 2023. "Evolution and Impacting Factors of Global Renewable Energy Products Trade Network: An Empirical Investigation Based on ERGM Model," Sustainability, MDPI, vol. 15(11), pages 1-27, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Duxbury, Scott W, 2019. "Mediation and Moderation in Statistical Network Models," SocArXiv 9bs4u, Center for Open Science.
    2. Goodreau, Steven M. & Handcock, Mark S. & Hunter, David R. & Butts, Carter T. & Morris, Martina, 2008. "A statnet Tutorial," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 24(i09).
    3. Emily Casleton & Daniel J. Nordman & Mark S. Kaiser, 2022. "Modeling Transitivity in Local Structure Graph Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 389-417, June.
    4. Liu, Linqing & Shen, Mengyun & Sun, Da & Yan, Xiaofei & Hu, Shi, 2022. "Preferential attachment, R&D expenditure and the evolution of international trade networks from the perspective of complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 603(C).
    5. Kei, Yik Lun & Chen, Yanzhen & Madrid Padilla, Oscar Hernan, 2023. "A partially separable model for dynamic valued networks," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    6. Vishesh Karwa & Pavel N. Krivitsky & Aleksandra B. Slavković, 2017. "Sharing social network data: differentially private estimation of exponential family random-graph models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 481-500, April.
    7. Hunter, David R. & Goodreau, Steven M. & Handcock, Mark S., 2013. "ergm.userterms: A Template Package for Extending statnet," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 52(i02).
    8. Ashish Arora & Michelle Gittelman & Sarah Kaplan & John Lynch & Will Mitchell & Nicolaj Siggelkow & Ji Youn (Rose) Kim & Michael Howard & Emily Cox Pahnke & Warren Boeker, 2016. "Understanding network formation in strategy research: Exponential random graph models," Strategic Management Journal, Wiley Blackwell, vol. 37(1), pages 22-44, January.
    9. Yonghong Ma & Xiaomeng Yang & Sen Qu & Lingkai Kong, 2022. "Research on the formation mechanism of big data technology cooperation networks: empirical evidence from China," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1273-1294, March.
    10. John McLevey & Alexander V. Graham & Reid McIlroy-Young & Pierson Browne & Kathryn S. Plaisance, 2018. "Interdisciplinarity and insularity in the diffusion of knowledge: an analysis of disciplinary boundaries between philosophy of science and the sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 331-349, October.
    11. Youyi Bi & Yunjian Qiu & Zhenghui Sha & Mingxian Wang & Yan Fu & Noshir Contractor & Wei Chen, 2021. "Modeling Multi-Year Customers’ Considerations and Choices in China’s Auto Market Using Two-Stage Bipartite Network Analysis," Networks and Spatial Economics, Springer, vol. 21(2), pages 365-385, June.
    12. Krivitsky, Pavel N., 2017. "Using contrastive divergence to seed Monte Carlo MLE for exponential-family random graph models," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 149-161.
    13. Cornelius Fritz & Michael Lebacher & Göran Kauermann, 2020. "Tempus volat, hora fugit: A survey of tie‐oriented dynamic network models in discrete and continuous time," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 74(3), pages 275-299, August.
    14. Cody J. Dey & James S. Quinn, 2014. "Individual attributes and self-organizational processes affect dominance network structure in pukeko," Behavioral Ecology, International Society for Behavioral Ecology, vol. 25(6), pages 1402-1408.
    15. Milad Abbasiharofteh & Tom Broekel, 2021. "Still in the shadow of the wall? The case of the Berlin biotechnology cluster," Environment and Planning A, , vol. 53(1), pages 73-94, February.
    16. Tom Broekel & Marcel Bednarz, 2018. "Disentangling link formation and dissolution in spatial networks: An Application of a Two-Mode STERGM to a Project-Based R&D Network in the German Biotechnology Industry," Networks and Spatial Economics, Springer, vol. 18(3), pages 677-704, September.
    17. Nolan E. Phillips & Brian L. Levy & Robert J. Sampson & Mario L. Small & Ryan Q. Wang, 2021. "The Social Integration of American Cities: Network Measures of Connectedness Based on Everyday Mobility Across Neighborhoods," Sociological Methods & Research, , vol. 50(3), pages 1110-1149, August.
    18. Chakraborty, Saptarshi & Bhattacharya, Suman K. & Khare, Kshitij, 2022. "Estimating accuracy of the MCMC variance estimator: Asymptotic normality for batch means estimators," Statistics & Probability Letters, Elsevier, vol. 183(C).
    19. repec:jss:jstsof:24:i09 is not listed on IDEAS
    20. Duncan A. Clark & Mark S. Handcock, 2022. "Comparing the real‐world performance of exponential‐family random graph models and latent order logistic models for social network analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(2), pages 566-587, April.
    21. Angel Ortiz-Pelaez & Getaneh Ashenafi & Francois Roger & Agnes Waret-Szkuta, 2012. "Can Geographical Factors Determine the Choices of Farmers in the Ethiopian Highlands to Trade in Livestock Markets?," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-11, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0227804. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.