IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v152y2020ics0167947320301420.html
   My bibliography  Save this article

Inference for a generalised stochastic block model with unknown number of blocks and non-conjugate edge models

Author

Listed:
  • Ludkin, Matthew

Abstract

The stochastic block model (SBM) is a popular model for capturing community structure and interaction within a network. Network data with non-Boolean edge weights is becoming commonplace; however, existing analysis methods convert such data to a binary representation to apply the SBM, leading to a loss of information. A generalisation of the SBM is considered, which allows edge weights to be modelled in their recorded state. An effective reversible jump Markov chain Monte Carlo sampler is proposed for estimating the parameters and the number of blocks for this generalised SBM. The methodology permits non-conjugate distributions for edge weights, which enable more flexible modelling than current methods as illustrated on synthetic data, a network of brain activity and an email communication network.

Suggested Citation

  • Ludkin, Matthew, 2020. "Inference for a generalised stochastic block model with unknown number of blocks and non-conjugate edge models," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
  • Handle: RePEc:eee:csdana:v:152:y:2020:i:c:s0167947320301420
    DOI: 10.1016/j.csda.2020.107051
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320301420
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2020.107051?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peter J. Green & Sylvia Richardson, 2001. "Modelling Heterogeneity With and Without the Dirichlet Process," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 28(2), pages 355-375, June.
    2. Christophe Ambroise & Catherine Matias, 2012. "New consistent and asymptotically normal parameter estimates for random‐graph mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(1), pages 3-35, January.
    3. Hoff P.D. & Raftery A.E. & Handcock M.S., 2002. "Latent Space Approaches to Social Network Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1090-1098, December.
    4. Jeffrey W. Miller & Matthew T. Harrison, 2018. "Mixture Models With a Prior on the Number of Components," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 340-356, January.
    5. Copic Jernej & Jackson Matthew O. & Kirman Alan, 2009. "Identifying Community Structures from Network Data via Maximum Likelihood Methods," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 9(1), pages 1-40, September.
    6. Catherine Matias & Vincent Miele, 2017. "Statistical clustering of temporal networks through a dynamic stochastic block model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1119-1141, September.
    7. Junxian Geng & Anirban Bhattacharya & Debdeep Pati, 2019. "Probabilistic Community Detection With Unknown Number of Communities," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 893-905, April.
    8. McDaid, Aaron F. & Murphy, Thomas Brendan & Friel, Nial & Hurley, Neil J., 2013. "Improved Bayesian inference for the stochastic block model with application to large networks," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 12-31.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tin Lok James Ng & Thomas Brendan Murphy, 2021. "Weighted stochastic block model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(5), pages 1365-1398, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Riccardo Rastelli & Michael Fop, 2020. "A stochastic block model for interaction lengths," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 485-512, June.
    2. Marino, Maria Francesca & Pandolfi, Silvia, 2022. "Hybrid maximum likelihood inference for stochastic block models," Computational Statistics & Data Analysis, Elsevier, vol. 171(C).
    3. Sylvia Frühwirth-Schnatter & Gertraud Malsiner-Walli, 2019. "From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 33-64, March.
    4. Prasenjit Ghosh & Debdeep Pati & Anirban Bhattacharya, 2020. "Posterior Contraction Rates for Stochastic Block Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(2), pages 448-476, August.
    5. Dragana M. Pavlović & Bryan R.L. Guillaume & Soroosh Afyouni & Thomas E. Nichols, 2020. "Multi‐subject stochastic blockmodels with mixed effects for adaptive analysis of individual differences in human brain network cluster structure," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 74(3), pages 363-396, August.
    6. Bartolucci, Francesco & Marino, Maria Francesca & Pandolfi, Silvia, 2018. "Dealing with reciprocity in dynamic stochastic block models," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 86-100.
    7. Catherine Matias & Vincent Miele, 2017. "Statistical clustering of temporal networks through a dynamic stochastic block model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1119-1141, September.
    8. Wei Zhao & S.N. Lahiri, 2022. "Estimation of the Parameters in an Expanding Dynamic Network Model," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 261-282, June.
    9. Joshua Daniel Loyal & Yuguo Chen, 2020. "Statistical Network Analysis: A Review with Applications to the Coronavirus Disease 2019 Pandemic," International Statistical Review, International Statistical Institute, vol. 88(2), pages 419-440, August.
    10. Hledik, Juraj & Rastelli, Riccardo, 2020. "A dynamic network model to measure exposure diversification in the Austrian interbank market," ESRB Working Paper Series 109, European Systemic Risk Board.
    11. Juraj Hledik & Riccardo Rastelli, 2018. "A dynamic network model to measure exposure diversification in the Austrian interbank market," Papers 1804.01367, arXiv.org, revised Aug 2018.
    12. Ian E. Fellows & Mark S. Handcock, 2023. "Modeling of networked populations when data is sampled or missing," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 21-35, April.
    13. Samrachana Adhikari & Beau Dabbs, 2018. "Social Network Analysis in R: A Software Review," Journal of Educational and Behavioral Statistics, , vol. 43(2), pages 225-253, April.
    14. Samrachana Adhikari & Tracy Sweet & Brian Junker, 2021. "Analysis of longitudinal advice‐seeking networks following implementation of high stakes testing," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1475-1500, October.
    15. Chung, Jaewon & Bridgeford, Eric & Arroyo, Jesus & Pedigo, Benjamin D. & Saad-Eldin, Ali & Gopalakrishnan, Vivek & Xiang, Liang & Priebe, Carey E. & Vogelstein, Joshua T., 2020. "Statistical Connectomics," OSF Preprints ek4n3, Center for Open Science.
    16. Falk Bräuning & Siem Jan Koopman, 2016. "The dynamic factor network model with an application to global credit risk," Working Papers 16-13, Federal Reserve Bank of Boston.
    17. Jamie Olson & Kathleen Carley, 2013. "Exact and approximate EM estimation of mutually exciting hawkes processes," Statistical Inference for Stochastic Processes, Springer, vol. 16(1), pages 63-80, April.
    18. Chih‐Sheng Hsieh & Lung‐Fei Lee & Vincent Boucher, 2020. "Specification and estimation of network formation and network interaction models with the exponential probability distribution," Quantitative Economics, Econometric Society, vol. 11(4), pages 1349-1390, November.
    19. Áureo de Paula, 2015. "Econometrics of network models," CeMMAP working papers CWP52/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    20. Jiang, Binyan & Li, Jialiang & Yao, Qiwei, 2023. "Autoregressive networks," LSE Research Online Documents on Economics 119983, London School of Economics and Political Science, LSE Library.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:152:y:2020:i:c:s0167947320301420. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.