
Pattern detection in bipartite networks: A review of terminology, applications, and methods

Author

Listed:
  • Zachary P Neal
  • Annabell Cadieux
  • Diego Garlaschelli
  • Nicholas J Gotelli
  • Fabio Saracco
  • Tiziano Squartini
  • Shade T Shutters
  • Werner Ulrich
  • Guanyang Wang
  • Giovanni Strona

Abstract

Rectangular association matrices with binary (0/1) entries are a common data structure in many research fields, including ecology, economics, mathematics, physics, and psychometrics. Because their columns and rows are associated with distinct entities, these matrices can be equivalently expressed as bipartite networks that, in turn, can be projected onto pairs of unipartite networks. A variety of diversity statistics and network metrics can be used to quantify patterns in these matrices and networks. But to qualify as patterns, what should they be compared to? In all of these disciplines, researchers have recognized the necessity of comparing an empirical matrix to a benchmark ensemble of ‘null’ matrices created by randomizing certain elements of the original data. This common need has nevertheless promoted the independent development of methodologies by researchers who come from different backgrounds and use different terminology. Here, we provide a multidisciplinary review of randomization techniques and null models for matrices representing binary, bipartite networks. We aim to translate concepts from different technical domains into a common language that is accessible to a broad scientific audience. Specifically, after briefly reviewing examples of binary matrix structures encountered across different fields, we introduce the major approaches and strategies for randomizing these matrices. We then explore the details and performance of specific techniques and discuss their limitations and computational challenges. In particular, we focus on the conceptual importance and implementation of structural constraints on the randomization, such as preserving the row and/or column sums of the original matrix in each of the randomized matrices. Our review serves both as a guide for empiricists in different disciplines and as a reference point for researchers working on theoretical and methodological developments in matrix randomization methods.
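As an illustration of the kind of constrained randomization the abstract describes, the following minimal Python sketch shuffles a binary matrix while preserving every row sum and column sum. It uses repeated "checkerboard" swaps of 2×2 submatrices, one commonly used family of approaches; it is not presented as the procedure the authors recommend. The function names, the number of swap attempts, and the toy matrix are all assumptions made for this example.

```python
import random


def checkerboard_swap_randomize(matrix, n_attempts=1000, seed=0):
    """Return a shuffled copy of a 0/1 matrix with unchanged row and column sums.

    Illustrative sketch only: n_attempts and seed are arbitrary choices,
    and no claim is made about how uniformly the null matrices are sampled.
    """
    rng = random.Random(seed)
    m = [row[:] for row in matrix]  # work on a copy, leave the input intact
    n_rows, n_cols = len(m), len(m[0])
    if n_rows < 2 or n_cols < 2:
        return m  # no 2x2 submatrix exists, nothing to swap

    for _ in range(n_attempts):
        r1, r2 = rng.sample(range(n_rows), 2)
        c1, c2 = rng.sample(range(n_cols), 2)
        # A "checkerboard" is a 2x2 submatrix equal to [[1,0],[0,1]] or [[0,1],[1,0]].
        is_checkerboard = (
            m[r1][c1] == m[r2][c2]
            and m[r1][c2] == m[r2][c1]
            and m[r1][c1] != m[r1][c2]
        )
        if is_checkerboard:
            # Flipping the checkerboard keeps both row sums and both column sums.
            m[r1][c1], m[r1][c2] = m[r1][c2], m[r1][c1]
            m[r2][c1], m[r2][c2] = m[r2][c2], m[r2][c1]
    return m


def row_sums(m):
    return [sum(row) for row in m]


def col_sums(m):
    return [sum(col) for col in zip(*m)]


# Hypothetical 4x4 presence/absence matrix (rows and columns are distinct entity sets).
observed = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 1],
]
null = checkerboard_swap_randomize(observed, n_attempts=500, seed=42)
```

In a typical workflow, a statistic computed on the observed matrix would be compared against its distribution over many such null matrices, each generated with a different seed.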

Suggested Citation

  • Zachary P Neal & Annabell Cadieux & Diego Garlaschelli & Nicholas J Gotelli & Fabio Saracco & Tiziano Squartini & Shade T Shutters & Werner Ulrich & Guanyang Wang & Giovanni Strona, 2024. "Pattern detection in bipartite networks: A review of terminology, applications, and methods," PLOS Complex Systems, Public Library of Science, vol. 1(2), pages 1-34, October.
  • Handle: RePEc:plo:pcsy00:0000010
    DOI: 10.1371/journal.pcsy.0000010

    Download full text from publisher

    File URL: https://journals.plos.org/complexsystems/article?id=10.1371/journal.pcsy.0000010
    Download Restriction: no

    File URL: https://journals.plos.org/complexsystems/article/file?id=10.1371/journal.pcsy.0000010&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcsy.0000010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ramadiah, Amanah & Caccioli, Fabio & Fricke, Daniel, 2020. "Reconstructing and stress testing credit networks," Journal of Economic Dynamics and Control, Elsevier, vol. 111(C).
    2. Mika J. Straka & Guido Caldarelli & Tiziano Squartini & Fabio Saracco, 2017. "From Ecology to Finance (and Back?): Recent Advancements in the Analysis of Bipartite Networks," Papers 1710.10143, arXiv.org.
    3. Fessina, Massimiliano & Zaccaria, Andrea & Cimini, Giulio & Squartini, Tiziano, 2024. "Pattern-detection in the global automotive industry: A manufacturer-supplier-product network analysis," Chaos, Solitons & Fractals, Elsevier, vol. 181(C).
    4. DIODATO Dario, 2024. "Handbook of Economic Complexity for Policy," JRC Research Reports JRC138666, Joint Research Centre.
    5. Carolina Becatti & Guido Caldarelli & Renaud Lambiotte & Fabio Saracco, 2019. "Extracting significant signal of news consumption from social networks: the case of Twitter in Italian political elections," Palgrave Communications, Palgrave Macmillan, vol. 5(1), pages 1-16, December.
    6. Ramadiah, Amanah & Caccioli, Fabio & Fricke, Daniel, 2019. "Reconstructing and stress testing credit networks," LSE Research Online Documents on Economics 118938, London School of Economics and Political Science, LSE Library.
    7. Marco Bardoscia & Paolo Barucca & Stefano Battiston & Fabio Caccioli & Giulio Cimini & Diego Garlaschelli & Fabio Saracco & Tiziano Squartini & Guido Caldarelli, 2021. "The Physics of Financial Networks," Papers 2103.05623, arXiv.org.
    8. Frank Neffke & Angelica Sbardella & Ulrich Schetter & Andrea Tacchella, 2024. "Economic Complexity Analysis," Papers in Evolutionary Economic Geography (PEEG) 2430, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Oct 2024.
    9. Bernardo Caldarola & Dario Mazzilli & Lorenzo Napolitano & Aurelio Patelli & Angelica Sbardella, 2023. "Economic complexity and the sustainability transition: A review of data, methods, and literature," Papers 2308.07172, arXiv.org, revised Mar 2024.
    10. Tiziano Squartini & Guido Caldarelli & Giulio Cimini & Andrea Gabrielli & Diego Garlaschelli, 2018. "Reconstruction methods for networks: the case of economic and financial systems," Papers 1806.06941, arXiv.org.
    11. Wang, Chao & Liu, Xiaoxing & Chen, Boyi & Li, Menyu, 2023. "Topological properties of reconstructed credit networks and banking systemic risk," The North American Journal of Economics and Finance, Elsevier, vol. 66(C).
    12. Matteo Bruno & Dario Mazzilli & Aurelio Patelli & Tiziano Squartini & Fabio Saracco, 2023. "Inferring comparative advantage via entropy maximization," Papers 2304.12245, arXiv.org.
    13. Jeroen van Lidth de Jeude & Riccardo Di Clemente & Guido Caldarelli & Fabio Saracco & Tiziano Squartini, 2019. "Reconstructing Mesoscale Network Structures," Complexity, Hindawi, vol. 2019, pages 1-13, January.
    14. Sabrina Aufiero & Giordano De Marzo & Angelica Sbardella & Andrea Zaccaria, 2023. "Mapping job complexity and skills into wages," Papers 2304.05251, arXiv.org.
    15. Angelica Sbardella & Andrea Zaccaria & Luciano Pietronero & Pasquale Scaramozzino, 2021. "Behind the Italian Regional Divide: An Economic Fitness and Complexity Perspective," LEM Papers Series 2021/30, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    16. Alje van Dam & Koen Frenken, 2019. "Variety, Complexity and Economic Development," Papers 1903.07997, arXiv.org.
    17. Balland, Pierre-Alexandre & Broekel, Tom & Diodato, Dario & Giuliani, Elisa & Hausmann, Ricardo & O'Clery, Neave & Rigby, David, 2022. "Reprint of The new paradigm of economic complexity," Research Policy, Elsevier, vol. 51(8).
    18. Carattini, Stefano & Fankhauser, Sam & Gao, Jianjian & Gennaioli, Caterina & Panzarasa, Pietro, 2023. "What does network analysis teach us about international environmental cooperation?," Ecological Economics, Elsevier, vol. 205(C).
    19. Hartmann, Dominik & Guevara, Miguel R. & Jara-Figueroa, Cristian & Aristarán, Manuel & Hidalgo, César A., 2017. "Linking Economic Complexity, Institutions, and Income Inequality," World Development, Elsevier, vol. 93(C), pages 75-93.
    20. Pinheiro, Cristina, 2025. "Relatedness and economic complexity as tools for industrial policy: Insights and limitations," Structural Change and Economic Dynamics, Elsevier, vol. 72(C), pages 1-10.
