IDEAS home Printed from https://ideas.repec.org/a/plo/pcsy00/0000014.html
   My bibliography  Save this article

Assembly Theory is an approximation to algorithmic complexity based on LZ compression that does not explain selection or evolution

Author

Listed:
  • Felipe S Abrahão
  • Santiago Hernández-Orozco
  • Narsis A Kiani
  • Jesper Tegnér
  • Hector Zenil

Abstract

We formally prove the equivalence between Assembly Theory (AT) and Shannon Entropy via a method based upon the principles of statistical compression that belongs to the LZ family of popular compression algorithms. Such popular lossless compression algorithms behind file formats such as ZIP and PNG have been shown to empirically reproduce the results that AT considers its cornerstone. The same results have also been reported before AT in successful application of other complexity measures in the areas covered by AT such as separating organic from non-organic molecules and in the context of the study of selection and evolution. We demonstrate that the assembly index is equivalent to the size of a minimal context-free grammar. The statistical compressibility of such a method is bounded by Shannon Entropy and other equivalent traditional LZ compression schemes, such as LZ77 and LZW. We also demonstrate that AT, and the algorithms supporting its pathway complexity, assembly index, and assembly number, define compression schemes and methods that are subsumed into algorithmic information theory. We conclude that the assembly index and the assembly number do not lead to an explanation or quantification of biases in generative (physical or biological) processes, including those brought about by (abiotic or biotic) selection and evolution, that could not have been arrived at using Shannon Entropy, or that have not been already reported before using classical information theory or algorithmic complexity.Author summary: Assembly Theory (AT) has recently been proposed in order to investigate the distinction between abiotic from biotic matter, while explaining and quantifying the presence of biosignatures, selection, and evolution. We previously have shown that AT cannot rule out false positives and that it has equal or worse performance in comparison to popular compression algorithms at counting exact copies in data without evidence that their compression mechanics are favoured over others. This article investigates these limitations and the many challenges of the theoretical foundations of AT. We demonstrate that AT’s complexity measures (both for individual assembled objects and ensembles of objects) are subsumed into algorithmic information theory. The calculated assembly index for an object in AT is equivalent to the size of a compressing context-free grammar, and its calculation method is an LZ compression scheme that cannot perform better than Shannon Entropy in stochastic scenarios and cannot deal with non-stochastic (generative/causal) ones. Although AT may contribute with a graph-like pedagogical approach to LZ compression in application to molecular complexity, this article disproves hyperbolic claims raised by the authors of AT that introduce AT as a novel method, fundamentally different from other complexity indexes, or as a breakthrough. Instead, the principles behind AT are known elementary principles of complexity rehashed but introduced high logical inconsistency. AT lacks empirical evidence that it is different from or outperforms other complexity indexes in connection to selection, evolution or any of the applications in which the authors of AT have promoted it as capable of explaining physical and biological phenomena.

Suggested Citation

  • Felipe S Abrahão & Santiago Hernández-Orozco & Narsis A Kiani & Jesper Tegnér & Hector Zenil, 2024. "Assembly Theory is an approximation to algorithmic complexity based on LZ compression that does not explain selection or evolution," PLOS Complex Systems, Public Library of Science, vol. 1(1), pages 1-20, September.
  • Handle: RePEc:plo:pcsy00:0000014
    DOI: 10.1371/journal.pcsy.0000014
    as

    Download full text from publisher

    File URL: https://journals.plos.org/complexsystems/article?id=10.1371/journal.pcsy.0000014
    Download Restriction: no

    File URL: https://journals.plos.org/complexsystems/article/file?id=10.1371/journal.pcsy.0000014&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcsy.0000014?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stuart M. Marshall & Cole Mathis & Emma Carrick & Graham Keenan & Geoffrey J. T. Cooper & Heather Graham & Matthew Craven & Piotr S. Gromski & Douglas G. Moore & Sara. I. Walker & Leroy Cronin, 2021. "Identifying molecules as biosignatures with assembly theory and mass spectrometry," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    2. Jean-Paul Delahaye & Hector Zenil, 2012. "Numerical Evaluation of Algorithmic Complexity for Short Strings: A Glance into the Innermost Structure of Randomness," Post-Print hal-00825530, HAL.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dingle, Kamaludin & Kamal, Rafiq & Hamzi, Boumediene, 2023. "A note on a priori forecasting and simplicity bias in time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
    2. Brandouy, Olivier & Delahaye, Jean-Paul & Ma, Lin & Zenil, Hector, 2014. "Algorithmic complexity of financial motions," Research in International Business and Finance, Elsevier, vol. 30(C), pages 336-347.
    3. Zenil, Hector & Soler-Toscano, Fernando & Dingle, Kamaludin & Louis, Ard A., 2014. "Correlation of automorphism group size and topological properties with program-size complexity evaluations of graphs and complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 404(C), pages 341-358.
    4. Daniel Wilson-Nunn & Hector Zenil, 2014. "On the Complexity and Behaviour of Cryptocurrencies Compared to Other Markets," Papers 1411.1924, arXiv.org.
    5. repec:plo:pcbi00:1005408 is not listed on IDEAS
    6. Fernando Soler-Toscano & Hector Zenil & Jean-Paul Delahaye & Nicolas Gauvrit, 2014. "Calculating Kolmogorov Complexity from the Output Frequency Distributions of Small Turing Machines," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-18, May.
    7. Ekin Ilseven & Phanish Puranam, 2025. "(Not) by chance? An application of Assembly Theory to infer non-randomness in organizational design," Journal of Organization Design, Springer;Organizational Design Community, vol. 14(2), pages 155-166, June.
    8. Koudai Taguchi & Alexis Gilbert & Barbara Sherwood Lollar & Thomas Giunta & Christopher J. Boreham & Qi Liu & Juske Horita & Yuichiro Ueno, 2022. "Low 13C-13C abundances in abiotic ethane," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    9. Fernando Soler-Toscano & Hector Zenil, 2017. "A Computable Measure of Algorithmic Probability by Finite Approximations with an Application to Integer Sequences," Complexity, Hindawi, vol. 2017, pages 1-10, December.
    10. Srinivasan, Aditya & Srinivasan, Arvind & Goodman, Michael R. & Riceberg, Justin S. & Guise, Kevin G. & Shapiro, Matthew L., 2023. "Hippocampal and Medial Prefrontal Cortex Fractal Spiking Patterns Encode Episodes and Rules," Chaos, Solitons & Fractals, Elsevier, vol. 171(C).
    11. Maxwell Murialdo & Arturo Cifuentes, 2022. "Quantifying Value with Effective Complexity," Journal of Interdisciplinary Economics, , vol. 34(1), pages 69-85, January.
    12. Mikołaj Morzy & Tomasz Kajdanowicz & Przemysław Kazienko, 2017. "On Measuring the Complexity of Networks: Kolmogorov Complexity versus Entropy," Complexity, Hindawi, vol. 2017, pages 1-12, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcsy00:0000014. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: complexsystem (email available below). General contact details of provider: https://journals.plos.org/complexsystems/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.