IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v646y2024ics0378437124004187.html
   My bibliography  Save this article

Scaling in Deep and Shallow Learning Architectures

Author

Listed:
  • Koresh, Ella
  • Halevi, Tal
  • Meir, Yuval
  • Dilmoney, Dolev
  • Dror, Tamar
  • Gross, Ronit
  • Tevet, Ofek
  • Hodassman, Shiri
  • Kanter, Ido

Abstract

The realization of classification tasks using deep learning is a primary goal of artificial intelligence; however, its possible universal behavior remains unexplored. Herein, we demonstrate a scaling behavior for the test error, ϵ, as a function of the number of classified labels, K. For trained utmost deep architectures on CIFAR-100 ϵ(K)∝Kρ with ρ∼1, and in case of reduced deep architectures, ρ continuously decreases until a crossover to ϵ(K)∝log(K) is observed for shallow architectures. A similar crossover is observed for shallow architectures, where the number of filters in the convolutional layers is proportionally increased. This unified the scaling behavior of deep and shallow architectures, which yields a reduced latency method. The dependence of Δϵ/ΔK on the trained architecture is expected to be crucial in learning scenarios involving dynamic number of labels.

Suggested Citation

  • Koresh, Ella & Halevi, Tal & Meir, Yuval & Dilmoney, Dolev & Dror, Tamar & Gross, Ronit & Tevet, Ofek & Hodassman, Shiri & Kanter, Ido, 2024. "Scaling in Deep and Shallow Learning Architectures," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 646(C).
  • Handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187
    DOI: 10.1016/j.physa.2024.129909
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437124004187
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2024.129909?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Blank, Aharon & Solomon, Sorin, 2000. "Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components)," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 287(1), pages 279-288.
    2. Tevet, Ofek & Gross, Ronit D. & Hodassman, Shiri & Rogachevsky, Tal & Tzach, Yarden & Meir, Yuval & Kanter, Ido, 2024. "Efficient shallow learning mechanism as an alternative to deep learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 635(C).
    3. Levy, Moshe & Solomon, Sorin, 1997. "New evidence for the power-law distribution of wealth," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 242(1), pages 90-94.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Meir, Yuval & Tevet, Ofek & Tzach, Yarden & Hodassman, Shiri & Kanter, Ido, 2024. "Role of delay in brain dynamics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 654(C).
    2. Gross, Ronit & Koresh, Ella & Halevi, Tal & Hodassman, Shiri & Meir, Yuval & Tzach, Yarden & Kanter, Ido, 2025. "Multilabel classification outperforms detection-based technique," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 658(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Solomon, Sorin & Richmond, Peter, 2001. "Power laws of wealth, market order volumes and market returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(1), pages 188-197.
    2. Jan Schulz & Mishael Milaković, 2023. "How Wealthy are the Rich?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 69(1), pages 100-123, March.
    3. Segarra, Agustí & Teruel, Mercedes, 2012. "An appraisal of firm size distribution: Does sample size matter?," Journal of Economic Behavior & Organization, Elsevier, vol. 82(1), pages 314-328.
    4. Kwame Boamah‐Addo & Tomasz J. Kozubowski & Anna K. Panorska, 2023. "A discrete truncated Zipf distribution," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 77(2), pages 156-187, May.
    5. Ricardo Lopez-Ruiz & Elyas Shivanian & Jose-Luis Lopez, 2013. "Random Market Models with an H-Theorem," Papers 1307.2169, arXiv.org, revised Jul 2014.
    6. E. Samanidou & E. Zschischang & D. Stauffer & T. Lux, 2001. "Microscopic Models of Financial Markets," Papers cond-mat/0110354, arXiv.org.
    7. Esteban Rossi-Hansberg & Mark L. J. Wright, 2007. "Urban Structure and Growth," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(2), pages 597-624.
    8. Wang, Cheng-Jun & Wu, Lingfei, 2016. "The scaling of attention networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 448(C), pages 196-204.
    9. Rama Cont & Jean-Philippe Bouchaud, 1997. "Herd behavior and aggregate fluctuations in financial markets," Science & Finance (CFM) working paper archive 500028, Science & Finance, Capital Fund Management.
    10. Wang, Yuanjun & You, Shibing, 2016. "An alternative method for modeling the size distribution of top wealth," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 457(C), pages 443-453.
    11. de Wit, Gerrit, 2005. "Firm size distributions: An overview of steady-state distributions resulting from firm dynamics models," International Journal of Industrial Organization, Elsevier, vol. 23(5-6), pages 423-450, June.
    12. Marco Raberto & Silvano Cincotti & Sergio Focardi & Michele Marchesi, 2003. "Traders' Long-Run Wealth in an Artificial Financial Market," Computational Economics, Springer;Society for Computational Economics, vol. 22(2), pages 255-272, October.
    13. Zhou, Bin & Yan, Xiao-Yong & Xu, Xiao-Ke & Xu, Xiao-Ting & Wang, Nianxin, 2018. "Evolutionary of online social networks driven by pareto wealth distribution and bidirectional preferential attachment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 507(C), pages 427-434.
    14. Wright, Ian, 2009. "Implicit Microfoundations for Macroeconomics," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 3, pages 1-27.
    15. Tomson Ogwang, 2011. "Power laws in top wealth distributions: evidence from Canada," Empirical Economics, Springer, vol. 41(2), pages 473-486, October.
    16. Amaral, L.A.N. & Gopikrishnan, P. & Plerou, V. & Stanley, H.E., 2001. "A model for the growth dynamics of economic organizations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(1), pages 127-136.
    17. Ogwang, Tomson, 2013. "Is the wealth of the world’s billionaires Paretian?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(4), pages 757-762.
    18. Sorin Solomon & Moshe Levy, 2000. "Market Ecology, Pareto Wealth Distribution and Leptokurtic Returns in Microscopic Simulation of the LLS Stock Market Model," Papers cond-mat/0005416, arXiv.org.
    19. Goykhman, Mikhail, 2017. "Wealth dynamics in a sentiment-driven market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 488(C), pages 132-148.
    20. Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.