Scaling in Deep and Shallow Learning Architectures

My bibliography Save this article

Scaling in Deep and Shallow Learning Architectures

Author

Listed:

Koresh, Ella
Halevi, Tal
Meir, Yuval
Dilmoney, Dolev
Dror, Tamar
Gross, Ronit
Tevet, Ofek
Hodassman, Shiri
Kanter, Ido

Registered:

Abstract

The realization of classification tasks using deep learning is a primary goal of artificial intelligence; however, its possible universal behavior remains unexplored. Herein, we demonstrate a scaling behavior for the test error, ϵ, as a function of the number of classified labels, K. For trained utmost deep architectures on CIFAR-100 ϵ(K)∝Kρ with ρ∼1, and in case of reduced deep architectures, ρ continuously decreases until a crossover to ϵ(K)∝log(K) is observed for shallow architectures. A similar crossover is observed for shallow architectures, where the number of filters in the convolutional layers is proportionally increased. This unified the scaling behavior of deep and shallow architectures, which yields a reduced latency method. The dependence of Δϵ/ΔK on the trained architecture is expected to be crucial in learning scenarios involving dynamic number of labels.

Suggested Citation

Koresh, Ella & Halevi, Tal & Meir, Yuval & Dilmoney, Dolev & Dror, Tamar & Gross, Ronit & Tevet, Ofek & Hodassman, Shiri & Kanter, Ido, 2024. "Scaling in Deep and Shallow Learning Architectures," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 646(C).

Handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187
DOI: 10.1016/j.physa.2024.129909

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Blank, Aharon & Solomon, Sorin, 2000. "Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components)," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 287(1), pages 279-288.
Tevet, Ofek & Gross, Ronit D. & Hodassman, Shiri & Rogachevsky, Tal & Tzach, Yarden & Meir, Yuval & Kanter, Ido, 2024. "Efficient shallow learning mechanism as an alternative to deep learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 635(C).
Levy, Moshe & Solomon, Sorin, 1997. "New evidence for the power-law distribution of wealth," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 242(1), pages 90-94.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Meir, Yuval & Tevet, Ofek & Tzach, Yarden & Hodassman, Shiri & Kanter, Ido, 2024. "Role of delay in brain dynamics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 654(C).
Gross, Ronit & Koresh, Ella & Halevi, Tal & Hodassman, Shiri & Meir, Yuval & Tzach, Yarden & Kanter, Ido, 2025. "Multilabel classification outperforms detection-based technique," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 658(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Solomon, Sorin & Richmond, Peter, 2001. "Power laws of wealth, market order volumes and market returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(1), pages 188-197.
- Sorin Solomon & Peter Richmond, 2001. "Power Laws of Wealth, Market Order Volumes and Market Returns," Papers cond-mat/0102423, arXiv.org, revised Apr 2001.
Jan Schulz & Mishael Milaković, 2023. "How Wealthy are the Rich?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 69(1), pages 100-123, March.
- Schulz, Jan & Milaković, Mishael, 2020. "How wealthy are the rich?," BERG Working Paper Series 166, Bamberg University, Bamberg Economic Research Group.
Segarra, Agustí & Teruel, Mercedes, 2012. "An appraisal of firm size distribution: Does sample size matter?," Journal of Economic Behavior & Organization, Elsevier, vol. 82(1), pages 314-328.
Kwame Boamah‐Addo & Tomasz J. Kozubowski & Anna K. Panorska, 2023. "A discrete truncated Zipf distribution," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 77(2), pages 156-187, May.
E. Samanidou & E. Zschischang & D. Stauffer & T. Lux, 2001. "Microscopic Models of Financial Markets," Papers cond-mat/0110354, arXiv.org.
- Samanidou, Egle & Zschischang, Elmar & Stauffer, Dietrich & Lux, Thomas, 2006. "Microscopic models of financial markets," Economics Working Papers 2006-15, Christian-Albrechts-University of Kiel, Department of Economics.
Rama Cont & Jean-Philippe Bouchaud, 1997. "Herd behavior and aggregate fluctuations in financial markets," Science & Finance (CFM) working paper archive 500028, Science & Finance, Capital Fund Management.
Marco Raberto & Silvano Cincotti & Sergio Focardi & Michele Marchesi, 2003. "Traders' Long-Run Wealth in an Artificial Financial Market," Computational Economics, Springer;Society for Computational Economics, vol. 22(2), pages 255-272, October.
- Marco Raberto & Silvano Cincott & Sergio M. Focardi & Michele Marchesi, 2002. "Traders’ long-run wealth in an artificial financial market," Computing in Economics and Finance 2002 301, Society for Computational Economics.
Zhou, Bin & Yan, Xiao-Yong & Xu, Xiao-Ke & Xu, Xiao-Ting & Wang, Nianxin, 2018. "Evolutionary of online social networks driven by pareto wealth distribution and bidirectional preferential attachment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 507(C), pages 427-434.
Wu, Yahao & Wang, Xiao-Tian & Wu, Min, 2009. "Fractional-moment CAPM with loss aversion," Chaos, Solitons & Fractals, Elsevier, vol. 42(3), pages 1406-1414.
Castaldi, Carolina & Milakovic, Mishael, 2007. "Turnover activity in wealth portfolios," Journal of Economic Behavior & Organization, Elsevier, vol. 63(3), pages 537-552, July.
- Mishael Milakovic & Carolina Castaldi, 2004. "Turnover Activity in Wealth Portfolios," Computing in Economics and Finance 2004 120, Society for Computational Economics.
Bucsa, G. & Jovanovic, F. & Schinckus, C., 2011. "A unified model for price return distributions used in econophysics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(20), pages 3435-3443.
Cornelia Metzig & Mirta Gordon, 2012. "Heterogeneous Enterprises in a Macroeconomic Agent-Based Model," Papers 1211.5575, arXiv.org.
Andrea Bonaccorsi & Maurizio Martinelli & Cristina Rossi & Irma Serrecchia, 2002. "Measuring and modelling Internet diffusion using second level domains: the case of Italy," LEM Papers Series 2002/17, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
Philip Vermeulen, 2018. "How Fat is the Top Tail of the Wealth Distribution?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 64(2), pages 357-387, June.
- Vermeulen, Philip, 2014. "How fat is the top tail of the wealth distribution?," Working Paper Series 1692, European Central Bank.
Malevergne, Y. & Saichev, A. & Sornette, D., 2013. "Zipf's law and maximum sustainable growth," Journal of Economic Dynamics and Control, Elsevier, vol. 37(6), pages 1195-1212.
- Y. Malevergne & A. Saichev & D. Sornette, 2010. "Zipf's law and maximum sustainable growth," Papers 1012.0199, arXiv.org.
- Yannick Malevergne & Alex Saichev & Didier Sornette, 2013. "Zipf's law and maximum sustainable growth," Post-Print hal-02313060, HAL.
Becker, Bo & Cronqvist, Henrik & Fahlenbrach, Rüdiger, 2011. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 46(4), pages 907-942, August.
- Becker, Bo & Cronqvist, Henrik & Fahlenbrach, Rudiger, 2008. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," Working Paper Series 2008-9, Ohio State University, Charles A. Dice Center for Research in Financial Economics.
- Bo Becker & Henrik Cronqvist & Rüdiger Fahlenbrach, 2011. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," NBER Working Papers 17393, National Bureau of Economic Research, Inc.
- Bo Becker & Henrik Cronqvist & Rüdiger Fahlenbrach, 2009. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," Harvard Business School Working Papers 10-028, Harvard Business School, revised Feb 2010.
- Becker, Bo & Cronqvist, Henrik & Fahlenbrach, Rüdiger, 2008. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," SIFR Research Report Series 64, Institute for Financial Research.
Ren, F. & Zhang, Y.C., 2008. "Trading model with pair pattern strategies," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(22), pages 5523-5534.
Misha Perepelitsa, 2018. "A model of adaptive, market behavior generating positive returns, volatility and system risk," Papers 1809.09601, arXiv.org.
Meir, Yuval & Tevet, Ofek & Tzach, Yarden & Hodassman, Shiri & Kanter, Ido, 2024. "Role of delay in brain dynamics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 654(C).
Arun Advani & George Bangham & Jack Leslie, 2021. "The UK's wealth distribution and characteristics of high‐wealth households," Fiscal Studies, John Wiley & Sons, vol. 42(3-4), pages 397-430, September.
- Advani, Arun & Bangham, George & Leslie, Jack, 2021. "The UK's wealth distribution and characteristics of high-wealth households," CAGE Online Working Paper Series 576, Competitive Advantage in the Global Economy (CAGE).
- Advani, Arun & Bangham, George & Leslie, Jack, 2021. "The UK’s wealth distribution and characteristics of high-wealth households," The Warwick Economics Research Paper Series (TWERPS) 1367, University of Warwick, Department of Economics.
- Advani, Arun & Bangham, George & Leslie, Jack, 2021. "The UK's wealth distribution and characteristics of high-wealth households," LSE Research Online Documents on Economics 112698, London School of Economics and Political Science, LSE Library.

More about this item

Keywords

; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Scaling in Deep and Shallow Learning Architectures

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data