Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets

My bibliography Save this article

Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets

Author

Listed:

Marc-Andre Schulz
(Aachen University)
B. T. Thomas Yeo
(National University of Singapore
National University of Singapore
National University of Singapore)
Joshua T. Vogelstein
(Johns Hopkins University
Johns Hopkins University)
Janaina Mourao-Miranada
(University College London
University College London)
Jakob N. Kather
(University Hospital RWTH Aachen
German Cancer Research Center (DKFZ)
German Cancer Research Center (DKFZ))
Konrad Kording
(University of Pennsylvania)
Blake Richards
(McGill University
McGill University
Canadian Institute for Advanced Research
Mila - Quebec Artificial Intelligence Institute)
Danilo Bzdok
(Mila - Quebec Artificial Intelligence Institute
Commissariat à l’Energie Atomique (CEA) Saclay
Institut National de Recherche en Informatique et en Automatique (INRIA)
McGill University)

Registered:

Abstract

Recently, deep learning has unlocked unprecedented success in various domains, especially using images, text, and speech. However, deep learning is only beneficial if the data have nonlinear relationships and if they are exploitable at available sample sizes. We systematically profiled the performance of deep, kernel, and linear models as a function of sample size on UKBiobank brain images against established machine learning references. On MNIST and Zalando Fashion, prediction accuracy consistently improves when escalating from linear models to shallow-nonlinear models, and further improves with deep-nonlinear models. In contrast, using structural or functional brain scans, simple linear models perform on par with more complex, highly parameterized models in age/sex prediction across increasing sample sizes. In sum, linear models keep improving as the sample size approaches ~10,000 subjects. Yet, nonlinearities for predicting common phenotypes from typical brain scans remain largely inaccessible to the examined kernel and deep learning methods.

Suggested Citation

Marc-Andre Schulz & B. T. Thomas Yeo & Joshua T. Vogelstein & Janaina Mourao-Miranada & Jakob N. Kather & Konrad Kording & Blake Richards & Danilo Bzdok, 2020. "Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets," Nature Communications, Nature, vol. 11(1), pages 1-15, December.

Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-18037-z
DOI: 10.1038/s41467-020-18037-z

Download full text from publisher

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Fergus Imrie & Bogdan Cebere & Eoin F McKinney & Mihaela van der Schaar, 2023. "AutoPrognosis 2.0: Democratizing diagnostic and prognostic modeling in healthcare with automated machine learning," PLOS Digital Health, Public Library of Science, vol. 2(6), pages 1-21, June.
Andrea I. Luppi & Helena M. Gellersen & Zhen-Qi Liu & Alexander R. D. Peattie & Anne E. Manktelow & Ram Adapa & Adrian M. Owen & Lorina Naci & David K. Menon & Stavros I. Dimitriadis & Emmanuel A. Sta, 2024. "Systematic evaluation of fMRI data-processing pipelines for consistent functional connectomics," Nature Communications, Nature, vol. 15(1), pages 1-24, December.
Winn-Nuñez, Emily T. & Griffin, Maryclare & Crawford, Lorin, 2024. "A simple approach for local and global variable importance in nonlinear regression models," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
Jessica Dafflon & Pedro F. Da Costa & František Váša & Ricardo Pio Monti & Danilo Bzdok & Peter J. Hellyer & Federico Turkheimer & Jonathan Smallwood & Emily Jones & Robert Leech, 2022. "A guided multiverse study of neuroimaging analyses," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
Vieira, Bruno Hebling & Pamplona, Gustavo Santo Pedro & Fachinello, Karim & Silva, Alice Kamensek & Foss, Maria Paula & Salmon, Carlos Ernesto Garrido, 2022. "On the prediction of human intelligence from neuroimaging: A systematic review of methods and reporting," Intelligence, Elsevier, vol. 93(C).
Jianzhong Chen & Angela Tam & Valeria Kebets & Csaba Orban & Leon Qi Rong Ooi & Christopher L. Asplund & Scott Marek & Nico U. F. Dosenbach & Simon B. Eickhoff & Danilo Bzdok & Avram J. Holmes & B. T., 2022. "Shared and unique brain network features predict cognitive, personality, and mental health scores in the ABCD study," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
S. Parker Singleton & Andrea I. Luppi & Robin L. Carhart-Harris & Josephine Cruzat & Leor Roseman & David J. Nutt & Gustavo Deco & Morten L. Kringelbach & Emmanuel A. Stamatakis & Amy Kuceyeski, 2022. "Receptor-informed network control theory links LSD and psilocybin to a flattening of the brain’s control energy landscape," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-18037-z. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets

Author

Abstract

Suggested Citation

Download full text from publisher

Citations

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data