IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0319346.html
   My bibliography  Save this article

Centrality nearest-neighbor projected-distance regression (C-NPDR) feature selection for correlation-based predictors with application to resting-state fMRI study of major depressive disorder

Author

Listed:
  • Elizabeth Kresock
  • Bryan Dawkins
  • Henry Luttbeg
  • Yijie (Jamie) Li
  • Rayus Kuplicki
  • B A McKinney

Abstract

Background: Nearest-neighbor projected-distance regression (NPDR) is a metric-based machine learning feature selection algorithm that uses distances between samples and projected differences between variables to identify variables or features that may interact to affect the prediction of complex outcomes. Typical tabular bioinformatics data consist of separate variables of interest, such as genes or proteins. In contrast, resting-state functional MRI (rs-fMRI) data are composed of time-series for brain regions of interest (ROIs) for each subject, and these within-brain time-series are typically transformed into correlations between pairs of ROIs. These pairs of variables of interest can then be used as inputs for feature selection or other machine learning methods. Straightforward feature selection would return the most significant pairs of ROIs; however, it would also be beneficial to know the importance of individual ROIs. Results: We extend NPDR to compute the importance of individual ROIs from correlation-based features. We introduce correlation-difference and centrality-based versions of NPDR. Centrality-based NPDR can be coupled with any centrality method and can be coupled with importance scores other than NPDR, such as random forest importance scores. We develop a new simulation method using random network theory to generate artificial correlation data predictors with variations in correlations that affect class prediction. Conclusions: We compared feature selection methods based on detection of functional simulated ROIs, and we applied the new centrality NPDR approach to a resting-state fMRI study of major depressive disorder (MDD) participants and healthy controls. We determined that the areas of the brain that have the strongest network effect on MDD include the middle temporal gyrus, the inferior temporal gyrus, and the dorsal entorhinal cortex. The resulting feature selection and simulation approaches can be applied to other domains that use correlation-based features.

Suggested Citation

  • Elizabeth Kresock & Bryan Dawkins & Henry Luttbeg & Yijie (Jamie) Li & Rayus Kuplicki & B A McKinney, 2025. "Centrality nearest-neighbor projected-distance regression (C-NPDR) feature selection for correlation-based predictors with application to resting-state fMRI study of major depressive disorder," PLOS ONE, Public Library of Science, vol. 20(3), pages 1-13, March.
  • Handle: RePEc:plo:pone00:0319346
    DOI: 10.1371/journal.pone.0319346
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0319346
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0319346&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0319346?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0319346. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.