IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v8y2025i3p53-d1689871.html
   My bibliography  Save this article

Distance-Based Relevance Function for Imbalanced Regression

Author

Listed:
  • Daniel Daeyoung In

    (Department of Statistics and Data Science, Yonsei University, Seoul 03722, Republic of Korea)

  • Hyunjoong Kim

    (Department of Statistics and Data Science, Yonsei University, Seoul 03722, Republic of Korea)

Abstract

Imbalanced regression poses a significant challenge in real-world prediction tasks, where rare target values are prone to overfitting during model training. To address this, prior research has employed relevance functions to quantify the rarity of target instances. However, existing functions often struggle to capture the rarity across diverse target distributions. In this study, we introduce a novel Distance-based Relevance Function (DRF) that quantifies the rarity based on the distance between target values, enabling a more accurate and distribution-agnostic assessment of rare data. This general approach allows imbalanced regression techniques to be effectively applied to a broader range of distributions, including bimodal cases. We evaluate the proposed DRF using Mean Squared Error (MSE), relevance-weighted Mean Absolute Error ( MAE ϕ ), and Symmetric Mean Absolute Percentage Error (SMAPE). Empirical studies on synthetic datasets and 18 real-world datasets demonstrate that DRF tends to improve the performance across various machine learning models, including support vector regression, neural networks, XGBoost, and random forests. These findings suggest that DRF offers a promising direction for rare target detection and broadens the applicability of imbalanced regression methods.

Suggested Citation

  • Daniel Daeyoung In & Hyunjoong Kim, 2025. "Distance-Based Relevance Function for Imbalanced Regression," Stats, MDPI, vol. 8(3), pages 1-14, June.
  • Handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:53-:d:1689871
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/8/3/53/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/8/3/53/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:53-:d:1689871. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.