IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0342408.html

Random subspace-based ensemble classifier for high-dimensional data Using SPARK

Author

Listed:
  • Venkaiah Chowdary Bhimineni
  • Rajiv Senapati

Abstract

High-dimensional data classification remains challenging for machine learning models due to sparsity and overfitting caused by the ‘curse of dimensionality‘. As the number of features increases, data points become sparse, hindering generalization in classification and leading to higher computational costs and reduced accuracy. To address these issues, we propose an ensemble classifier based on random subspaces implemented in the Spark framework. The proposed framework comprises three key stages. First, the high-dimensional data is normalised through min-max normalisation. Second, the master node partitions the data by using improved deep fuzzy clustering (IDFC). In contrast, the slave node applies support vector machine-modified recursive feature elimination (SVM-MRFE) for efficient feature selection, followed by feature fusion. Finally, we introduced an improved subspace-based ensemble classifier (ISSBEC) that comprises a feature-fusion-based random subspace (FF-RSS), mixed-space enhancement (MSE), and multiple base classifiers. The efficacy of the ISSBEC classifier was evaluated using a set of performance metrics and compared with state-of-the-art methods. Experimental results demonstrate that the proposed approach improves both accuracy and robustness, offering a scalable solution to the limitations of high-dimensional datasets.

Suggested Citation

  • Venkaiah Chowdary Bhimineni & Rajiv Senapati, 2026. "Random subspace-based ensemble classifier for high-dimensional data Using SPARK," PLOS ONE, Public Library of Science, vol. 21(3), pages 1-26, March.
  • Handle: RePEc:plo:pone00:0342408
    DOI: 10.1371/journal.pone.0342408
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0342408
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0342408&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0342408?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0342408. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.