IDEAS home Printed from https://ideas.repec.org/a/hin/jijmms/6493399.html
   My bibliography  Save this article

Adaptive Multiclassification With Lung Cancer Types Using High-Dimensional Discriminant Analysis and Machine Learning Methods

Author

Listed:
  • Autcha Araveeporn

Abstract

This research investigates adaptive multiclassification methods for classifying lung cancer types using both high-dimensional discriminant analysis (HDDA) and widely used machine learning (ML) approaches under challenging data conditions, including high dimensionality, multicollinearity, outliers, and imbalanced classes. The dataset consists of 1000 gene expressions as explanatory variables and four lung cancer types as response variables, categorizing the problem as high-dimensional and imbalanced. HDDA introduces a statistically principled parametrization of the covariance matrix tailored for high-dimensional data. At the same time, ML methods such as Naïve Bayes, K-Nearest Neighbors, Support Vector Machine, Artificial Neural Network, and Random Forest offer flexible, data-driven alternatives. While previous studies have separately investigated either discriminant analysis or ML, there is a lack of comparative studies that evaluate their performance simultaneously under such complex conditions. This study addresses this gap by systematically analyzing both approaches with balanced and imbalanced gene expression data. The central hypothesis of this study is that HDDA, despite being a classical statistical technique, can achieve performance comparable to or complementary with ML methods when applied to gene expression data. Experiments on both original and balanced datasets, across varying subsets of explanatory variables, show that data balancing consistently improves accuracy, precision, and recall. Among ML methods, Random Forest achieves the highest predictive performance on balanced data, while HDDA provides competitive and interpretable results across scenarios.

Suggested Citation

  • Autcha Araveeporn, 2025. "Adaptive Multiclassification With Lung Cancer Types Using High-Dimensional Discriminant Analysis and Machine Learning Methods," International Journal of Mathematics and Mathematical Sciences, Hindawi, vol. 2025, pages 1-21, October.
  • Handle: RePEc:hin:jijmms:6493399
    DOI: 10.1155/ijmm/6493399
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/ijmms/2025/6493399.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/ijmms/2025/6493399.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/ijmm/6493399?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jijmms:6493399. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.