Author
Abstract
Diagnosis of thyroid disease is a most important cause in the field of medicinal research and it is a complex onset axiom. Secretion of Thyroid hormone plays a major role in the regulation of metabolism. Hence, it is very significant to predict thyroid disease in the initial stage, which is helpful for preventing more serious health complications due to thyroid cancer. The diagnostic accuracy of machine leaning-based approaches is greater but these techniques require large amounts of data for the diagnosis process. In the conventional approaches, the time needed for the prediction process is also high. Feature engineering is less investigated in conventional models and hence error produced during the prediction process is high. Hence, in this research work, a machine learning-aided thyroid disease prediction technique is designed to provide higher prediction accuracy and reliability. Initially, the thyroid data is gathered from the standard benchmark resources. Next, the data transformation process is carried out to make the data usable for analysis and visualization. After, the features are extracted using Principal Component Analysis (PCA), “One-Dimensional Convolutional Neural Network Model (1DCNN). Moreover, the statistical features are also extracted for getting more relevant information from the data. The three sets of features such as PCA-based, 1DCNN-based and statistical are concatenated and fed to the “optimal weighted feature selection” process, where the optimal features and weights are tuned by an Improved Archimedes Optimization Algorithm (IAOA). Next, the selected optimally fused features are given to the Ensemble Learning (EL) for predicting the thyroid diseases, where the EL with be suggested by incorporating stacking classifier, XGboost, and Multivariate regression classifier. Ensembling of three different classifiers provides higher thyroid disease prediction accuracy and it makes the decision about normal and abnormal classes. Here, the same IAOA is used for optimizing the parameters of every classifier. The investigational outcomes demonstrate that the proposed ensemble classifier provides higher performance than others. Experimental results prove that the thyroid prediction accuracy of the developed EL approach is 96.30%, precision is 99.67% and F1-score is 97.93%, which is more extensive than the state-of-the-art approaches.
Suggested Citation
K. Hema Priya & K. Valarmathi, 2025.
"Big data-driven optimal weighted fused features-based ensemble learning classifier for thyroid prediction with heuristic algorithm,"
Journal of Combinatorial Optimization, Springer, vol. 49(4), pages 1-44, May.
Handle:
RePEc:spr:jcomop:v:49:y:2025:i:4:d:10.1007_s10878-025-01304-4
DOI: 10.1007/s10878-025-01304-4
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcomop:v:49:y:2025:i:4:d:10.1007_s10878-025-01304-4. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.