IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i23p12499-d689548.html
   My bibliography  Save this article

Developing a Stacked Ensemble-Based Classification Scheme to Predict Second Primary Cancers in Head and Neck Cancer Survivors

Author

Listed:
  • Chi-Chang Chang

    (School of Medical Informatics, Chung Shan Medical University & IT Office, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
    Department of Information Management, Ming Chuan University, Taoyuan 33300, Taiwan)

  • Tse-Hung Huang

    (Department of Traditional Chinese Medicine, Chang Gung Memorial Hospital, Keelung 20401, Taiwan
    School of Traditional Chinese Medicine, Chang Gung University, Taoyuan 33300, Taiwan
    School of Nursing, National Taipei University of Nursing and Health Sciences, Taipei 11200, Taiwan
    Graduate Institute of Health Industry Technology, Chang Gung University of Science and Technology, Taoyuan 33300, Taiwan)

  • Pei-Wei Shueng

    (Department of Radiology, Division of Radiation Oncology, Far Eastern Memorial Hospital, New Taipei 22060, Taiwan
    Faculty of Medicine, School of Medicine, National Yang Ming Chiao Tung University, Taipei 22060, Taiwan
    These authors contributed equally to this work.)

  • Ssu-Han Chen

    (Department of Industrial Engineering and Management, Ming Chi University of Technology, New Taipei 24330, Taiwan
    Center for Artificial Intelligence & Data Science, Ming Chi University of Technology, New Taipei 24330, Taiwan)

  • Chun-Chia Chen

    (Institute of Medicine, Chung Shan Medical University, Taichung 40201, Taiwan
    Department of Surgery, Division of Plastic Surgery, Chung Shan Medical University Hospital, Taichung 40201, Taiwan)

  • Chi-Jie Lu

    (Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei 242062, Taiwan
    Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei 242062, Taiwan
    Department of Information Management, Fu Jen Catholic University, New Taipei 242062, Taiwan)

  • Yi-Ju Tseng

    (Department of Information Management, National Central University, Taoyuan 32031, Taiwan)

Abstract

Despite a considerable expansion in the present therapeutic repertoire for other malignancy managements, mortality from head and neck cancer (HNC) has not significantly improved in recent decades. Moreover, the second primary cancer (SPC) diagnoses increased in patients with HNC, but studies providing evidence to support SPCs prediction in HNC are lacking. Several base classifiers are integrated forming an ensemble meta-classifier using a stacked ensemble method to predict SPCs and find out relevant risk features in patients with HNC. The balanced accuracy and area under the curve (AUC) are over 0.761 and 0.847, with an approximately 2% and 3% increase, respectively, compared to the best individual base classifier. Our study found the top six ensemble risk features, such as body mass index, primary site of HNC, clinical nodal (N) status, primary site surgical margins, sex, and pathologic nodal (N) status. This will help clinicians screen HNC survivors before SPCs occur.

Suggested Citation

  • Chi-Chang Chang & Tse-Hung Huang & Pei-Wei Shueng & Ssu-Han Chen & Chun-Chia Chen & Chi-Jie Lu & Yi-Ju Tseng, 2021. "Developing a Stacked Ensemble-Based Classification Scheme to Predict Second Primary Cancers in Head and Neck Cancer Survivors," IJERPH, MDPI, vol. 18(23), pages 1-10, November.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:23:p:12499-:d:689548
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/23/12499/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/23/12499/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Grubinger, Thomas & Zeileis, Achim & Pfeiffer, Karl-Peter, 2014. "evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i01).
    2. Kurt Hornik & Christian Buchta & Achim Zeileis, 2009. "Open-source machine learning: R meets Weka," Computational Statistics, Springer, vol. 24(2), pages 225-232, May.
    3. Chi-Chang Chang & Chun-Chia Chen & Chalong Cheewakriangkrai & Ying Chen Chen & Shun-Fa Yang, 2021. "Risk Prediction of Second Primary Endometrial Cancer in Obese Women: A Hospital-Based Cancer Registry Study," IJERPH, MDPI, vol. 18(17), pages 1-9, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hao-Yun Kao & Chi-Chang Chang & Chin-Fang Chang & Ying-Chen Chen & Chalong Cheewakriangkrai & Ya-Ling Tu, 2022. "Associations between Sex and Risk Factors for Predicting Chronic Kidney Disease," IJERPH, MDPI, vol. 19(3), pages 1-11, January.
    2. Souhila Ghanem & Raphaël Couturier & Pablo Gregori, 2021. "An Accurate and Easy to Interpret Binary Classifier Based on Association Rules Using Implication Intensity and Majority Vote," Mathematics, MDPI, vol. 9(12), pages 1-12, June.
    3. Emmanuel Jordy Menvouta & Jolien Ponnet & Robin Van Oirbeek & Tim Verdonck, 2022. "mCube: Multinomial Micro-level reserving Model," Papers 2212.00101, arXiv.org.
    4. Fernandez Martinez, Roberto & Lostado Lorza, Ruben & Santos Delgado, Ana Alexandra & Piedra, Nelson, 2021. "Use of classification trees and rule-based models to optimize the funding assignment to research projects: A case study of UTPL," Journal of Informetrics, Elsevier, vol. 15(1).
    5. Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.
    6. Hajko, Vladimír, 2017. "The failure of Energy-Economy Nexus: A meta-analysis of 104 studies," Energy, Elsevier, vol. 125(C), pages 771-787.
    7. Grubinger, Thomas & Zeileis, Achim & Pfeiffer, Karl-Peter, 2014. "evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i01).
    8. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    9. Yagli, Gokhan Mert & Yang, Dazhi & Srinivasan, Dipti, 2019. "Automatic hourly solar forecasting using machine learning models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 105(C), pages 487-498.
    10. Davide Natalini & Giangiacomo Bravo & Aled Wynne Jones, 2019. "Global food security and food riots – an agent-based modelling approach," Food Security: The Science, Sociology and Economics of Food Production and Access to Food, Springer;The International Society for Plant Pathology, vol. 11(5), pages 1153-1173, October.
    11. Yves Staudt & Joël Wagner, 2021. "Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance," Risks, MDPI, vol. 9(3), pages 1-28, March.
    12. Mariane S. Reis & Maria Isabel S. Escada & Luciano V. Dutra & Sidnei J. S. Sant’Anna & Nathan D. Vogt, 2018. "Towards a Reproducible LULC Hierarchical Class Legend for Use in the Southwest of Pará State, Brazil: A Comparison with Remote Sensing Data-Driven Hierarchies," Land, MDPI, vol. 7(2), pages 1-29, May.
    13. Federico Divina & Aude Gilson & Francisco Goméz-Vela & Miguel García Torres & José F. Torres, 2018. "Stacking Ensemble Learning for Short-Term Electricity Consumption Forecasting," Energies, MDPI, vol. 11(4), pages 1-31, April.
    14. Max Tabord-Meehan, 2018. "Stratification Trees for Adaptive Randomization in Randomized Controlled Trials," Papers 1806.05127, arXiv.org, revised Jul 2022.
    15. Anja Breuer & Yves Staudt, 2022. "Equalization Reserves for Reinsurance and Non-Life Undertakings in Switzerland," Risks, MDPI, vol. 10(3), pages 1-41, March.
    16. Patrick Rehill & Nicholas Biddle, 2022. "Policy learning for many outcomes of interest: Combining optimal policy trees with multi-objective Bayesian optimisation," Papers 2212.06312, arXiv.org, revised Oct 2023.
    17. Rashid Mehmood & Muhammad Riaz & Ronald Does, 2013. "Efficient power computation for r out of m runs rules schemes," Computational Statistics, Springer, vol. 28(2), pages 667-681, April.
    18. Alvarez-Iglesias, Alberto & Hinde, John & Ferguson, John & Newell, John, 2017. "An alternative pruning based approach to unbiased recursive partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 90-102.
    19. Stefano Castellana & Caterina Fusilli & Gianluigi Mazzoccoli & Tommaso Biagini & Daniele Capocefalo & Massimo Carella & Angelo Luigi Vescovi & Tommaso Mazza, 2017. "High-confidence assessment of functional impact of human mitochondrial non-synonymous genome variations by APOGEE," PLOS Computational Biology, Public Library of Science, vol. 13(6), pages 1-12, June.
    20. Maria Janicka & Bogumiła Pawluśkiewicz & Tomasz Gnatowski, 2023. "Preliminary Results of the Introduction of Dicotyledonous Meadow Species," Sustainability, MDPI, vol. 15(4), pages 1-21, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:23:p:12499-:d:689548. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.