IDEAS home Printed from https://ideas.repec.org/a/gam/jforec/v4y2022i1p15-274d748368.html
   My bibliography  Save this article

Explainable Ensemble Machine Learning for Breast Cancer Diagnosis Based on Ultrasound Image Texture Features

Author

Listed:
  • Alireza Rezazadeh

    (Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55414, USA)

  • Yasamin Jafarian

    (Department Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55414, USA)

  • Ali Kord

    (Division of Interventional Radiology, Department of Radiology, University of Cincinnati, Cincinnati, OH 45221, USA)

Abstract

Image classification is widely used to build predictive models for breast cancer diagnosis. Most existing approaches overwhelmingly rely on deep convolutional networks to build such diagnosis pipelines. These model architectures, although remarkable in performance, are black-box systems that provide minimal insight into the inner logic behind their predictions. This is a major drawback as the explainability of prediction is vital for applications such as cancer diagnosis. In this paper, we address this issue by proposing an explainable machine learning pipeline for breast cancer diagnosis based on ultrasound images. We extract first- and second-order texture features of the ultrasound images and use them to build a probabilistic ensemble of decision tree classifiers. Each decision tree learns to classify the input ultrasound image by learning a set of robust decision thresholds for texture features of the image. The decision path of the model predictions can then be interpreted by decomposing the learned decision trees. Our results show that our proposed framework achieves high predictive performance while being explainable.

Suggested Citation

  • Alireza Rezazadeh & Yasamin Jafarian & Ali Kord, 2022. "Explainable Ensemble Machine Learning for Breast Cancer Diagnosis Based on Ultrasound Image Texture Features," Forecasting, MDPI, vol. 4(1), pages 1-13, February.
  • Handle: RePEc:gam:jforec:v:4:y:2022:i:1:p:15-274:d:748368
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-9394/4/1/15/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-9394/4/1/15/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sun, Xiaolei & Liu, Mingxi & Sima, Zeqian, 2020. "A novel cryptocurrency price trend forecasting model based on LightGBM," Finance Research Letters, Elsevier, vol. 32(C).
    2. Mukund Sundararajan & Amir Najmi, 2019. "The many Shapley values for model explanation," Papers 1908.08474, arXiv.org, revised Feb 2020.
    3. Alireza Rezazadeh, 2020. "A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine-Learning Approach," Forecasting, MDPI, vol. 2(3), pages 1-17, August.
    4. Davide Castelvecchi, 2016. "Can we open the black box of AI?," Nature, Nature, vol. 538(7623), pages 20-23, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yamashiro, Hirochika & Nonaka, Hirofumi, 2021. "Estimation of processing time using machine learning and real factory data for optimization of parallel machine scheduling problem," Operations Research Perspectives, Elsevier, vol. 8(C).
    2. Sonia Leva, 2021. "Editorial for Special Issue: “Feature Papers of Forecasting”," Forecasting, MDPI, vol. 3(1), pages 1-3, February.
    3. Aurelio F. Bariviera & Ignasi Merediz‐Solà, 2021. "Where Do We Stand In Cryptocurrencies Economic Research? A Survey Based On Hybrid Analysis," Journal of Economic Surveys, Wiley Blackwell, vol. 35(2), pages 377-407, April.
    4. Feng, Qianqian & Sun, Xiaolei & Hao, Jun & Li, Jianping, 2021. "Predictability dynamics of multifactor-influenced installed capacity: A perspective of country clustering," Energy, Elsevier, vol. 214(C).
    5. Hu'e Sullivan & Hurlin Christophe & P'erignon Christophe & Saurin S'ebastien, 2022. "Measuring the Driving Forces of Predictive Performance: Application to Credit Scoring," Papers 2212.05866, arXiv.org, revised Jun 2023.
    6. Hugh Chen & Scott M. Lundberg & Su-In Lee, 2022. "Explaining a series of models by propagating Shapley values," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    7. Hsu, Chia-Wei & Xiong, Rui & Chen, Nan-Yow & Li, Ju & Tsou, Nien-Ti, 2022. "Deep neural network battery life and voltage prediction by using data of one cycle only," Applied Energy, Elsevier, vol. 306(PB).
    8. Goodell, John W. & Ben Jabeur, Sami & Saâdaoui, Foued & Nasir, Muhammad Ali, 2023. "Explainable artificial intelligence modeling to forecast bitcoin prices," International Review of Financial Analysis, Elsevier, vol. 88(C).
    9. Vaia I. Kontopoulou & Athanasios D. Panagopoulos & Ioannis Kakkos & George K. Matsopoulos, 2023. "A Review of ARIMA vs. Machine Learning Approaches for Time Series Forecasting in Data Driven Networks," Future Internet, MDPI, vol. 15(8), pages 1-31, July.
    10. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2021. "Cohort Shapley value for algorithmic fairness," Papers 2105.07168, arXiv.org.
    11. Laith T. Khrais, 2020. "Role of Artificial Intelligence in Shaping Consumer Demand in E-Commerce," Future Internet, MDPI, vol. 12(12), pages 1-14, December.
    12. Johannes Habel & Sascha Alavi & Nicolas Heinitz, 2023. "A theory of predictive sales analytics adoption," AMS Review, Springer;Academy of Marketing Science, vol. 13(1), pages 34-54, June.
    13. Wenguang Zhang & Ting Lei & Yu Gong & Jun Zhang & Yirong Wu, 2022. "Using Explainable Artificial Intelligence to Identify Key Characteristics of Deep Poverty for Each Household," Sustainability, MDPI, vol. 14(16), pages 1-21, August.
    14. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2022. "Variable importance without impossible data," Papers 2205.15750, arXiv.org, revised Apr 2023.
    15. Hakan Pabuccu & Adrian Barbu, 2023. "Feature Selection with Annealing for Forecasting Financial Time Series," Papers 2303.02223, arXiv.org, revised Feb 2024.
    16. Martin Eling & Davide Nuessle & Julian Staubli, 2022. "The impact of artificial intelligence along the insurance value chain and on the insurability of risks," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 47(2), pages 205-241, April.
    17. Canellas, Marc & Haga, Rachel, 2017. "Framing Human-Automation Regulation: A New Modus Operandi from Cognitive Engineering," LawArXiv yu2h3, Center for Open Science.
    18. Gao, Yuan & Miyata, Shohei & Akashi, Yasunori, 2023. "How to improve the application potential of deep learning model in HVAC fault diagnosis: Based on pruning and interpretable deep learning method," Applied Energy, Elsevier, vol. 348(C).
    19. Alwosheel, Ahmad & van Cranenburgh, Sander & Chorus, Caspar G., 2018. "Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis," Journal of choice modelling, Elsevier, vol. 28(C), pages 167-182.
    20. Aras, Serkan & Hanifi Van, M., 2022. "An interpretable forecasting framework for energy consumption and CO2 emissions," Applied Energy, Elsevier, vol. 328(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jforec:v:4:y:2022:i:1:p:15-274:d:748368. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.