Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i13p1574-d588273.html

A Cascade Deep Forest Model for Breast Cancer Subtype Classification Using Multi-Omics Data

Authors

  • Ala’a El-Nabawy

    (Orange Labs., Smart Village 12511, Giza Governorate, Egypt
    These authors contributed equally to this work.)

  • Nahla A. Belal

    (College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Smart Village, Giza 12577, Egypt
    College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Aswan 81531, Egypt
    These authors contributed equally to this work.)

  • Nashwa El-Bendary

    (College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Smart Village, Giza 12577, Egypt
    College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Aswan 81531, Egypt
    These authors contributed equally to this work.)

Abstract

Automated diagnosis systems aim to reduce the cost of diagnosis while maintaining the same efficiency. Many methods have been used for breast cancer subtype classification. Some use a single data source, while others integrate several, which tends to reduce computational performance in exchange for accuracy. Breast cancer data, especially biological data, is known to be imbalanced, and large collections of histopathological images are lacking. Recent studies have shown that the cascade Deep Forest ensemble model achieves classification accuracy competitive with alternatives such as general ensemble learning methods and conventional deep neural networks (DNNs), especially on imbalanced training sets, by learning hyper-representations through cascaded ensembles of decision trees. In this work, a cascade Deep Forest is employed to classify breast cancer subtypes, IntClust and Pam50, using multi-omics datasets and different configurations. The model achieved an accuracy of 83.45% for 5 subtypes and 77.55% for 10 subtypes. The significance of this work lies in showing that gene expression data alone, with the cascade Deep Forest classifier, achieves accuracy comparable to other techniques with higher computational performance: the recorded time is about 5 s for 10 subtypes and 7 s for 5 subtypes.
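
The cascade idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a toy, standard-library-only version in which each "forest" is a bag of randomized decision stumps, and every name in it (fit_stump, fit_forest, fit_cascade, predict, and all parameter values) is hypothetical. It shows only the core mechanism, namely that each level appends the class-probability vectors produced by its forests to the raw features before passing them to the next level. For brevity it uses in-sample probability vectors and a fixed number of levels, whereas Deep Forest as usually described generates those vectors by cross-validation and grows levels until validation accuracy stops improving.

```python
import random
from collections import Counter

def fit_stump(X, y, classes, rng):
    """One randomized stump: random feature, random threshold,
    and a class-frequency vector stored in each of the two leaves."""
    f = rng.randrange(len(X[0]))
    values = [row[f] for row in X]
    t = rng.uniform(min(values), max(values))
    leaves = {}
    for side in (False, True):
        labels = [lab for row, lab in zip(X, y) if (row[f] > t) == side]
        labels = labels or y  # an empty leaf falls back to the overall distribution
        counts = Counter(labels)
        leaves[side] = [counts[c] / len(labels) for c in classes]
    return f, t, leaves

def fit_forest(X, y, classes, n_trees, rng):
    """A toy 'forest': stumps fitted on bootstrap resamples of the data."""
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in X]
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx], classes, rng))
    return forest

def forest_proba(forest, row):
    """Average the leaf class-frequency vectors over all stumps."""
    probs = [0.0] * len(forest[0][2][True])
    for f, t, leaves in forest:
        for j, p in enumerate(leaves[row[f] > t]):
            probs[j] += p / len(forest)
    return probs

def augment(forests, raw, current):
    """Raw features followed by each forest's class-probability vector."""
    out = list(raw)
    for forest in forests:
        out += forest_proba(forest, current)
    return out

def fit_cascade(X, y, n_levels=2, forests_per_level=2, n_trees=25, seed=0):
    """Fit a fixed-depth cascade (assumption: no early stopping, no k-fold)."""
    rng = random.Random(seed)
    classes = sorted(set(y))
    levels, feats = [], X
    for _ in range(n_levels):
        forests = [fit_forest(feats, y, classes, n_trees, rng)
                   for _ in range(forests_per_level)]
        levels.append(forests)
        feats = [augment(forests, raw, cur) for raw, cur in zip(X, feats)]
    return classes, levels

def predict(model, row):
    """Pass one sample through every level; average the last level's vectors."""
    classes, levels = model
    current = list(row)
    for forests in levels:
        vecs = [forest_proba(forest, current) for forest in forests]
        current = list(row) + [p for v in vecs for p in v]
    avg = [sum(col) / len(vecs) for col in zip(*vecs)]
    return classes[avg.index(max(avg))]

# Synthetic two-class data (made up for illustration, not omics data).
X_toy = [[i * 0.05, i * 0.05] for i in range(20)] + \
        [[5 + i * 0.05, 5 + i * 0.05] for i in range(20)]
y_toy = [0] * 20 + [1] * 20
toy_model = fit_cascade(X_toy, y_toy)
print(predict(toy_model, [0.5, 0.5]), predict(toy_model, [5.5, 5.5]))
```

In a realistic setting the stumps would be replaced by full random forests and completely random forests, and the inputs would be gene expression or other omics feature vectors rather than the toy points used here.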

Suggested Citation

  • Ala’a El-Nabawy & Nahla A. Belal & Nashwa El-Bendary, 2021. "A Cascade Deep Forest Model for Breast Cancer Subtype Classification Using Multi-Omics Data," Mathematics, MDPI, vol. 9(13), pages 1-14, July.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:13:p:1574-:d:588273

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/13/1574/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/13/1574/
    Download Restriction: no