IDEAS home Printed from https://ideas.repec.org/a/anm/alpnmr/v9y2021i1p125-142.html
   My bibliography  Save this article

Classification of Cancer Types by Cluster Analysis Methods

Author

Listed:
  • Aynur İncekırık
  • Öznur İşçi Güneri
  • Burcu Durmuş

Abstract

Cluster analysis can be defined as the group of methods that aim to classify multivariate observations by using similarity/dissimilarity measures between observations. The clusters obtained as a result of the analysis are required to be homogeneous within themselves and heterogeneous among themselves. This study aims to cluster cancer types in datasets created by considering age group characteristics according to gender. In the study, clustering analysis was applied to four different datasets created from the data registered between 1982 and 2016 for 57 cancer types in men and women according to age groups at the Australian Institute of Health and Welfare, and the analysis results were evaluated and interpreted. In addition, in determining the clustering method and the number of clusters, Cophenetic correlation coefficients and 26 cluster validity indices were used, respectively. The distribution of cancer types in age groups determined by gender was observed in 4 different datasets created with 3 different age group characteristics that led to the best separation of cancer groups, and the clustering tendencies of cancers in the relevant age groups were investigated. R-3.5.1 package program was used for analyses. In this study, the analysis results of the k-means method and the average linkage method, which was decided to be the most successful method due to the high cophenetic correlation coefficient value, were evaluated and interpreted. The number of clusters was determined as 3 with the help of cluster validity indices. When the results obtained are examined, it is seen that breast cancer in women and prostate cancer in men is the most common type of cancer in the age group of 40 and above, and that these cancers are alone in a cluster. In addition, it is seen that the 0-14 age group characteristic fails to separate the clusters.

Suggested Citation

  • Aynur İncekırık & Öznur İşçi Güneri & Burcu Durmuş, 2021. "Classification of Cancer Types by Cluster Analysis Methods," Alphanumeric Journal, Bahadir Fatih Yildirim, vol. 9(1), pages 125-142, June.
  • Handle: RePEc:anm:alpnmr:v:9:y:2021:i:1:p:125-142
    DOI: http://dx.doi.org/10.17093/alphanumeric.949958
    as

    Download full text from publisher

    File URL: https://www.alphanumericjournal.com/media/Issue/volume-9-issue-1-2021/classification-of-cancer-types-by-cluster-analysis-methods.pdf
    Download Restriction: no

    File URL: https://alphanumericjournal.com/article/classification-of-cancer-types-by-cluster-analysis-methods
    Download Restriction: no

    File URL: https://libkey.io/http://dx.doi.org/10.17093/alphanumeric.949958?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    Cancer Types; Cluster Analysis; Cluster Validity İndex; Cophenetic Correlation Coefficient; K-means;
    All these keywords.

    JEL classification:

    • C01 - Mathematical and Quantitative Methods - - General - - - Econometrics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:anm:alpnmr:v:9:y:2021:i:1:p:125-142. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Bahadir Fatih Yildirim (email available below). General contact details of provider: https://www.alphanumericjournal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.