
Identifying prognostic subgroups of luminal-A breast cancer using deep autoencoders and gene expressions

Author

Listed:
  • Seunghyun Wang
  • Doheon Lee

Abstract

Luminal-A breast cancer is the most frequently occurring breast cancer subtype and is characterized by high expression levels of hormone receptors. However, some luminal-A breast cancer patients suffer from intrinsic and/or acquired resistance to endocrine therapies, which are considered first-line treatments for luminal-A breast cancer. This heterogeneity within luminal-A breast cancer calls for a more precise stratification method. Hence, our study aims to identify prognostic subgroups of luminal-A breast cancer. In this study, we discovered two prognostic subgroups of luminal-A breast cancer (BPS-LumA and WPS-LumA) using deep autoencoders and gene expression profiles. The deep autoencoders were trained on gene expression profiles of 679 luminal-A breast cancer samples in the METABRIC dataset. The latent features of each sample generated by the deep autoencoders were then used for K-Means clustering to divide the samples into two subgroups, and Kaplan-Meier survival analysis was performed to compare prognosis (recurrence-free survival) between them. The prognoses of the two subgroups were significantly different (p-value = 5.82E-05; log-rank test). This prognostic difference was validated using gene expression profiles of 415 luminal-A breast cancer samples in the TCGA BRCA dataset (p-value = 0.004; log-rank test). Notably, the latent features were superior to the gene expression profiles and a traditional dimensionality reduction method for discovering the prognostic subgroups. Lastly, using differentially expressed genes and co-expression network analysis, we found that ribosome-related biological functions could potentially be associated with the prognostic difference between the subgroups. Our stratification method can contribute to understanding the complexity of luminal-A breast cancer and to providing personalized medicine.

Author summary: Luminal-A breast cancer is the most frequently occurring breast cancer subtype. However, it shows high variability in prognosis, and more precise stratification is needed. In this paper, we identified two prognostic subgroups of luminal-A breast cancer, BPS-LumA and WPS-LumA. To this end, we used deep autoencoders, which automatically generate informative latent features that represent essential properties of gene expression. We found that the two subgroups clustered using the latent features differ significantly in prognosis. This prognostic difference was validated with an external luminal-A breast cancer cohort. We showed that the latent features, unlike the gene expression profiles themselves, are able to discover the prognostic subgroups. In addition, we compared our results with two previous luminal-A breast cancer stratification methods, which are complementary to each other. Finally, we suggested biological functions associated with the differentially expressed genes between the two subgroups as potential molecular mechanisms that result in the differences in prognosis. We expect that our method could be used for personalized medicine in luminal-A breast cancer.
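
The abstract compares recurrence-free survival between the two clustered subgroups with a log-rank test. As a rough illustration of what that comparison computes, the following is a minimal pure-Python sketch of a two-group log-rank test. It is not the authors' code: the function name `logrank_test`, the input layout, and the toy follow-up times are assumptions made for this example, and real analyses would normally rely on an established survival-analysis library.

```python
import math


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group log-rank test (illustrative sketch).

    times_*  : follow-up time for each sample
    events_* : 1 if recurrence was observed, 0 if the sample was censored
    Returns (chi-square statistic, p-value with 1 degree of freedom).
    """
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e == 1})

    observed_minus_expected = 0.0  # accumulated for group A
    variance = 0.0
    for t in event_times:
        # Samples still at risk just before time t, per group.
        n_a = sum(1 for ti, _, g in data if ti >= t and g == 0)
        n_b = sum(1 for ti, _, g in data if ti >= t and g == 1)
        n = n_a + n_b
        # Recurrences observed exactly at time t.
        d_a = sum(1 for ti, e, g in data if ti == t and e == 1 and g == 0)
        d_b = sum(1 for ti, e, g in data if ti == t and e == 1 and g == 1)
        d = d_a + d_b
        # Expected events in group A if both groups shared one hazard.
        observed_minus_expected += d_a - d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)

    if variance == 0.0:
        return 0.0, 1.0
    chi2 = observed_minus_expected ** 2 / variance
    # Upper tail of a chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Toy data invented for illustration only (not from METABRIC or TCGA).
better_times, better_events = [10, 12, 15, 18, 20, 22], [0, 0, 1, 0, 0, 0]
worse_times, worse_events = [2, 3, 4, 5, 6, 7], [1, 1, 1, 1, 1, 0]

chi2, p = logrank_test(better_times, better_events, worse_times, worse_events)
print(f"chi-square = {chi2:.2f}, p-value = {p:.4f}")
```

In the workflow described above, the two input groups would correspond to the K-Means cluster assignments obtained from the autoencoder latent features. A small p-value indicates that the recurrence-free survival curves of the two subgroups differ.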

Suggested Citation

  • Seunghyun Wang & Doheon Lee, 2023. "Identifying prognostic subgroups of luminal-A breast cancer using deep autoencoders and gene expressions," PLOS Computational Biology, Public Library of Science, vol. 19(5), pages 1-18, May.
  • Handle: RePEc:plo:pcbi00:1011197
    DOI: 10.1371/journal.pcbi.1011197

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011197
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011197&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1011197?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Peter Eirew & Ciara O’Flanagan & Jerome Ting & Sohrab Salehi & Jazmine Brimhall & Beixi Wang & Justina Biele & Teresa Algara & So Ra Lee & Corey Hoang & Damian Yap & Steven McKinney & Cherie Bates & E, 2022. "Accurate determination of CRISPR-mediated gene fitness in transplantable tumours," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    2. Chotaro Onaga & Shoma Tamori & Izumi Matsuoka & Ayaka Ozaki & Hitomi Motomura & Yuka Nagashima & Tsugumichi Sato & Keiko Sato & Yuyun Xiong & Kazunori Sasaki & Shigeo Ohno & Kazunori Akimoto, 2022. "High expression of SLC20A1 is less effective for endocrine therapy and predicts late recurrence in ER-positive breast cancer," PLOS ONE, Public Library of Science, vol. 17(5), pages 1-22, May.
    3. Yang, Xi & Hoadley, Katherine A. & Hannig, Jan & Marron, J.S., 2023. "Jackstraw inference for AJIVE data integration," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    4. Giorgio Jansen & Tanda Qi & Vito Latora & Grigoris D. Amoutzias & Daniela Delneri & Stephen G. Oliver & Giuseppe Nicosia, 2024. "Minimisation of metabolic networks defines a new functional class of genes," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    5. Piaopiao Chen & Agnès H. Michel & Jianzhi Zhang, 2022. "Transposon insertional mutagenesis of diverse yeast strains suggests coordinated gene essentiality polymorphisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    6. Manish G & Anil Kumar Badana & Rama Rao Malla, 2017. "Emerging Diagnostic and Prognostic Biomarkers of Triple Negative Breast Cancer," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 1(3), pages 561-565, August.
    7. Jacob Elnaggar & Fern Tsien & Lucio Miele & Chindo Hicks & Clayton Yates & Melisa Davis, 2019. "An Integrative Genomics Approach for Associating Genetic Susceptibility with the Tumor Immune Microenvironment in Triple Negative Breast Cancer," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 15(1), pages 1-12, February.
    8. Egashira, Kento & Yata, Kazuyoshi & Aoshima, Makoto, 2024. "Asymptotic properties of hierarchical clustering in high-dimensional settings," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    9. María Elena Martínez & Jonathan T Unkart & Li Tao & Candyce H Kroenke & Richard Schwab & Ian Komenaka & Scarlett Lin Gomez, 2017. "Prognostic significance of marital status in breast cancer survival: A population-based study," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-14, May.
    10. Yishai Shimoni, 2018. "Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification," PLOS Computational Biology, Public Library of Science, vol. 14(2), pages 1-15, February.
    11. repec:plo:pone00:0103514 is not listed on IDEAS
    12. Yubo Peng & Bofeng Zhang & Furong Chang, 2021. "Overlapping Community Detection of Bipartite Networks Based on a Novel Community Density," Future Internet, MDPI, vol. 13(4), pages 1-21, March.
    13. Marcin Pilarczyk & Mehdi Fazel-Najafabadi & Michal Kouril & Behrouz Shamsaei & Juozas Vasiliauskas & Wen Niu & Naim Mahi & Lixia Zhang & Nicholas A. Clark & Yan Ren & Shana White & Rashid Karim & Huan, 2022. "Connecting omics signatures and revealing biological mechanisms with iLINCS," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    14. repec:plo:pone00:0018135 is not listed on IDEAS
    15. Junhee Seok & Ronald W Davis & Wenzhong Xiao, 2015. "A Hybrid Approach of Gene Sets and Single Genes for the Prediction of Survival Risks with Gene Expression Data," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-15, May.
    16. Qing Qu & Yan Mao & Xiao-chun Fei & Kun-wei Shen, 2013. "The Impact of Androgen Receptor Expression on Breast Cancer Survival: A Retrospective Study and Meta-Analysis," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-1, December.
    17. Fiedor, Paweł, 2014. "Sector strength and efficiency on developed and emerging financial markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 413(C), pages 180-188.
    18. Wilhelm, Thomas & Hollunder, Jens, 2007. "Information theoretic description of networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 385(1), pages 385-396.
    19. repec:plo:pone00:0081843 is not listed on IDEAS
    20. Mahyar, Hamidreza & Hasheminezhad, Rouzbeh & Ghalebi K., Elahe & Nazemian, Ali & Grosu, Radu & Movaghar, Ali & Rabiee, Hamid R., 2018. "Compressive sensing of high betweenness centrality nodes in networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 497(C), pages 166-184.
    21. Laurienti, Paul J. & Joyce, Karen E. & Telesford, Qawi K. & Burdette, Jonathan H. & Hayasaka, Satoru, 2011. "Universal fractal scaling of self-organized networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(20), pages 3608-3613.
    22. Gao, Jianbo & Hu, Jing, 2014. "Financial crisis, Omori's law, and negative entropy flow," International Review of Financial Analysis, Elsevier, vol. 33(C), pages 79-86.
    23. Bourret, Pascale & Keating, Peter & Cambrosio, Alberto, 2011. "Regulating diagnosis in post-genomic medicine: Re-aligning clinical judgment?," Social Science & Medicine, Elsevier, vol. 73(6), pages 816-824, September.
