IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0333134.html

A contrastive adversarial encoder for multi-omics data integration

Author

Listed:
  • Ma Yinghua
  • Ahmad Khan
  • Yang Heng
  • Fiaz Gul Khan
  • Afnan Aldhahri
  • Iftikhar Ahmed Khan

Abstract

Early and accurate cancer detection is crucial for effective treatment, prognosis, and the advancement of precision medicine. Analyzing omics data is vital in cancer research. While using a single type of omics data provides a limited perspective, integrating multiple omics modalities allows for a more comprehensive understanding of cancer. Current deep models struggle to achieve efficient dimensionality reduction while preserving global information and integrating multi-omics data. This often results in feature redundancy or information loss, overlooking the synergies among different modalities. This paper proposes a contrastive adversarial encoder (CAEncoder) for multi-omics data integration to address this challenge. The proposed model combines a Vision Transformer (ViT) and a CycleGAN, trained in an end-to-end contrastive manner. The ViT is the encoder, utilizing self-attention, while the CycleGAN employs adversarial learning to ensure more discriminative and invariant latent space embeddings. Contrastive adversarial training improves representation quality by preventing information loss, eliminating redundancy, and capturing the synergies among different omics modalities. To ensure contrastive adversarial training, a composite loss function is used, consisting of a weighted combination of Adversarial Loss (Hinge Loss), Cycle Consistency Loss, and Triplet Margin Loss. The Adversarial Loss and Cycle Consistency Loss provide feedback from the CycleGAN, ensuring effective adversarial learning. Meanwhile, the Triplet Margin Loss promotes contrastive learning by pulling similar samples together and pushing dissimilar samples apart in the latent space. The performance of the CAEncoder is evaluated on downstream classification tasks, including both binary and multi-class classifications of five different cancer types. The results show that the model achieved a classification accuracy of up to 93.33% and an F1 score of 92.81%, outperforming existing advanced models. These findings demonstrate the potential of our method to enhance precision medicine for cancer through improved multi-omics data integration.

Suggested Citation

  • Ma Yinghua & Ahmad Khan & Yang Heng & Fiaz Gul Khan & Afnan Aldhahri & Iftikhar Ahmed Khan, 2025. "A contrastive adversarial encoder for multi-omics data integration," PLOS ONE, Public Library of Science, vol. 20(10), pages 1-16, October.
  • Handle: RePEc:plo:pone00:0333134
    DOI: 10.1371/journal.pone.0333134
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0333134
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0333134&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0333134?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0333134. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.