IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-46089-y.html
   My bibliography  Save this article

Drug target prediction through deep learning functional representation of gene signatures

Author

Listed:
  • Hao Chen

    (Novartis Biomedical Research
    University of California, Riverside
    Carnegie Mellon University)

  • Frederick J. King

    (Novartis Biomedical Research)

  • Bin Zhou

    (Novartis Biomedical Research)

  • Yu Wang

    (Novartis Biomedical Research)

  • Carter J. Canedy

    (Novartis Biomedical Research)

  • Joel Hayashi

    (Novartis Biomedical Research)

  • Yang Zhong

    (Novartis Biomedical Research)

  • Max W. Chang

    (University of California, San Diego)

  • Lars Pache

    (Sanford Burnham Prebys Medical Discovery Institute)

  • Julian L. Wong

    (Novartis Biomedical Research)

  • Yong Jia

    (Novartis Biomedical Research)

  • John Joslin

    (Novartis Biomedical Research)

  • Tao Jiang

    (University of California, Riverside)

  • Christopher Benner

    (University of California, San Diego)

  • Sumit K. Chanda

    (Scripps Research)

  • Yingyao Zhou

    (Novartis Biomedical Research)

Abstract

Many machine learning applications in bioinformatics currently rely on matching gene identities when analyzing input gene signatures and fail to take advantage of preexisting knowledge about gene functions. To further enable comparative analysis of OMICS datasets, including target deconvolution and mechanism of action studies, we develop an approach that represents gene signatures projected onto their biological functions, instead of their identities, similar to how the word2vec technique works in natural language processing. We develop the Functional Representation of Gene Signatures (FRoGS) approach by training a deep learning model and demonstrate that its application to the Broad Institute’s L1000 datasets results in more effective compound-target predictions than models based on gene identities alone. By integrating additional pharmacological activity data sources, FRoGS significantly increases the number of high-quality compound-target predictions relative to existing approaches, many of which are supported by in silico and/or experimental evidence. These results underscore the general utility of FRoGS in machine learning-based bioinformatics applications. Prediction networks pre-equipped with the knowledge of gene functions may help uncover new relationships among gene signatures acquired by large-scale OMICs studies on compounds, cell types, disease models, and patient cohorts.

Suggested Citation

  • Hao Chen & Frederick J. King & Bin Zhou & Yu Wang & Carter J. Canedy & Joel Hayashi & Yang Zhong & Max W. Chang & Lars Pache & Julian L. Wong & Yong Jia & John Joslin & Tao Jiang & Christopher Benner , 2024. "Drug target prediction through deep learning functional representation of gene signatures," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46089-y
    DOI: 10.1038/s41467-024-46089-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-46089-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-46089-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Neel S. Madhukar & Prashant K. Khade & Linda Huang & Kaitlyn Gayvert & Giuseppe Galletti & Martin Stogniew & Joshua E. Allen & Paraskevi Giannakakou & Olivier Elemento, 2019. "A Bayesian machine learning approach for drug target identification using diverse data types," Nature Communications, Nature, vol. 10(1), pages 1-14, December.
    2. Alexander Lachmann & Denis Torre & Alexandra B. Keenan & Kathleen M. Jagodnik & Hoyjin J. Lee & Lily Wang & Moshe C. Silverstein & Avi Ma’ayan, 2018. "Massive mining of publicly available RNA-seq data from human and mouse," Nature Communications, Nature, vol. 9(1), pages 1-10, December.
    3. George C. Linderman & Jun Zhao & Manolis Roulis & Piotr Bielecki & Richard A. Flavell & Boaz Nadler & Yuval Kluger, 2022. "Zero-preserving imputation of single-cell RNA-seq data," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    4. Yifan Zhao & Huiyu Cai & Zuobai Zhang & Jian Tang & Yue Li, 2021. "Publisher Correction: Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data," Nature Communications, Nature, vol. 12(1), pages 1-1, December.
    5. Yifan Zhao & Huiyu Cai & Zuobai Zhang & Jian Tang & Yue Li, 2021. "Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    6. Yingyao Zhou & Bin Zhou & Lars Pache & Max Chang & Alireza Hadj Khodabakhshi & Olga Tanaseichuk & Christopher Benner & Sumit K. Chanda, 2019. "Metascape provides a biologist-oriented resource for the analysis of systems-level datasets," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrea Riba & Attila Oravecz & Matej Durik & Sara Jiménez & Violaine Alunni & Marie Cerciat & Matthieu Jung & Céline Keime & William M. Keyes & Nacho Molina, 2022. "Cell cycle gene regulation dynamics revealed by RNA velocity and deep-learning," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    2. Xiangwei Li & Thomas Delerue & Ben Schöttker & Bernd Holleczek & Eva Grill & Annette Peters & Melanie Waldenberger & Barbara Thorand & Hermann Brenner, 2022. "Derivation and validation of an epigenetic frailty risk score in population-based cohorts of older adults," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    3. Andreas Herchenröther & Stefanie Gossen & Tobias Friedrich & Alexander Reim & Nadine Daus & Felix Diegmüller & Jörg Leers & Hakimeh Moghaddas Sani & Sarah Gerstner & Leah Schwarz & Inga Stellmacher & , 2023. "The H2A.Z and NuRD associated protein HMG20A controls early head and heart developmental transcription programs," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    4. Jonathan P. Ling & Alexei M. Bygrave & Clayton P. Santiago & Rogger P. Carmen-Orozco & Vickie T. Trinh & Minzhong Yu & Yini Li & Ying Liu & Kyra D. Bowden & Leighton H. Duncan & Jeong Han & Kamil Tane, 2022. "Cell-specific regulation of gene expression using splicing-dependent frameshifting," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    5. Ramachandran Prakasam & Angela Bonadiman & Roberta Andreotti & Emanuela Zuccaro & Davide Dalfovo & Caterina Marchioretti & Debasmita Tripathy & Gianluca Petris & Eric N. Anderson & Alice Migazzi & Lau, 2023. "LSD1/PRMT6-targeting gene therapy to attenuate androgen receptor toxic gain-of-function ameliorates spinobulbar muscular atrophy phenotypes in flies and mice," Nature Communications, Nature, vol. 14(1), pages 1-22, December.
    6. Li Guo & Cheng Hu & Yang Liu & Xiaoyu Chen & Deli Song & Runling Shen & Zhanzhen Liu & Xudong Jia & Qinfen Zhang & Yuanzhu Gao & Zhezhi Deng & Tao Zuo & Jun Hu & Wenbo Zhu & Jing Cai & Guangmei Yan & , 2023. "Directed natural evolution generates a next-generation oncolytic virus with a high potency and safety profile," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    7. Cecilia Pessoa Rodrigues & Aindrila Chatterjee & Meike Wiese & Thomas Stehle & Witold Szymanski & Maria Shvedunova & Asifa Akhtar, 2021. "Histone H4 lysine 16 acetylation controls central carbon metabolism and diet-induced obesity in mice," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
    8. Tianshi Feng & Xuemei Zhao & Ping Gu & Wah Yang & Cunchuan Wang & Qingyu Guo & Qiaoyun Long & Qing Liu & Ying Cheng & Jin Li & Cynthia Kwan Yui Cheung & Donghai Wu & Xinyu Kong & Yong Xu & Dewei Ye & , 2022. "Adipocyte-derived lactate is a signalling metabolite that potentiates adipose macrophage inflammation via targeting PHD2," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    9. Elisa Bellucci & Andrea Benazzo & Chunming Xu & Elena Bitocchi & Monica Rodriguez & Saleh Alseekh & Valerio Di Vittori & Tania Gioia & Kerstin Neumann & Gaia Cortinovis & Giulia Frascarelli & Ester Mu, 2023. "Selection and adaptive introgression guided the complex evolutionary history of the European common bean," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    10. Jiayin Peng & Lili Han & Biao Liu & Jiawen Song & Yuang Wang & Kunpeng Wang & Qian Guo & XinYan Liu & Yu Li & Jujin Zhang & Wenqing Wu & Sheng Li & Xin Fu & Cheng-le Zhuang & Weikang Zhang & Shengbao , 2023. "Gli1 marks a sentinel muscle stem cell population for muscle regeneration," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    11. Yichuan Cao & Xiamiao Zhao & Songming Tang & Qun Jiang & Sijie Li & Siyu Li & Shengquan Chen, 2024. "scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    12. Elisa Setten & Alessandra Castagna & Josué Manik Nava-Sedeño & Jonathan Weber & Roberta Carriero & Andreas Reppas & Valery Volk & Jessica Schmitz & Wilfried Gwinner & Haralampos Hatzikirou & Friedrich, 2022. "Understanding fibrosis pathogenesis via modeling macrophage-fibroblast interplay in immune-metabolic context," Nature Communications, Nature, vol. 13(1), pages 1-22, December.
    13. Dafne Ibarra-Morales & Michael Rauer & Piergiuseppe Quarato & Leily Rabbani & Fides Zenk & Mariana Schulte-Sasse & Francesco Cardamone & Alejandro Gomez-Auli & Germano Cecere & Nicola Iovino, 2021. "Histone variant H2A.Z regulates zygotic genome activation," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    14. Ryuki Shimada & Yuzuru Kato & Naoki Takeda & Sayoko Fujimura & Kei-ichiro Yasunaga & Shingo Usuki & Hitoshi Niwa & Kimi Araki & Kei-ichiro Ishiguro, 2023. "STRA8–RB interaction is required for timely entry of meiosis in mouse female germ cells," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    15. Tingting Bo & Jie Li & Ganlu Hu & Ge Zhang & Wei Wang & Qian Lv & Shaoling Zhao & Junjie Ma & Meng Qin & Xiaohui Yao & Meiyun Wang & Guang-Zhong Wang & Zheng Wang, 2023. "Brain-wide and cell-specific transcriptomic insights into MRI-derived cortical morphology in macaque monkeys," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    16. Matthieu Dos Santos & Akansha M. Shah & Yichi Zhang & Svetlana Bezprozvannaya & Kenian Chen & Lin Xu & Weichun Lin & John R. McAnally & Rhonda Bassel-Duby & Ning Liu & Eric N. Olson, 2023. "Opposing gene regulatory programs governing myofiber development and maturation revealed at single nucleus resolution," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    17. Gaofei Li & Yicong Sun & Immanuel Kwok & Liting Yang & Wanying Wen & Peixian Huang & Mei Wu & Jing Li & Zhibin Huang & Zhaoyuan Liu & Shuai He & Wan Peng & Jin-Xin Bei & Florent Ginhoux & Lai Guan Ng , 2024. "Cebp1 and Cebpβ transcriptional axis controls eosinophilopoiesis in zebrafish," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    18. Mei Zhao & Tianxiao Wang & Frederico O. Gleber-Netto & Zhen Chen & Daniel J. McGrail & Javier A. Gomez & Wutong Ju & Mayur A. Gadhikar & Wencai Ma & Li Shen & Qi Wang & Ximing Tang & Sen Pathak & Mari, 2024. "Mutant p53 gains oncogenic functions through a chromosomal instability-induced cytosolic DNA response," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    19. Yingyan Yu & Zhen Zhang & Xiaorui Dong & Ruixin Yang & Zhongqu Duan & Zhen Xiang & Jun Li & Guichao Li & Fazhe Yan & Hongzhang Xue & Du Jiao & Jinyuan Lu & Huimin Lu & Wenmin Zhang & Yangzhen Wei & Sh, 2022. "Pangenomic analysis of Chinese gastric cancer," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    20. Jae-Seung Moon & Chun-Chang Ho & Jong-Hyun Park & Kyungsoo Park & Bo-Young Shin & Su-Hyeon Lee & Ines Sequeira & Chin Hee Mun & Jin-Su Shin & Jung-Ho Kim & Beom Seok Kim & Jin-Wook Noh & Eui-Seon Lee , 2023. "Lrig1-expression confers suppressive function to CD4+ cells and is essential for averting autoimmunity via the Smad2/3/Foxp3 axis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46089-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.