IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-63730-6.html
   My bibliography  Save this article

Unified and explainable molecular representation learning for imperfectly annotated data from the hypergraph view

Author

Listed:
  • Bowen Wang

    (The Chinese University of Hong Kong)

  • Junyou Li

    (Zhejiang Lab)

  • Donghao Zhou

    (The Chinese University of Hong Kong)

  • Lanqing Li

    (The Chinese University of Hong Kong
    Zhejiang Lab)

  • Jinpeng Li

    (The Chinese University of Hong Kong)

  • Ercheng Wang

    (Zhejiang Lab)

  • Jianye Hao

    (Tianjin University)

  • Liang Shi

    (University of California)

  • Chengqiang Lu

    (University of Science and Technology of China)

  • Jiezhong Qiu

    (Zhejiang Lab)

  • Tingjun Hou

    (Zhejiang University)

  • Dongsheng Cao

    (Central South University)

  • Guangyong Chen

    (Chinese Academy of Science)

  • Pheng Ann Heng

    (The Chinese University of Hong Kong)

Abstract

Molecular representation learning (MRL) has shown promise in accelerating drug development by predicting chemical properties. However, imperfectly annotation among datasets pose challenges in model design and explainability. In this work, we formulate molecules and corresponding properties as a hypergraph, extracting three key relationships: among properties, molecule-to-property, and among molecules, and developed a unified and explainable multi-task MRL framework, OmniMol. It integrates a task-related meta-information encoder and a task-routed mixture of experts (t-MoE) backbone to capture correlations among properties and produce task-adaptive outputs. To capture underlying physical principles among molecules, we implement an innovative SE(3)-encoder for physical symmetry, applying equilibrium conformation supervision, recursive geometry updates, and scale-invariant message passing to facilitate learning-based conformational relaxation. OmniMol achieves state-of-the-art performance in properties prediction, reaches top performance in chirality-aware tasks, demonstrates explainability for all three relations, and shows effective performance in practical applications. Our code is available in our https://github.com/bowenwang77/OmniMol public repository.

Suggested Citation

  • Bowen Wang & Junyou Li & Donghao Zhou & Lanqing Li & Jinpeng Li & Ercheng Wang & Jianye Hao & Liang Shi & Chengqiang Lu & Jiezhong Qiu & Tingjun Hou & Dongsheng Cao & Guangyong Chen & Pheng Ann Heng, 2025. "Unified and explainable molecular representation learning for imperfectly annotated data from the hypergraph view," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63730-6
    DOI: 10.1038/s41467-025-63730-6
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-63730-6
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-63730-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Michael Schlander & Karla Hernandez-Villafuerte & Chih-Yuan Cheng & Jorge Mestre-Ferrandiz & Michael Baumann, 2021. "How Much Does It Cost to Research and Develop a New Drug? A Systematic Review and Assessment," PharmacoEconomics, Springer, vol. 39(11), pages 1243-1269, November.
    2. Yusong Wang & Tong Wang & Shaoning Li & Xinheng He & Mingyu Li & Zun Wang & Nanning Zheng & Bin Shao & Tie-Yan Liu, 2024. "Enhancing geometric representations for molecules with equivariant vector-scalar interactive message passing," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    3. Anastasiia V. Sadybekov & Vsevolod Katritch, 2023. "Computational approaches streamlining drug discovery," Nature, Nature, vol. 616(7958), pages 673-685, April.
    4. Felix Wong & Erica J. Zheng & Jacqueline A. Valeri & Nina M. Donghia & Melis N. Anahtar & Satotaka Omori & Alicia Li & Andres Cubillos-Ruiz & Aarti Krishnan & Wengong Jin & Abigail L. Manson & Jens Fr, 2024. "Discovery of a structural class of antibiotics with explainable deep learning," Nature, Nature, vol. 626(7997), pages 177-185, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bo He & Yanhao Hu & Dongshan Liu & Xin Zang & Xu He & Wang Chen & Jingfang Yang & Mingfeng Feng & Ping Chen & Lirong Wei & Yu Li & Wei Yan & Jun Li & Zhike Feng & Jiahai Zhou & Yonghao Ye, 2025. "Substrate-based discovery of α-hydroxycarboxylic acid derivatives as potential herbicides targeting dihydroxyacid dehydratase," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    2. Caimmi, Michele & Canali, Beatrice & Candelora, Laura & Di Costanzo, Alessandra & Fiorentino, Francesca & Vassallo, Chiara & Mancusi, Maria Luisa, 2025. "Price regulation and competition among on-patent anticancer drugs in Italy," Health Policy, Elsevier, vol. 157(C).
    3. Changge Guan & Marcelo D. T. Torres & Sufen Li & Cesar de la Fuente-Nunez, 2025. "Computational exploration of global venoms for antimicrobial discovery with Venomics artificial intelligence," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    4. Xin Chen & Kexin Wang & Jianfang Chen & Chao Wu & Jun Mao & Yuanpeng Song & Yijing Liu & Zhenhua Shao & Xuemei Pu, 2024. "Integrative residue-intuitive machine learning and MD Approach to Unveil Allosteric Site and Mechanism for β2AR," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    5. Alessio Fallani & Leonardo Medrano Sandonas & Alexandre Tkatchenko, 2024. "Inverse mapping of quantum properties to structures for chemical space of small organic molecules," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    6. Xiaohan Lin & Yijie Xia & Yanheng Li & Yu-Peng Huang & Shuo Liu & Jun Zhang & Yi Qin Gao, 2025. "In-silico 3D molecular editing through physics-informed and preference-aligned generative foundation models," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    7. Taoyong Cui & Chenyu Tang & Dongzhan Zhou & Yuqiang Li & Xingao Gong & Wanli Ouyang & Mao Su & Shufei Zhang, 2025. "Online test-time adaptation for better generalization of interatomic potentials to out-of-distribution data," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    8. Hamid Hadipour & Yan Yi Li & Yan Sun & Chutong Deng & Leann Lac & Rebecca Davis & Silvia T. Cardona & Pingzhao Hu, 2025. "GraphBAN: An inductive graph-based approach for enhanced prediction of compound-protein interactions," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    9. Arnau Comajuncosa-Creus & Guillem Jorba & Xavier Barril & Patrick Aloy, 2024. "Comprehensive detection and characterization of human druggable pockets through binding site descriptors," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    10. Roberto Olayo-Alarcon & Martin K. Amstalden & Annamaria Zannoni & Medina Bajramovic & Cynthia M. Sharma & Ana Rita Brochado & Mina Rezaei & Christian L. Müller, 2025. "Pre-trained molecular representations enable antimicrobial discovery," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    11. Patrick Bryant & Atharva Kelkar & Andrea Guljas & Cecilia Clementi & Frank Noé, 2024. "Structure prediction of protein-ligand complexes from sequence information with Umol," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    12. Qichang Zhao & Haochen Zhao & Linyuan Guo & Kai Zheng & Yajie Li & Qiao Ling & Jing Tang & Yaohang Li & Jianxin Wang, 2025. "ColdstartCPI: Induced-fit theory-guided DTI predictive model with improved generalization performance," Nature Communications, Nature, vol. 16(1), pages 1-22, December.
    13. Yixiang Ruan & Chenyin Lu & Ning Xu & Yuchen He & Yixin Chen & Jian Zhang & Jun Xuan & Jianzhang Pan & Qun Fang & Hanyu Gao & Xiaodong Shen & Ning Ye & Qiang Zhang & Yiming Mo, 2024. "An automatic end-to-end chemical synthesis development platform powered by large language models," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    14. Mingyang Wang & Shuai Li & Jike Wang & Odin Zhang & Hongyan Du & Dejun Jiang & Zhenxing Wu & Yafeng Deng & Yu Kang & Peichen Pan & Dan Li & Xiaorui Wang & Xiaojun Yao & Tingjun Hou & Chang-Yu Hsieh, 2024. "ClickGen: Directed exploration of synthesizable chemical space via modular reactions and reinforcement learning," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    15. Peiyuan Ma & Ritesh Kumar & Ke-Hsin Wang & Chibueze V. Amanchukwu, 2025. "Active learning accelerates electrolyte solvent screening for anode-free lithium metal batteries," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    16. Daniel Tobias Michaeli & Hasan Basri Yagmur & Timur Achmadeev & Thomas Michaeli, 2022. "Value drivers of development stage biopharma companies," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 23(8), pages 1287-1296, November.
    17. Dianlong Zhao & Shunxin Li & Yang Su & Jiajun Qin & Guanjun Xiao & Yuchen Shang & Xiu Yin & Pengfei Lv & Feng Wang & Jiayi Yang & Zhaodong Liu & Fujun Lan & Qiaoshi Zeng & Lijun Zhang & Feng Gao & Bo , 2025. "Pressure encryption toward physically uncopiable anti-counterfeiting," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    18. Christopher J. Barden & Fan Wu & J. Pedro Fernandez-Murray & Erhu Lu & Shengguo Sun & Marcia M. Taylor & Annette L. Rushton & Jason Williams & Mahtab Tavasoli & Autumn Meek & Alla Siva Reddy & Lisa M., 2024. "Computer-aided drug design to generate a unique antibiotic family," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    19. J. Thorben Frank & Oliver T. Unke & Klaus-Robert Müller & Stefan Chmiela, 2024. "A Euclidean transformer for fast and stable machine learned force fields," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    20. Xiaoting Shan & Ying Cai & Binyu Zhu & Lingli Zhou & Xujie Sun & Xiaoxuan Xu & Qi Yin & Dangge Wang & Yaping Li, 2024. "Rational strategies for improving the efficiency of design and discovery of nanomedicines," Nature Communications, Nature, vol. 15(1), pages 1-9, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63730-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.