Author
Listed:
- Xiangru Tang
(Yale University)
- Qiao Jin
(National Institutes of Health)
- Kunlun Zhu
(Mila-Quebec AI Institute)
- Tongxin Yuan
(Shanghai Jiao Tong University)
- Yichi Zhang
(Yale University)
- Wangchunshu Zhou
(OPPO Research Institute)
- Meng Qu
(Mila-Quebec AI Institute)
- Yilun Zhao
(Yale University)
- Jian Tang
(Mila-Quebec AI Institute)
- Zhuosheng Zhang
(Shanghai Jiao Tong University)
- Arman Cohan
(Yale University)
- Dov Greenbaum
(Reichman University
Yale University)
- Zhiyong Lu
(National Institutes of Health)
- Mark Gerstein
(Yale University
Yale University
Yale University
Yale University)
Abstract
AI scientists powered by large language models have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines. While their capabilities are promising, these agents also introduce novel vulnerabilities that require careful consideration for safety. However, there has been limited comprehensive exploration of these vulnerabilities. This perspective examines vulnerabilities in AI scientists, shedding light on potential risks associated with their misuse, and emphasizing the need for safety measures. We begin by providing an overview of the potential risks inherent to AI scientists, taking into account user intent, the specific scientific domain, and their potential impact on the external environment. Then, we explore the underlying causes of these vulnerabilities and provide a scoping review of the limited existing works. Based on our analysis, we propose a triadic framework involving human regulation, agent alignment, and an understanding of environmental feedback (agent regulation) to mitigate these identified risks. Furthermore, we highlight the limitations and challenges associated with safeguarding AI scientists and advocate for the development of improved models, robust benchmarks, and comprehensive regulations.
Suggested Citation
Xiangru Tang & Qiao Jin & Kunlun Zhu & Tongxin Yuan & Yichi Zhang & Wangchunshu Zhou & Meng Qu & Yilun Zhao & Jian Tang & Zhuosheng Zhang & Arman Cohan & Dov Greenbaum & Zhiyong Lu & Mark Gerstein, 2025.
"Risks of AI scientists: prioritizing safeguarding over autonomy,"
Nature Communications, Nature, vol. 16(1), pages 1-11, December.
Handle:
RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63913-1
DOI: 10.1038/s41467-025-63913-1
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63913-1. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.