Author
Listed:
- Marc Feuermann
(Centre Medical Universitaire)
- Huaiyu Mi
(University of Southern California Los Angeles)
- Pascale Gaudet
(Centre Medical Universitaire)
- Anushya Muruganujan
(University of Southern California Los Angeles)
- Suzanna E. Lewis
(Lawrence Berkeley National Laboratory)
- Dustin Ebert
(University of Southern California Los Angeles)
- Tremayne Mushayahama
(University of Southern California Los Angeles)
- Paul D. Thomas
(University of Southern California Los Angeles)
Abstract
A comprehensive, computable representation of the functional repertoire of all macromolecules encoded within the human genome is a foundational resource for biology and biomedical research. The Gene Ontology Consortium has been working towards this goal by generating a structured body of information about gene functions, which now includes experimental findings reported in more than 175,000 publications for human genes and genes in experimentally tractable model organisms1,2. Here, we describe the results of a large, international effort to integrate all of these findings to create a representation of human gene functions that is as complete and accurate as possible. Specifically, we apply an expert-curated, explicit evolutionary modelling approach to all human protein-coding genes. This approach integrates available experimental information across families of related genes into models that reconstruct the gain and loss of functional characteristics over evolutionary time. The models and the resulting set of 68,667 integrated gene functions cover approximately 82% of human protein-coding genes. The functional repertoire reveals a marked preponderance of molecular regulatory functions, and the models provide insights into the evolutionary origins of human gene functions. We show that our set of descriptions of functions can improve the widely used genomic technique of Gene Ontology enrichment analysis. The experimental evidence for each functional characteristic is recorded, thereby enabling the scientific community to help review and improve the resource, which we have made publicly available.
Suggested Citation
Marc Feuermann & Huaiyu Mi & Pascale Gaudet & Anushya Muruganujan & Suzanna E. Lewis & Dustin Ebert & Tremayne Mushayahama & Paul D. Thomas, 2025.
"A compendium of human gene functions derived from evolutionary modelling,"
Nature, Nature, vol. 640(8057), pages 146-154, April.
Handle:
RePEc:nat:nature:v:640:y:2025:i:8057:d:10.1038_s41586-025-08592-0
DOI: 10.1038/s41586-025-08592-0
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:640:y:2025:i:8057:d:10.1038_s41586-025-08592-0. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.