IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-36634-6.html
   My bibliography  Save this article

Hypothesis-free phenotype prediction within a genetics-first framework

Author

Listed:
  • Chang Lu

    (Cambridge Biomedical Campus)

  • Jan Zaucha

    (University of Bristol)

  • Rihab Gam

    (Cambridge Biomedical Campus)

  • Hai Fang

    (University of Bristol
    Ruijin Hospital affiliated to Shanghai Jiao Tong University School of Medicine)

  • Smithers

    (University of Bristol)

  • Matt E. Oates

    (University of Bristol)

  • Miguel Bernabe-Rubio

    (King’s College London, Guy’s Hospital)

  • James Williams

    (King’s College London, Guy’s Hospital)

  • Natalie Zelenka

    (University of Bristol)

  • Arun Prasad Pandurangan

    (Cambridge Biomedical Campus)

  • Himani Tandon

    (Cambridge Biomedical Campus)

  • Hashem Shihab

    (University of Bristol)

  • Raju Kalaivani

    (Cambridge Biomedical Campus)

  • Minkyung Sung

    (Cambridge Biomedical Campus)

  • Adam J. Sardar

    (University of Bristol)

  • Bastian Greshake Tzovoras

    (Université de Paris, INSERM U1284, Center for Research and Interdisciplinarity (CRI))

  • Davide Danovi

    (King’s College London, Guy’s Hospital)

  • Julian Gough

    (Cambridge Biomedical Campus
    University of Bristol)

Abstract

Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied.

Suggested Citation

  • Chang Lu & Jan Zaucha & Rihab Gam & Hai Fang & Smithers & Matt E. Oates & Miguel Bernabe-Rubio & James Williams & Natalie Zelenka & Arun Prasad Pandurangan & Himani Tandon & Hashem Shihab & Raju Kalai, 2023. "Hypothesis-free phenotype prediction within a genetics-first framework," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36634-6
    DOI: 10.1038/s41467-023-36634-6
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-36634-6
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-36634-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Helena Kilpinen & Angela Goncalves & Andreas Leha & Vackar Afzal & Kaur Alasoo & Sofie Ashford & Sendu Bala & Dalila Bensaddek & Francesco Paolo Casale & Oliver J. Culley & Petr Danecek & Adam Faulcon, 2017. "Common genetic variation drives molecular heterogeneity in human iPSCs," Nature, Nature, vol. 546(7658), pages 370-375, June.
    2. D. G. MacArthur & T. A. Manolio & D. P. Dimmock & H. L. Rehm & J. Shendure & G. R. Abecasis & D. R. Adams & R. B. Altman & S. E. Antonarakis & E. A. Ashley & J. C. Barrett & L. G. Biesecker & D. F. Co, 2014. "Guidelines for investigating causality of sequence variants in human disease," Nature, Nature, vol. 508(7497), pages 469-476, April.
    3. Quanli Wang & Ryan S. Dhindsa & Keren Carss & Andrew R. Harper & Abhishek Nag & Ioanna Tachmazidou & Dimitrios Vitsios & Sri V. V. Deevi & Alex Mackay & Daniel Muthas & Michael Hühn & Susan Monkley & , 2021. "Rare variant contribution to human disease in 281,104 UK Biobank exomes," Nature, Nature, vol. 597(7877), pages 527-532, September.
    4. Teri A. Manolio & Francis S. Collins & Nancy J. Cox & David B. Goldstein & Lucia A. Hindorff & David J. Hunter & Mark I. McCarthy & Erin M. Ramos & Lon R. Cardon & Aravinda Chakravarti & Judy H. Cho &, 2009. "Finding the missing heritability of complex diseases," Nature, Nature, vol. 461(7265), pages 747-753, October.
    5. Helena Kilpinen & Angela Goncalves & Andreas Leha & Vackar Afzal & Kaur Alasoo & Sofie Ashford & Sendu Bala & Dalila Bensaddek & Francesco Paolo Casale & Oliver J. Culley & Petr Danecek & Adam Faulcon, 2017. "Correction: Corrigendum: Common genetic variation drives molecular heterogeneity in human iPSCs," Nature, Nature, vol. 546(7660), pages 686-686, June.
    6. Eric Vallabh Minikel & Konrad J. Karczewski & Hilary C. Martin & Beryl B. Cummings & Nicola Whiffin & Daniel Rhodes & Jessica Alföldi & Richard C. Trembath & David A. Heel & Mark J. Daly & Stuart L. S, 2020. "Evaluating drug targets through human loss-of-function genetic variation," Nature, Nature, vol. 581(7809), pages 459-464, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Qiliang Ding & Matthew M. Edwards & Ning Wang & Xiang Zhu & Alexa N. Bracci & Michelle L. Hulke & Ya Hu & Yao Tong & Joyce Hsiao & Christine J. Charvet & Sulagna Ghosh & Robert E. Handsaker & Kevin Eg, 2021. "The genetic architecture of DNA replication timing in human pluripotent stem cells," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    3. Drew R. Neavin & Angela M. Steinmann & Nona Farbehi & Han Sheng Chiu & Maciej S. Daniszewski & Himanshi Arora & Yasmin Bermudez & Cátia Moutinho & Chia-Ling Chan & Monique Bax & Mubarika Tyebally & Vi, 2023. "A village in a dish model system for population-scale hiPSC studies," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    4. Sulagna Ghosh & Ralda Nehme & Lindy E. Barrett, 2022. "Greater genetic diversity is needed in human pluripotent stem cell models," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    5. Remo Monti & Pia Rautenstrauch & Mahsa Ghanbari & Alva Rani James & Matthias Kirchler & Uwe Ohler & Stefan Konigorski & Christoph Lippert, 2022. "Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    6. Benjamin J. Schmiedel & Job Rocha & Cristian Gonzalez-Colin & Sourya Bhattacharyya & Ariel Madrigal & Christian H. Ottensmeier & Ferhat Ay & Vivek Chandra & Pandurangan Vijayanand, 2021. "COVID-19 genetic risk variants are associated with expression of multiple genes in diverse immune cell types," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    7. Yuan Guan & Annika Enejder & Meiyue Wang & Zhuoqing Fang & Lu Cui & Shih-Yu Chen & Jingxiao Wang & Yalun Tan & Manhong Wu & Xinyu Chen & Patrik K. Johansson & Issra Osman & Koshi Kunimoto & Pierre Rus, 2021. "A human multi-lineage hepatic organoid model for liver fibrosis," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    8. Ian Barnett & Rajarshi Mukherjee & Xihong Lin, 2017. "The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 64-76, January.
    9. Bingxin Zhao & Fei Zou, 2022. "On polygenic risk scores for complex traits prediction," Biometrics, The International Biometric Society, vol. 78(2), pages 499-511, June.
    10. von Stumm, Sophie & Kandaswamy, Radhika & Maxwell, Jessye, 2023. "Gene-environment interplay in early life cognitive development," Intelligence, Elsevier, vol. 98(C).
    11. Jörn Bethune & April Kleppe & Søren Besenbacher, 2022. "A method to build extended sequence context models of point mutations and indels," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    12. Yunfeng Huang & Dora Bodnar & Chia-Yen Chen & Gabriela Sanchez-Andrade & Mark Sanderson & Jun Shi & Katherine G. Meilleur & Matthew E. Hurles & Sebastian S. Gerety & Ellen A. Tsai & Heiko Runz, 2023. "Rare genetic variants impact muscle strength," Nature Communications, Nature, vol. 14(1), pages 1-8, December.
    13. Keith Woodley & Laura S. Dillingh & George Giotopoulos & Pedro Madrigal & Kevin M. Rattigan & Céline Philippe & Vilma Dembitz & Aoife M. S. Magee & Ryan Asby & Louie N. van de Lagemaat & Christopher M, 2023. "Mannose metabolism inhibition sensitizes acute myeloid leukaemia cells to therapy by driving ferroptotic cell death," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    14. Marcin Kierczak & Nima Rafati & Julia Höglund & Hadrien Gourlé & Valeria Lo Faro & Daniel Schmitz & Weronica E. Ek & Ulf Gyllensten & Stefan Enroth & Diana Ekman & Björn Nystedt & Torgny Karlsson & Ås, 2022. "Contribution of rare whole-genome sequencing variants to plasma protein levels and the missing heritability," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    15. Ozvan Bocher & Cristen J. Willer & Eleftheria Zeggini, 2023. "Unravelling the genetic architecture of human complex traits through whole genome sequencing," Nature Communications, Nature, vol. 14(1), pages 1-4, December.
    16. Zhuoran Xu & Quan Li & Luigi Marchionni & Kai Wang, 2023. "PhenoSV: interpretable phenotype-aware model for the prioritization of genes affected by structural variants," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    17. Matthias Wuttke & Eva König & Maria-Alexandra Katsara & Holger Kirsten & Saeed Khomeijani Farahani & Alexander Teumer & Yong Li & Martin Lang & Burulca Göcmen & Cristian Pattaro & Dorothee Günzel & An, 2023. "Imputation-powered whole-exome analysis identifies genes associated with kidney function and disease in the UK Biobank," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    18. Guo, Fangqi & Harris, Kathleen Mullan & Boardman, Jason D. & Robinette, Jennifer W., 2022. "Does crime trigger genetic risk for type 2 diabetes in young adults? A G x E interaction study using national data," Social Science & Medicine, Elsevier, vol. 313(C).
    19. Surina Singh & Ananyo Choudhury & Scott Hazelhurst & Nigel J. Crowther & Palwendé R. Boua & Hermann Sorgho & Godfred Agongo & Engelbert A. Nonterah & Lisa K. Micklesfield & Shane A. Norris & Isaac Kis, 2023. "Genome-wide association study meta-analysis of blood pressure traits and hypertension in sub-Saharan African populations: an AWI-Gen study," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    20. Lucas Alvizi & Diogo Nani & Luciano Abreu Brito & Gerson Shigeru Kobayashi & Maria Rita Passos-Bueno & Roberto Mayor, 2023. "Neural crest E-cadherin loss drives cleft lip/palate by epigenetic modulation via pro-inflammatory gene–environment interaction," Nature Communications, Nature, vol. 14(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36634-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.