IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-54678-0.html
   My bibliography  Save this article

A phenome-wide association study of tandem repeat variation in 168,554 individuals from the UK Biobank

Author

Listed:
  • Celine A. Manigbas

    (Icahn School of Medicine at Mount)

  • Bharati Jadhav

    (Icahn School of Medicine at Mount)

  • Paras Garg

    (Icahn School of Medicine at Mount)

  • Mariya Shadrina

    (Icahn School of Medicine at Mount)

  • William Lee

    (Icahn School of Medicine at Mount)

  • Gabrielle Altman

    (Icahn School of Medicine at Mount)

  • Alejandro Martin-Trujillo

    (Icahn School of Medicine at Mount)

  • Andrew J. Sharp

    (Icahn School of Medicine at Mount)

Abstract

Most genetic association studies focus on binary variants. To identify the effects of multi-allelic variation of tandem repeats (TRs) on human traits, we perform direct TR genotyping and phenome-wide association studies in 168,554 individuals from the UK Biobank, identifying 47 TRs showing fine-mapped associations with 73 traits. We replicate 23 of 31 (74%) of these associations in the All of Us cohort. While this set includes several known repeat expansion disorders, novel associations we found are attributable to common polymorphic variation in TR length rather than rare expansions and include e.g. a coding polyhistidine motif in HRCT1 influencing risk of hypertension and a poly(CGC) in the 5’UTR of GNB2 influencing heart rate. Fine-mapped TRs are strongly enriched for associations with local gene expression and DNA methylation. Our study highlights the contribution of multi-allelic TRs to the “missing heritability” of the human genome.

Suggested Citation

  • Celine A. Manigbas & Bharati Jadhav & Paras Garg & Mariya Shadrina & William Lee & Gabrielle Altman & Alejandro Martin-Trujillo & Andrew J. Sharp, 2024. "A phenome-wide association study of tandem repeat variation in 168,554 individuals from the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-54678-0
    DOI: 10.1038/s41467-024-54678-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-54678-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-54678-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Teri A. Manolio & Francis S. Collins & Nancy J. Cox & David B. Goldstein & Lucia A. Hindorff & David J. Hunter & Mark I. McCarthy & Erin M. Ramos & Lon R. Cardon & Aravinda Chakravarti & Judy H. Cho &, 2009. "Finding the missing heritability of complex diseases," Nature, Nature, vol. 461(7265), pages 747-753, October.
    2. Wen-Wei Liao & Mobin Asri & Jana Ebler & Daniel Doerr & Marina Haukness & Glenn Hickey & Shuangjia Lu & Julian K. Lucas & Jean Monlong & Haley J. Abel & Silvia Buonaiuto & Xian H. Chang & Haoyu Cheng , 2023. "A draft human pangenome reference," Nature, Nature, vol. 617(7960), pages 312-324, May.
    3. Oliver Stegle & Leopold Parts & Richard Durbin & John Winn, 2010. "A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies," PLOS Computational Biology, Public Library of Science, vol. 6(5), pages 1-11, May.
    4. Shubham Saini & Ileena Mitra & Nima Mousavi & Stephanie Feupe Fotsing & Melissa Gymrek, 2018. "A reference haplotype panel for genome-wide imputation of short tandem repeats," Nature Communications, Nature, vol. 9(1), pages 1-11, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sean A. Misek & Aaron Fultineer & Jeremie Kalfon & Javad Noorbakhsh & Isabella Boyle & Priyanka Roy & Joshua Dempster & Lia Petronio & Katherine Huang & Alham Saadat & Thomas Green & Adam Brown & John, 2024. "Germline variation contributes to false negatives in CRISPR-based experiments with varying burden across ancestries," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    2. Zhenhua Zhang & Wenchao Li & Qiuyao Zhan & Michelle Aillaud & Javier Botey-Bataller & Martijn Zoodsma & Rob Horst & Leo A. B. Joosten & Christoph Bock & Leon N. Schulte & Cheng-Jian Xu & Mihai G. Nete, 2025. "Unveiling genetic signatures of immune response in immune-related diseases through single-cell eQTL analysis across diverse conditions," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    3. repec:plo:pgen00:1003258 is not listed on IDEAS
    4. Ilias Georgakopoulos-Soares & Chengyu Deng & Vikram Agarwal & Candace S. Y. Chan & Jingjing Zhao & Fumitaka Inoue & Nadav Ahituv, 2023. "Transcription factor binding site orientation and order are major drivers of gene regulatory activity," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    5. Meida Wang & Shuanglin Zhang & Qiuying Sha, 2022. "A computationally efficient clustering linear combination approach to jointly analyze multiple phenotypes for GWAS," PLOS ONE, Public Library of Science, vol. 17(4), pages 1-13, April.
    6. repec:plo:pone00:0083057 is not listed on IDEAS
    7. Leslie A. Smith & James A. Cahill & Ji-Hyun Lee & Kiley Graim, 2025. "Equitable machine learning counteracts ancestral bias in precision medicine," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    8. Emilia Volpe & Alessio Colantoni & Luca Corda & Elena Di Tommaso & Franca Pelliccia & Riccardo Ottalevi & Andrea Guarracino & Danilo Licastro & Luigi Faino & Mattia Capulli & Giulio Formenti & Evelyne, 2025. "The reference genome of the human diploid cell line RPE-1," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    9. Xiao Chen & Daniel Baker & Egor Dolzhenko & Joseph M. Devaney & Jessica Noya & April S. Berlyoung & Rhonda Brandon & Kathleen S. Hruska & Lucas Lochovsky & Paul Kruszka & Scott Newman & Emily Farrow &, 2025. "Genome-wide profiling of highly similar paralogous genes using HiFi sequencing," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    10. repec:plo:pone00:0070774 is not listed on IDEAS
    11. Lin Yuan & Chang-An Yuan & De-Shuang Huang, 2017. "FAACOSE: A Fast Adaptive Ant Colony Optimization Algorithm for Detecting SNP Epistasis," Complexity, Hindawi, vol. 2017, pages 1-10, September.
    12. Lap Sum Chan & Gen Li & Eric B. Fauman & Xianyong Yin & Markku Laakso & Michael Boehnke & Peter X. K. Song, 2025. "DrFARM: identification of pleiotropic genetic variants in genome-wide association studies," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    13. Chang Lu & Jan Zaucha & Rihab Gam & Hai Fang & Smithers & Matt E. Oates & Miguel Bernabe-Rubio & James Williams & Natalie Zelenka & Arun Prasad Pandurangan & Himani Tandon & Hashem Shihab & Raju Kalai, 2023. "Hypothesis-free phenotype prediction within a genetics-first framework," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    14. Seong Kyu Han & Michelle T. McNulty & Christopher J. Benway & Pei Wen & Anya Greenberg & Ana C. Onuchic-Whitford & Dongkeun Jang & Jason Flannick & Noël P. Burtt & Parker C. Wilson & Benjamin D. Humph, 2023. "Mapping genomic regulation of kidney disease and traits through high-resolution and interpretable eQTLs," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    15. Ian Barnett & Rajarshi Mukherjee & Xihong Lin, 2017. "The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 64-76, January.
    16. repec:plo:pone00:0118701 is not listed on IDEAS
    17. repec:plo:pgen00:1002078 is not listed on IDEAS
    18. Saedis Saevarsdottir & Kristbjörg Bjarnadottir & Thorsteinn Markusson & Jonas Berglund & Thorunn A. Olafsdottir & Gisli H. Halldorsson & Gudrun Rutsdottir & Kristbjorg Gunnarsdottir & Asgeir Orn Arnth, 2024. "Start codon variant in LAG3 is associated with decreased LAG-3 expression and increased risk of autoimmune thyroid disease," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    19. Bingxin Zhao & Fei Zou, 2022. "On polygenic risk scores for complex traits prediction," Biometrics, The International Biometric Society, vol. 78(2), pages 499-511, June.
    20. von Stumm, Sophie & Kandaswamy, Radhika & Maxwell, Jessye, 2023. "Gene-environment interplay in early life cognitive development," Intelligence, Elsevier, vol. 98(C).
    21. repec:plo:pgen00:1006573 is not listed on IDEAS
    22. repec:plo:pone00:0188566 is not listed on IDEAS
    23. Satria P. Sajuthi & Jamie L. Everman & Nathan D. Jackson & Benjamin Saef & Cydney L. Rios & Camille M. Moore & Angel C. Y. Mak & Celeste Eng & Ana Fairbanks-Mahnke & Sandra Salazar & Jennifer Elhawary, 2022. "Nasal airway transcriptome-wide association study of asthma reveals genetically driven mucus pathobiology," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    24. Barbara E Stranger & Stephen B Montgomery & Antigone S Dimas & Leopold Parts & Oliver Stegle & Catherine E Ingle & Magda Sekowska & George Davey Smith & David Evans & Maria Gutierrez-Arcelus & Alkes P, 2012. "Patterns of Cis Regulatory Variation in Diverse Human Populations," PLOS Genetics, Public Library of Science, vol. 8(4), pages 1-13, April.
    25. Jiao Gong & Huiru Sun & Kaiyuan Wang & Yanhui Zhao & Yechao Huang & Qinsheng Chen & Hui Qiao & Yang Gao & Jialin Zhao & Yunchao Ling & Ruifang Cao & Jingze Tan & Qi Wang & Yanyun Ma & Jing Li & Jingch, 2025. "Long-read sequencing of 945 Han individuals identifies structural variants associated with phenotypic diversity and disease susceptibility," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    26. Nikolaos I. Panousis & Omar El Garwany & Andrew Knights & Jesse Cheruiyot Rop & Natsuhiko Kumasaka & Maria Imaz & Lorena Boquete Vilarino & Anthi Tsingene & Alex Tokolyi & Alice Barnett & Celine Gomez, 2025. "Gene expression QTL mapping in stimulated iPSC-derived macrophages provides insights into common complex diseases," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    27. repec:plo:pcbi00:1002600 is not listed on IDEAS
    28. Yu Yan & Hongbo Liu & Amin Abedini & Xin Sheng & Matthew Palmer & Hongzhe Li & Katalin Susztak, 2024. "Unraveling the epigenetic code: human kidney DNA methylation and chromatin dynamics in renal disease development," Nature Communications, Nature, vol. 15(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-54678-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.