Genetic ancestry and population structure in the All of Us Research Program cohort

Author

Listed:
  • Shivam Sharma

    (Georgia Institute of Technology
    National Institutes of Health)

  • Shashwat Deepali Nagar

    (Georgia Institute of Technology)

  • Priscilla Pemu

    (Morehouse School of Medicine
    University of Miami)

  • Stephan Zuchner

    (University of Miami)

  • Leonardo Mariño-Ramírez

    (National Institutes of Health)

  • Robert Meller

    (Morehouse School of Medicine
    University of Miami)

  • I. King Jordan

    (Georgia Institute of Technology)

Abstract

We analyzed participant genomic variant data to characterize population structure and genetic ancestry for the All of Us cohort (n = 297,549). There is substantial population structure in the cohort, with clusters of closely related participants interspersed among less related individuals. Participants show diverse genetic ancestry, with major contributions from European (66.4%), African (19.5%), Asian (7.6%), and American (6.3%) continental ancestry components. Participant genetic similarity clusters show group-specific ancestry, with distinct patterns of continental and subcontinental ancestry among groups. African and American ancestry are enriched in the southeast and southwest regions of the country, respectively, whereas European ancestry is more evenly distributed across the US. The diversity of All of Us participants’ genetic ancestry is negatively correlated with age; younger participants show higher levels of genetic admixture compared to older participants. Our results underscore the ancestral genetic diversity of the All of Us cohort, a crucial prerequisite for genomic health equity.

Suggested Citation

  • Shivam Sharma & Shashwat Deepali Nagar & Priscilla Pemu & Stephan Zuchner & Leonardo Mariño-Ramírez & Robert Meller & I. King Jordan, 2025. "Genetic ancestry and population structure in the All of Us Research Program cohort," Nature Communications, Nature, vol. 16(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-59351-8
    DOI: 10.1038/s41467-025-59351-8

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-59351-8
    File Function: Abstract
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jon Lerga-Jaso & Biljana Novković & Deepu Unnikrishnan & Varuna Bamunusinghe & Marcelinus R. Hatorangan & Charlie Manson & Haley Pedersen & Alex Osama & Andrew Terpolovsky & Sandra Bohn & Adriano De M, 2025. "Tracing human genetic histories and natural selection with precise local ancestry inference," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    2. Nadine R. Caron & Wilf Adam & Kate Anderson & Brooke T. Boswell & Meck Chongo & Viktor Deineko & Alexanne Dick & Shannon E. Hall & Jessica T. Hatcher & Patricia Howard & Megan Hunt & Kevin Linn & Ashl, 2023. "Partnering with First Nations in Northern British Columbia Canada to Reduce Inequity in Access to Genomic Research," IJERPH, MDPI, vol. 20(10), pages 1-31, May.
    3. Pei-Kuan Cong & Wei-Yang Bai & Jin-Chen Li & Meng-Yuan Yang & Saber Khederzadeh & Si-Rui Gai & Nan Li & Yu-Heng Liu & Shi-Hui Yu & Wei-Wei Zhao & Jun-Quan Liu & Yi Sun & Xiao-Wei Zhu & Pian-Pian Zhao , 2022. "Genomic analyses of 10,376 individuals in the Westlake BioBank for Chinese (WBBC) pilot project," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Ido Amit & Kristin Ardlie & Fabiana Arzuaga & Gordon Awandare & Gary Bader & Alexander Bernier & Piero Carninci & Stacey Donnelly & Roland Eils & Alistair R. R. Forrest & Henry T. Greely & Roderic Gui, 2024. "The commitment of the human cell atlas to humanity," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
    5. Kevin L Keys & Angel C Y Mak & Marquitta J White & Walter L Eckalbar & Andrew W Dahl & Joel Mefford & Anna V Mikhaylova & María G Contreras & Jennifer R Elhawary & Celeste Eng & Donglei Hu & Scott Hun, 2020. "On the cross-population generalizability of gene expression prediction models," PLOS Genetics, Public Library of Science, vol. 16(8), pages 1-28, August.
    6. repec:plo:pgen00:1000078 is not listed on IDEAS
    7. Quamrul H. Ashraf & Oded Galor, 2018. "The Macrogenoeconomics of Comparative Development," Journal of Economic Literature, American Economic Association, vol. 56(3), pages 1119-1155, September.
    8. Leslie A. Smith & James A. Cahill & Ji-Hyun Lee & Kiley Graim, 2025. "Equitable machine learning counteracts ancestral bias in precision medicine," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    9. Michel S. Naslavsky & Marilia O. Scliar & Guilherme L. Yamamoto & Jaqueline Yu Ting Wang & Stepanka Zverinova & Tatiana Karp & Kelly Nunes & José Ricardo Magliocco Ceroni & Diego Lima Carvalho & Carlo, 2022. "Whole-genome sequencing of 1,171 elderly admixed individuals from Brazil," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    10. Nick Patterson & Alkes L Price & David Reich, 2006. "Population Structure and Eigenanalysis," PLOS Genetics, Public Library of Science, vol. 2(12), pages 1-20, December.
    11. Frédérik Saltré & Joël Chadœuf & Thomas Higham & Monty Ochocki & Sebastián Block & Ellyse Bunney & Bastien Llamas & Corey J. A. Bradshaw, 2024. "Environmental conditions associated with initial northern expansion of anatomically modern humans," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    12. Gustav Agneman & Lisa Strömbom, 2025. "How Ethnic Discrimination Shapes Political Reintegration After War: Insights From a Conjoint Experiment in Colombia," Journal of Conflict Resolution, Peace Science Society (International), vol. 69(1), pages 152-177, January.
    13. repec:plo:pone00:0213766 is not listed on IDEAS
    14. Vaughan, Laura K. & Divers, Jasmin & Padilla, Miguel A. & Redden, David T. & Tiwari, Hemant K. & Pomp, Daniel & Allison, David B., 2009. "The use of plasmodes as a supplement to simulations: A simple example evaluating individual admixture estimation methodologies," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1755-1766, March.
    15. Wei Fu & Shin-Yi Chou & Li-San Wang, 2022. "NIH Grant Expansion, Ancestral Diversity and Scientific Discovery in Genomics Research," NBER Working Papers 30155, National Bureau of Economic Research, Inc.
    16. Baier, Tina & Lyngstad, Torkild Hovde, 2024. "Social Background Effects on Educational Outcomes - New Insights from Modern Genetic Science," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 76(3), pages 525-545.
    17. Tristan Salles & Renaud Joannes-Boyau & Ian Moffat & Laurent Husson & Manon Lorcery, 2024. "Physiography, foraging mobility, and the first peopling of Sahul," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    18. Bing Guo & Victor Borda & Roland Laboulaye & Michele D. Spring & Mariusz Wojnarski & Brian A. Vesely & Joana C. Silva & Norman C. Waters & Timothy D. O’Connor & Shannon Takala-Harrison, 2024. "Strong positive selection biases identity-by-descent-based inferences of recent demography and population structure in Plasmodium falciparum," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    19. Xiaodong Liu & Ke Zhang & Neslihan A. Kaya & Zhe Jia & Dafei Wu & Tingting Chen & Zhiyuan Liu & Sinan Zhu & Axel M. Hillmer & Torsten Wuestefeld & Jin Liu & Yun Shen Chan & Zheng Hu & Liang Ma & Li Ji, 2024. "Tumor phylogeography reveals block-shaped spatial heterogeneity and the mode of evolution in Hepatocellular Carcinoma," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    20. Rob Cooke & Ferran Sayol & Tobias Andermann & Tim M. Blackburn & Manuel J. Steinbauer & Alexandre Antonelli & Søren Faurby, 2023. "Undiscovered bird extinctions obscure the true magnitude of human-driven extinction waves," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    21. Shim, Janet K. & Bentz, Michael & Vasquez, Emily & Jeske, Melanie & Saperstein, Aliya & Fullerton, Stephanie M. & Foti, Nicole & McMahon, Caitlin & Lee, Sandra Soo-Jin, 2022. "Strategies of inclusion: The tradeoffs of pursuing “baked in” diversity through place-based recruitment," Social Science & Medicine, Elsevier, vol. 306(C).
    22. Giuliano, Paola & Spilimbergo, Antonio & Tonon, Giovanni, 2006. "Genetic, Cultural and Geographical Distances," IZA Discussion Papers 2229, Institute of Labor Economics (IZA).
