Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-42491-0.html

Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies

Authors

  • Mateus H. Gouveia (National Institutes of Health)
  • Amy R. Bentley (National Institutes of Health)
  • Thiago P. Leal (Cleveland Clinic)
  • Eduardo Tarazona-Santos (Universidade Federal de Minas Gerais)
  • Carlos D. Bustamante (Stanford University)
  • Adebowale A. Adeyemo (National Institutes of Health)
  • Charles N. Rotimi (National Institutes of Health)
  • Daniel Shriner (National Institutes of Health)

Abstract

European-ancestry populations are recognized as stratified but not as admixed, implying that residual confounding by locus-specific ancestry can affect studies of association, polygenic adaptation, and polygenic risk scores. We integrate individual-level genome-wide data from ~19,000 European-ancestry individuals across 79 European populations and five European American cohorts. We generate a new reference panel that captures ancestral diversity missed by both the 1000 Genomes and Human Genome Diversity Projects. Both Europeans and European Americans are admixed at the subcontinental level, with admixture dates differing among subgroups of European Americans. After adjustment for both genome-wide and locus-specific ancestry, associations between a highly differentiated variant in LCT (rs4988235) and height or LDL-cholesterol were confirmed to be false positives whereas the association between LCT and body mass index was genuine. We provide formal evidence of subcontinental admixture in individuals with European ancestry, which, if not properly accounted for, can produce spurious results in genetic epidemiology studies.
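The confounding mechanism described in the abstract can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the authors' pipeline: the two ancestral sources, the allele frequencies (0.1 and 0.8), the 50/50 admixture proportion, the effect size of 0.5, and the function names `ols`, `simulate`, and `compare` are all hypothetical choices made for illustration. The variant has no effect on the trait, which depends only on local ancestry at the locus. Genome-wide ancestry, which the study also adjusts for, is left out to keep the example short.

```python
import random


def ols(y, columns):
    """Least-squares coefficients for y ~ 1 + columns, via the normal equations."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(X[0])
    # Augmented system [X'X | X'y].
    A = [
        [sum(X[i][r] * X[i][c] for i in range(n)) for c in range(k)]
        + [sum(X[i][r] * y[i] for i in range(n))]
        for r in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for p in range(k):
        best = max(range(p, k), key=lambda r: abs(A[r][p]))
        A[p], A[best] = A[best], A[p]
        for r in range(k):
            if r != p:
                f = A[r][p] / A[p][p]
                A[r] = [a - f * b for a, b in zip(A[r], A[p])]
    return [A[r][k] / A[r][r] for r in range(k)]


def simulate(n, seed=0):
    """Simulate local ancestry, genotype, and a trait with NO variant effect."""
    rng = random.Random(seed)
    freq = {0: 0.1, 1: 0.8}  # assumed allele frequency in source 0 and source 1
    local, geno, pheno = [], [], []
    for _ in range(n):
        # Each of the two haplotypes at the locus comes from source 1 with prob. 0.5.
        haps = [int(rng.random() < 0.5) for _ in range(2)]
        anc = sum(haps)  # local ancestry: copies from source 1 (0, 1, or 2)
        g = sum(int(rng.random() < freq[h]) for h in haps)  # allele count (0, 1, or 2)
        y = 0.5 * anc + rng.gauss(0.0, 1.0)  # trait depends on ancestry only
        local.append(anc)
        geno.append(g)
        pheno.append(y)
    return local, geno, pheno


def compare(n, seed=0):
    """Return the genotype slope without and with local-ancestry adjustment."""
    local, geno, pheno = simulate(n, seed)
    naive = ols(pheno, [geno])[1]
    adjusted = ols(pheno, [geno, local])[1]
    return naive, adjusted


naive, adjusted = compare(5000)
print(f"naive slope: {naive:.3f}, adjusted slope: {adjusted:.3f}")
```

Under these assumptions the unadjusted slope should be clearly non-zero, because the highly differentiated variant acts as a proxy for local ancestry. Once local ancestry enters the model as a covariate, the slope should shrink toward zero. This matches the pattern the abstract reports for the false-positive associations.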

Suggested Citation

  • Mateus H. Gouveia & Amy R. Bentley & Thiago P. Leal & Eduardo Tarazona-Santos & Carlos D. Bustamante & Adebowale A. Adeyemo & Charles N. Rotimi & Daniel Shriner, 2023. "Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-42491-0
    DOI: 10.1038/s41467-023-42491-0

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-42491-0
    File Function: Abstract
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rozaimi Mohamad Razali & Juan Rodriguez-Flores & Mohammadmersad Ghorbani & Haroon Naeem & Waleed Aamer & Elbay Aliyev & Ali Jubran & Andrew G. Clark & Khalid A. Fakhro & Younes Mokrab, 2021. "Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    2. Oscar Lao & Fan Liu & Andreas Wollstein & Manfred Kayser, 2014. "GAGA: A New Algorithm for Genomic Inference of Geographic Ancestry Reveals Fine Level Population Substructure in Europeans," PLOS Computational Biology, Public Library of Science, vol. 10(2), pages 1-11, February.
    3. Bárbara Sousa da Mota & Simone Rubinacci & Diana Ivette Cruz Dávalos & Carlos Eduardo G. Amorim & Martin Sikora & Niels N. Johannsen & Marzena H. Szmyt & Piotr Włodarczak & Anita Szczepanek & Marcin M, 2023. "Imputation of ancient human genomes," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    4. Jerome Kelleher & Alison M Etheridge & Gilean McVean, 2016. "Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-22, May.
    5. Marco Lopez-Cruz & Fernando M. Aguate & Jacob D. Washburn & Natalia Leon & Shawn M. Kaeppler & Dayane Cristina Lima & Ruijuan Tan & Addie Thompson & Laurence Willard Bretonne & Gustavo los Campos, 2023. "Leveraging data from the Genomes-to-Fields Initiative to investigate genotype-by-environment interactions in maize in North America," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    6. Beatrix Eugster & Rafael Lalive & Andreas Steinhauer & Josef Zweimüller, 2011. "The Demand for Social Insurance: Does Culture Matter?," Economic Journal, Royal Economic Society, vol. 121(556), pages 413-448, November.
    7. Filippini, Massimo & Wekhof, Tobias, 2021. "The effect of culture on energy efficient vehicle ownership," Journal of Environmental Economics and Management, Elsevier, vol. 105(C).
    8. Andrey V Khrunin & Denis V Khokhrin & Irina N Filippova & Tõnu Esko & Mari Nelis & Natalia A Bebyakova & Natalia L Bolotova & Janis Klovins & Liene Nikitina-Zake & Karola Rehnström & Samuli Ripatti & , 2013. "A Genome-Wide Analysis of Populations from European Russia Reveals a New Pole of Genetic Diversity in Northern Europe," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-9, March.
    9. Wenhan Chen & Yang Wu & Zhili Zheng & Ting Qi & Peter M. Visscher & Zhihong Zhu & Jian Yang, 2021. "Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    10. Parsa Akbari & Olukayode A. Sosina & Jonas Bovijn & Karl Landheer & Jonas B. Nielsen & Minhee Kim & Senem Aykul & Tanima De & Mary E. Haas & George Hindy & Nan Lin & Ian R. Dinsmore & Jonathan Z. Luo , 2022. "Multiancestry exome sequencing reveals INHBE mutations associated with favorable fat distribution and protection from diabetes," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    11. Pierre Luisi & Angelina García & Juan Manuel Berros & Josefina M B Motti & Darío A Demarchi & Emma Alfaro & Eliana Aquilano & Carina Argüelles & Sergio Avena & Graciela Bailliet & Julieta Beltramo & C, 2020. "Fine-scale genomic analyses of admixed individuals reveal unrecognized genetic ancestry components in Argentina," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-30, July.
    12. Brielin C Brown & Nicolas L Bray & Lior Pachter, 2018. "Expression reflects population structure," PLOS Genetics, Public Library of Science, vol. 14(12), pages 1-15, December.
    13. Gad Abraham & Michael Inouye, 2014. "Fast Principal Component Analysis of Large-Scale Genome-Wide Data," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-5, April.
    14. Beatrix Brügger & Rafael Lalive & Josef Zweimüller, 2009. "Does Culture Affect Unemployment? Evidence from the Röstigraben," NRN working papers 2009-10, The Austrian Center for Labor Economics and the Analysis of the Welfare State, Johannes Kepler University Linz, Austria.
    15. Diana Chang & Alon Keinan, 2014. "Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies," PLOS Computational Biology, Public Library of Science, vol. 10(9), pages 1-14, September.
    16. Alejandro Ochoa & John D Storey, 2021. "Estimating FST and kinship for arbitrary population structures," PLOS Genetics, Public Library of Science, vol. 17(1), pages 1-36, January.
    17. Victor Ronda & Esben Agerbo & Dorthe Bleses & Preben Bo Mortensen & Anders Børglum & Ole Mors & Michael Rosholm & David M. Hougaard & Merete Nordentoft & Thomas Werge, 2022. "Family disadvantage, gender, and the returns to genetic human capital," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(2), pages 550-578, April.
    18. Feldman, Michael J., 2023. "Spiked singular values and vectors under extreme aspect ratios," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    19. Nicola Barban & Elisabetta De Cao & Sonia Oreffice & Climent Quintana-Domeque, 2016. "Assortative Mating on Education: A Genetic Assessment," Working Papers 2016-034, Human Capital and Economic Opportunity Working Group.
    20. Buzbas, Erkan Ozge & Verdu, Paul, 2018. "Inference on admixture fractions in a mechanistic model of recurrent admixture," Theoretical Population Biology, Elsevier, vol. 122(C), pages 149-157.
