IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0207677.html
   My bibliography  Save this article

The impact of a fine-scale population stratification on rare variant association test results

Author

Listed:
  • Elodie Persyn
  • Richard Redon
  • Lise Bellanger
  • Christian Dina

Abstract

Population stratification is a well-known confounding factor in both common and rare variant association analyses. Rare variants tend to be more geographically clustered than common variants, because of their more recent origin. However, it is not yet clear if population stratification at a very fine scale (neighboring administrative regions within a country) would lead to statistical bias in rare variant analyses. As the inclusion of convenience controls from external studies is indeed a common procedure, in order to increase the power to detect genetic associations, this problem is important. We studied through simulation the impact of a fine scale population structure on different rare variant association strategies, assessing type I error and power. We showed that principal component analysis (PCA) based methods of adjustment for population stratification adequately corrected type I error inflation at the largest geographical scales, but not at finest scales. We also showed in our simulations that adding controls obviously increased power, but at a considerably lower level when controls were drawn from another population.

Suggested Citation

  • Elodie Persyn & Richard Redon & Lise Bellanger & Christian Dina, 2018. "The impact of a fine-scale population stratification on rare variant association test results," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-17, December.
  • Handle: RePEc:plo:pone00:0207677
    DOI: 10.1371/journal.pone.0207677
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0207677
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0207677&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0207677?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Elodie Persyn & Matilde Karakachoff & Solena Le Scouarnec & Camille Le Clézio & Dominique Campion & French Exome Consortium & Jean-Jacques Schott & Richard Redon & Lise Bellanger & Christian Dina, 2017. "DoEstRare: A statistical test to identify local enrichments in rare genomic variants associated with disease," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-20, July.
    2. Benjamin M Neale & Manuel A Rivas & Benjamin F Voight & David Altshuler & Bernie Devlin & Marju Orho-Melander & Sekar Kathiresan & Shaun M Purcell & Kathryn Roeder & Mark J Daly, 2011. "Testing for an Unusual Distribution of Rare Variants," PLOS Genetics, Public Library of Science, vol. 7(3), pages 1-8, March.
    3. Yun-Ching Chen & Hannah Carter & Jennifer Parla & Melissa Kramer & Fernando S Goes & Mehdi Pirooznia & Peter P Zandi & W Richard McCombie & James B Potash & Rachel Karchin, 2013. "A Hybrid Likelihood Model for Sequence-Based Disease Association Studies," PLOS Genetics, Public Library of Science, vol. 9(1), pages 1-18, January.
    4. Robert B. Davies, 1980. "The Distribution of a Linear Combination of χ2 Random Variables," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(3), pages 323-333, November.
    5. Nick Patterson & Alkes L Price & David Reich, 2006. "Population Structure and Eigenanalysis," PLOS Genetics, Public Library of Science, vol. 2(12), pages 1-20, December.
    6. Timothy D O’Connor & Adam Kiezun & Michael Bamshad & Stephen S Rich & Joshua D Smith & Emily Turner & NHLBIGO Exome Sequencing Project & ESP Population Genetics, Statistical Analysis Working Group & S, 2013. "Fine-Scale Patterns of Population Stratification Confound Rare Variant Association Tests," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-10, July.
    7. Wan-Yu Lin, 2014. "Association Testing of Clustered Rare Causal Variants in Case-Control Studies," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-6, April.
    8. Dajiang J Liu & Suzanne M Leal, 2010. "A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions," PLOS Genetics, Public Library of Science, vol. 6(10), pages 1-14, October.
    9. Marie-Claude Babron & Marie de Tayrac & Douglas N Rutledge & Eleftheria Zeggini & Emmanuelle Génin, 2012. "Rare and Low Frequency Variant Stratification in the UK Population: Description and Impact on Association Tests," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-9, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chung-Feng Kao & Jia-Rou Liu & Hung Hung & Po-Hsiu Kuo, 2015. "A Robust GWSS Method to Simultaneously Detect Rare and Common Variants for Complex Disease," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-14, April.
    2. Gyaneshwer Chaubey & Anurag Kadian & Saroj Bala & Vadlamudi Raghavendra Rao, 2015. "Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-11, June.
    3. Daniel Svensson & Matilda Rentoft & Anna M Dahlin & Emma Lundholm & Pall I Olason & Andreas Sjödin & Carin Nylander & Beatrice S Melin & Johan Trygg & Erik Johansson, 2020. "A whole-genome sequenced control population in northern Sweden reveals subregional genetic differences," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-18, September.
    4. Estavoyer, Maxime & François, Olivier, 2022. "Theoretical analysis of principal components in an umbrella model of intraspecific evolution," Theoretical Population Biology, Elsevier, vol. 148(C), pages 11-21.
    5. Felsenstein, Joseph, 2015. "Covariation of gene frequencies in a stepping-stone lattice of populations," Theoretical Population Biology, Elsevier, vol. 100(C), pages 88-97.
    6. Yaron Granot & Omri Tal & Saharon Rosset & Karl Skorecki, 2016. "On the Apportionment of Population Structure," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-24, August.
    7. Escanciano, Juan Carlos & Jacho-Chávez, David T., 2010. "Approximating the critical values of Cramér-von Mises tests in general parametric conditional specifications," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 625-636, March.
    8. Pötscher, Benedikt M. & Preinerstorfer, David, 2021. "Valid Heteroskedasticity Robust Testing," MPRA Paper 107420, University Library of Munich, Germany.
    9. Hyosik Jang & Ian M Ehrenreich, 2012. "Genome-Wide Characterization of Genetic Variation in the Unicellular, Green Alga Chlamydomonas reinhardtii," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-9, July.
    10. Mathieu Gautier & Denis Laloë & Katayoun Moazami-Goudarzi, 2010. "Insights into the Genetic History of French Cattle from Dense SNP Data on 47 Worldwide Breeds," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-11, September.
    11. Xiaofeng Cai & Xuepeng Sun & Chenxi Xu & Honghe Sun & Xiaoli Wang & Chenhui Ge & Zhonghua Zhang & Quanxi Wang & Zhangjun Fei & Chen Jiao & Quanhua Wang, 2021. "Genomic analyses provide insights into spinach domestication and the genetic basis of agronomic traits," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    12. Lee, Anthony J. & Hibbs, Courtney & Wright, Margaret J. & Martin, Nicholas G. & Keller, Matthew C. & Zietsch, Brendan P., 2017. "Assessing the accuracy of perceptions of intelligence based on heritable facial features," Intelligence, Elsevier, vol. 64(C), pages 1-8.
    13. Thompson Katherine L. & Linnen Catherine R. & Kubatko Laura, 2016. "Tree-based quantitative trait mapping in the presence of external covariates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(6), pages 473-490, December.
    14. Chin-Shang Li, 2016. "A test for the linearity of the nonparametric part of a semiparametric logistic regression model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(3), pages 461-475, March.
    15. Taamouti, Abderrahim, 2009. "Analytical Value-at-Risk and Expected Shortfall under regime-switching," Finance Research Letters, Elsevier, vol. 6(3), pages 138-151, September.
    16. Matthieu Bouaziz & Caroline Paccard & Mickael Guedj & Christophe Ambroise, 2012. "SHIPS: Spectral Hierarchical Clustering for the Inference of Population Structure in Genetic Studies," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-17, October.
    17. Nicolas Städler & Sach Mukherjee, 2017. "Two-sample testing in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 225-246, January.
    18. Jacobo Pardo-Seco & Alberto Gómez-Carballa & Jorge Amigo & Federico Martinón-Torres & Antonio Salas, 2014. "A Genome-Wide Study of Modern-Day Tuscans: Revisiting Herodotus's Theory on the Origin of the Etruscans," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    19. Mahmood Kharrati-Kopaei, 2021. "On the exact distribution of the likelihood ratio test statistic for testing the homogeneity of the scale parameters of several inverse Gaussian distributions," Computational Statistics, Springer, vol. 36(2), pages 1123-1138, June.
    20. Andrey V Khrunin & Denis V Khokhrin & Irina N Filippova & Tõnu Esko & Mari Nelis & Natalia A Bebyakova & Natalia L Bolotova & Janis Klovins & Liene Nikitina-Zake & Karola Rehnström & Samuli Ripatti & , 2013. "A Genome-Wide Analysis of Populations from European Russia Reveals a New Pole of Genetic Diversity in Northern Europe," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-9, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0207677. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.