
The Patterson-Price-Reich's rule of population structure analysis from genetic marker data

Author

Listed:
  • Wang, Jinliang

Abstract

Delineating population structure from the marker genotypes of a sample of individuals is now routinely conducted in the fields of molecular ecology, evolution and conservation biology. Various Bayesian and likelihood methods as well as more general statistical methods (e.g. PCA) have been proposed to detect population structure, to assign sampled individuals to discrete clusters (subpopulations), and to estimate the admixture proportions of each sampled individual. Regardless of the methods, the power of a structure analysis depends on the strength of population structure (measured by $F_{ST}$) relative to the amount of marker information (measured by $NL$, where $N$ and $L$ are the numbers of sampled individuals and loci respectively). Patterson, Price and Reich (2006) proposed that population structure is unidentifiable when data size $D = NL$ is smaller than $1/F_{ST}^2$ and quickly becomes identifiable easily with an increasing $D$ or $F_{ST}$ when $D > 1/F_{ST}^2$. In this study, I investigated this phase change PPR rule by analysing both simulated genomic data and empirical data by four likelihood admixture analysis methods. The results show that the PPR rule is largely valid, but the accuracy of a structure analysis is also affected by the number of subpopulations $K$. A more complicated population structure with a larger $K$ requires a larger $NLF_{ST}^2$ to resolve accurately. For a given $NLF_{ST}^2$ above the PPR threshold value of 1, increasing $L$ and decreasing $N$ is advantageous over increasing $N$ and decreasing $L$ in improving admixture estimation accuracy.
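
As a rough illustration of the threshold stated in the abstract, the short Python sketch below evaluates N × L × FST² for a proposed sampling design and compares it with the PPR threshold of 1. The function names (`ppr_ratio`, `is_identifiable`, `threshold_data_size`) and the example values are hypothetical and are not taken from the paper. The sketch encodes only the basic inequality D > 1/FST²; it ignores the effect of the number of subpopulations K that the abstract reports.

```python
def ppr_ratio(n_individuals: int, n_loci: int, fst: float) -> float:
    """Return N * L * FST^2, the quantity compared with the PPR threshold of 1."""
    if n_individuals <= 0 or n_loci <= 0:
        raise ValueError("N and L must be positive")
    if not 0.0 < fst <= 1.0:
        raise ValueError("FST must lie in (0, 1]")
    return n_individuals * n_loci * fst ** 2


def is_identifiable(n_individuals: int, n_loci: int, fst: float) -> bool:
    """True when the data size D = N * L exceeds 1 / FST^2 (ratio above 1)."""
    return ppr_ratio(n_individuals, n_loci, fst) > 1.0


def threshold_data_size(fst: float) -> float:
    """Data size D = N * L at which the PPR ratio equals 1, i.e. 1 / FST^2."""
    if not 0.0 < fst <= 1.0:
        raise ValueError("FST must lie in (0, 1]")
    return 1.0 / fst ** 2


if __name__ == "__main__":
    # Illustrative values only: weak structure (FST = 0.01), two sampling designs.
    for n, l in [(20, 50), (100, 1000)]:
        print(n, l, ppr_ratio(n, l, 0.01), is_identifiable(n, l, 0.01))
```

Under this rule, halving FST quadruples the data size needed to cross the threshold, because the threshold scales as 1/FST².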

Suggested Citation

  • Wang, Jinliang, 2025. "The Patterson-Price-Reich's rule of population structure analysis from genetic marker data," Theoretical Population Biology, Elsevier, vol. 163(C), pages 13-23.
  • Handle: RePEc:eee:thpobi:v:163:y:2025:i:c:p:13-23
    DOI: 10.1016/j.tpb.2025.03.001

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580925000188
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2025.03.001?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Nick Patterson & Alkes L Price & David Reich, 2006. "Population Structure and Eigenanalysis," PLOS Genetics, Public Library of Science, vol. 2(12), pages 1-20, December.
    2. George Nicholson & Albert V. Smith & Frosti Jónsson & Ómar Gústafsson & Kári Stefánsson & Peter Donnelly, 2002. "Assessing population differentiation and isolation from single‐nucleotide polymorphism data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 695-715, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gyaneshwer Chaubey & Anurag Kadian & Saroj Bala & Vadlamudi Raghavendra Rao, 2015. "Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-11, June.
    2. Daniel Svensson & Matilda Rentoft & Anna M Dahlin & Emma Lundholm & Pall I Olason & Andreas Sjödin & Carin Nylander & Beatrice S Melin & Johan Trygg & Erik Johansson, 2020. "A whole-genome sequenced control population in northern Sweden reveals subregional genetic differences," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-18, September.
    3. Estavoyer, Maxime & François, Olivier, 2022. "Theoretical analysis of principal components in an umbrella model of intraspecific evolution," Theoretical Population Biology, Elsevier, vol. 148(C), pages 11-21.
    4. Felsenstein, Joseph, 2015. "Covariation of gene frequencies in a stepping-stone lattice of populations," Theoretical Population Biology, Elsevier, vol. 100(C), pages 88-97.
    5. Yaron Granot & Omri Tal & Saharon Rosset & Karl Skorecki, 2016. "On the Apportionment of Population Structure," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-24, August.
    6. repec:plo:pone00:0185380 is not listed on IDEAS
    7. Özkan İş & Xue Wang & Joseph S. Reddy & Yuhao Min & Elanur Yilmaz & Prabesh Bhattarai & Tulsi Patel & Jeremiah Bergman & Zachary Quicksall & Michael G. Heckman & Frederick Q. Tutor-New & Birsen Can De, 2024. "Gliovascular transcriptional perturbations in Alzheimer’s disease reveal molecular mechanisms of blood brain barrier dysfunction," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    8. Ambroise Wonkam & Kevin Esoh & Rachel M. Levine & Valentina Josiane Ngo Bitoungui & Khuthala Mnika & Nikitha Nimmagadda & Erin A. D. Dempsey & Siana Nkya & Raphael Z. Sangeda & Victoria Nembaware & Ja, 2025. "FLT1 and other candidate fetal haemoglobin modifying loci in sickle cell disease in African ancestries," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    9. Hyosik Jang & Ian M Ehrenreich, 2012. "Genome-Wide Characterization of Genetic Variation in the Unicellular, Green Alga Chlamydomonas reinhardtii," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-9, July.
    10. Mathieu Gautier & Denis Laloë & Katayoun Moazami-Goudarzi, 2010. "Insights into the Genetic History of French Cattle from Dense SNP Data on 47 Worldwide Breeds," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-11, September.
    11. Xiaofeng Cai & Xuepeng Sun & Chenxi Xu & Honghe Sun & Xiaoli Wang & Chenhui Ge & Zhonghua Zhang & Quanxi Wang & Zhangjun Fei & Chen Jiao & Quanhua Wang, 2021. "Genomic analyses provide insights into spinach domestication and the genetic basis of agronomic traits," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    12. repec:plo:pone00:0112640 is not listed on IDEAS
    13. Lee, Anthony J. & Hibbs, Courtney & Wright, Margaret J. & Martin, Nicholas G. & Keller, Matthew C. & Zietsch, Brendan P., 2017. "Assessing the accuracy of perceptions of intelligence based on heritable facial features," Intelligence, Elsevier, vol. 64(C), pages 1-8.
    14. Thompson Katherine L. & Linnen Catherine R. & Kubatko Laura, 2016. "Tree-based quantitative trait mapping in the presence of external covariates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(6), pages 473-490, December.
    15. Matthieu Bouaziz & Caroline Paccard & Mickael Guedj & Christophe Ambroise, 2012. "SHIPS: Spectral Hierarchical Clustering for the Inference of Population Structure in Genetic Studies," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-17, October.
    16. Jacobo Pardo-Seco & Alberto Gómez-Carballa & Jorge Amigo & Federico Martinón-Torres & Antonio Salas, 2014. "A Genome-Wide Study of Modern-Day Tuscans: Revisiting Herodotus's Theory on the Origin of the Etruscans," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    17. Andrey V Khrunin & Denis V Khokhrin & Irina N Filippova & Tõnu Esko & Mari Nelis & Natalia A Bebyakova & Natalia L Bolotova & Janis Klovins & Liene Nikitina-Zake & Karola Rehnström & Samuli Ripatti & , 2013. "A Genome-Wide Analysis of Populations from European Russia Reveals a New Pole of Genetic Diversity in Northern Europe," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-9, March.
    18. Ilja M Nolte & Chris Wallace & Stephen J Newhouse & Daryl Waggott & Jingyuan Fu & Nicole Soranzo & Rhian Gwilliam & Panos Deloukas & Irina Savelieva & Dongling Zheng & Chrysoula Dalageorgou & Martin F, 2009. "Common Genetic Variation Near the Phospholamban Gene Is Associated with Cardiac Repolarisation: Meta-Analysis of Three Genome-Wide Association Studies," PLOS ONE, Public Library of Science, vol. 4(7), pages 1-10, July.
    19. Hoicheong Siu & Li Jin & Momiao Xiong, 2012. "Manifold Learning for Human Population Structure Studies," PLOS ONE, Public Library of Science, vol. 7(1), pages 1-18, January.
    20. Elodie Persyn & Richard Redon & Lise Bellanger & Christian Dina, 2018. "The impact of a fine-scale population stratification on rare variant association test results," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-17, December.
    21. Lap Sum Chan & Gen Li & Eric B. Fauman & Xianyong Yin & Markku Laakso & Michael Boehnke & Peter X. K. Song, 2025. "DrFARM: identification of pleiotropic genetic variants in genome-wide association studies," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    22. Andre Krumel Portella & Afroditi Papantoni & Catherine Paquet & Spencer Moore & Keri Shiels Rosch & Stewart Mostofsky & Richard S Lee & Kimberly R Smith & Robert Levitan & Patricia Pelufo Silveira & S, 2020. "Predicted DRD4 prefrontal gene expression moderates snack intake and stress perception in response to the environment in adolescents," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-20, June.

