IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v166y2025icp116-137.html

Using mathematical constraints to explain narrow ranges for allele-sharing dissimilarities

Author

Listed:
  • Liu, Xiran
  • Ahsan, Zarif
  • Rosenberg, Noah A.

Abstract

Allele-sharing dissimilarity (ASD) statistics are measures of genetic differentiation for pairs of individuals or populations. Given the allele-frequency distributions of two populations — possibly the same population — the expected value of an ASD statistic is computed by evaluating the expectation of the pairwise dissimilarity between two individuals drawn at random, each from its associated allele-frequency distribution. For each of two ASD statistics, which we term D1 and D2, we investigate the extent to which the expected ASD is constrained by allele frequencies in the two populations; in other words, how is the magnitude of the measure bounded as a function of the frequency of the most frequent allelic type? We first consider dissimilarity of a population with itself, obtaining bounds on expected ASD in terms of the frequency of the most frequent allelic type in the population. We then examine pairs of populations that might or might not possess the same most frequent allelic type. Across the unit interval for the frequency of the most frequent allelic type, the expected allele-sharing dissimilarity has a range that is more restricted than the [0,1] interval. The mathematical constraints on expected ASD assist in explaining a pattern observed empirically in human populations, namely that when averaging across loci, allele-sharing dissimilarities between pairs of individuals often tend to vary only within a relatively narrow range.

Suggested Citation

  • Liu, Xiran & Ahsan, Zarif & Rosenberg, Noah A., 2025. "Using mathematical constraints to explain narrow ranges for allele-sharing dissimilarities," Theoretical Population Biology, Elsevier, vol. 166(C), pages 116-137.
  • Handle: RePEc:eee:thpobi:v:166:y:2025:i:c:p:116-137
    DOI: 10.1016/j.tpb.2025.05.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S004058092500036X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2025.05.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Noah A Rosenberg & Saurabh Mahajan & Sohini Ramachandran & Chengfeng Zhao & Jonathan K Pritchard & Marcus W Feldman, 2005. "Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure," PLOS Genetics, Public Library of Science, vol. 1(6), pages 1-12, December.
    2. Garud, Nandita R. & Rosenberg, Noah A., 2015. "Enhancing the mathematical properties of new haplotype homozygosity statistics for the detection of selective sweeps," Theoretical Population Biology, Elsevier, vol. 102(C), pages 94-101.
    3. Edge, Michael D. & Rosenberg, Noah A., 2014. "Upper bounds on FST in terms of the frequency of the most frequent allele and total homozygosity: The case of a specified number of alleles," Theoretical Population Biology, Elsevier, vol. 97(C), pages 20-34.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:plo:pgen00:1000078 is not listed on IDEAS
    2. Nick Patterson & Alkes L Price & David Reich, 2006. "Population Structure and Eigenanalysis," PLOS Genetics, Public Library of Science, vol. 2(12), pages 1-20, December.
    3. Alejandro Ochoa & John D Storey, 2021. "Estimating FST and kinship for arbitrary population structures," PLOS Genetics, Public Library of Science, vol. 17(1), pages 1-36, January.
    4. Vaughan, Laura K. & Divers, Jasmin & Padilla, Miguel A. & Redden, David T. & Tiwari, Hemant K. & Pomp, Daniel & Allison, David B., 2009. "The use of plasmodes as a supplement to simulations: A simple example evaluating individual admixture estimation methodologies," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1755-1766, March.
    5. Garud, Nandita R. & Rosenberg, Noah A., 2015. "Enhancing the mathematical properties of new haplotype homozygosity statistics for the detection of selective sweeps," Theoretical Population Biology, Elsevier, vol. 102(C), pages 94-101.
    6. Shivam Sharma & Shashwat Deepali Nagar & Priscilla Pemu & Stephan Zuchner & Leonardo Mariño-Ramírez & Robert Meller & I. King Jordan, 2025. "Genetic ancestry and population structure in the All of Us Research Program cohort," Nature Communications, Nature, vol. 16(1), pages 1-10, December.
    7. Giuliano, Paola & Spilimbergo, Antonio & Tonon, Giovanni, 2006. "Genetic, Cultural and Geographical Distances," IZA Discussion Papers 2229, IZA Network @ LISER.
    8. Ramachandran, Sohini & Rosenberg, Noah A. & Feldman, Marcus W. & Wakeley, John, 2008. "Population differentiation and migration: Coalescence times in a two-sex island model for autosomal and X-linked loci," Theoretical Population Biology, Elsevier, vol. 74(4), pages 291-301.
    9. Quamrul H. Ashraf & Oded Galor, 2018. "The Macrogenoeconomics of Comparative Development," Journal of Economic Literature, American Economic Association, vol. 56(3), pages 1119-1155, September.
    10. Arbisser, Ilana M. & Rosenberg, Noah A., 2020. "FST and the triangle inequality for biallelic markers," Theoretical Population Biology, Elsevier, vol. 133(C), pages 117-129.
    11. Ting Fung Ma & Fangfang Wang & Jun Zhu, 2023. "On generalized latent factor modeling and inference for high‐dimensional binomial data," Biometrics, The International Biometric Society, vol. 79(3), pages 2311-2320, September.
    12. Szpiech, Zachary A. & Rosenberg, Noah A., 2011. "On the size distribution of private microsatellite alleles," Theoretical Population Biology, Elsevier, vol. 80(2), pages 100-113.
    13. Ricardo Kanitz & Elsa G Guillot & Sylvain Antoniazza & Samuel Neuenschwander & Jérôme Goudet, 2018. "Complex genetic patterns in human arise from a simple range-expansion model over continental landmasses," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-16, February.
    14. Liu Xiran & Ahsan Zarif & Martheswaran Tarun K. & Rosenberg Noah A., 2023. "When is the allele-sharing dissimilarity between two populations exceeded by the allele-sharing dissimilarity of a population with itself?," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 22(1), pages 1-24, January.
    15. Devansh Pandey & Mariana Harris & Nandita R. Garud & Vagheesh M. Narasimhan, 2024. "Leveraging ancient DNA to uncover signals of natural selection in Europe lost due to admixture or drift," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    16. Eric R Londin & Margaret A Keller & Cathleen Maista & Gretchen Smith & Laura A Mamounas & Ran Zhang & Steven J Madore & Katrina Gwinn & Roderick A Corriveau, 2010. "CoAIMs: A Cost-Effective Panel of Ancestry Informative Markers for Determining Continental Origins," PLOS ONE, Public Library of Science, vol. 5(10), pages 1-12, October.
    17. Peristera Paschou & Petros Drineas & Jamey Lewis & Caroline M Nievergelt & Deborah A Nickerson & Joshua D Smith & Paul M Ridker & Daniel I Chasman & Ronald M Krauss & Elad Ziv, 2008. "Tracing Sub-Structure in the European American Population with PCA-Informative Markers," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-13, July.
    18. Paye, Sanjana M. & Edge, Michael D., 2025. "Mathematical bounds on r2 and the effect size in case-control genome-wide association studies," Theoretical Population Biology, Elsevier, vol. 164(C), pages 1-11.
    19. Charlotte Faurie & Clement Mettling & Mohamed Ali Bchir & Danang Hadmoko & Carine Heitz & Evi Lestari & Michel Raymond & Marc Willinger, 2016. "Evidence of genotypic adaptation to the exposure to volcanic risk at the dopamine receptor DRD4 locus," Post-Print hal-02062364, HAL.
    20. Frank, Reanne, 2007. "What to make of it? The (Re)emergence of a biological conceptualization of race in health disparities research," Social Science & Medicine, Elsevier, vol. 64(10), pages 1977-1983, May.
    21. Catherine Bliss, 2015. "Science and Struggle," The ANNALS of the American Academy of Political and Social Science, , vol. 661(1), pages 86-108, September.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:166:y:2025:i:c:p:116-137. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.sciencedirect.com/journal/theoretical-population-biology .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.