IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v165y2025icp10-21.html

Identity-by-descent segments in large samples

Author

Listed:
  • Temple, Seth D.
  • Thompson, Elizabeth A.

Abstract

If two haplotypes share the same alleles for an extended gene tract, these haplotypes are likely to be derived identical-by-descent from a recent common ancestor. Identity-by-descent segment lengths are correlated via unobserved ancestral tree and recombination processes, which commonly presents challenges to the derivation of theoretical results in population genetics. We show that the proportion of detectable identity-by-descent segments around a locus is normally distributed when the sample size and the scaled population size are large. We generalize this central limit theorem to cover flexible demographic scenarios, multi-way identity-by-descent segments, and multivariate identity-by-descent rates. The regularity conditions on sample size and scaled population size are unlikely to hold in genetic data from real populations, but provide intuition for when the Gaussian distribution may be a reasonable approximate model for the IBD rate. We use efficient simulations to study the distributional behavior of the detectable identity-by-descent rate. One consequence of non-normality in finite samples is that a genome-wide scan looking for excess identity-by-descent rates may be subject to anti-conservative control of family-wise error rates.

Suggested Citation

  • Temple, Seth D. & Thompson, Elizabeth A., 2025. "Identity-by-descent segments in large samples," Theoretical Population Biology, Elsevier, vol. 165(C), pages 10-21.
  • Handle: RePEc:eee:thpobi:v:165:y:2025:i:c:p:10-21
    DOI: 10.1016/j.tpb.2025.06.003
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580925000395
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2025.06.003?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Ruhollah Shemirani & Gillian M. Belbin & Christy L. Avery & Eimear E. Kenny & Christopher R. Gignoux & José Luis Ambite, 2021. "Rapid detection of identity-by-descent tracts for mega-scale datasets," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    2. Bing Guo & Victor Borda & Roland Laboulaye & Michele D. Spring & Mariusz Wojnarski & Brian A. Vesely & Joana C. Silva & Norman C. Waters & Timothy D. O’Connor & Shannon Takala-Harrison, 2024. "Strong positive selection biases identity-by-descent-based inferences of recent demography and population structure in Plasmodium falciparum," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    3. Juba Nait Saada & Georgios Kalantzis & Derek Shyr & Fergus Cooper & Martin Robinson & Alexander Gusev & Pier Francesco Palamara, 2020. "Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations," Nature Communications, Nature, vol. 11(1), pages 1-15, December.
    4. Alexander Shapiro & Jos Berge, 2002. "Statistical inference of minimum rank factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 79-94, March.
    5. Heng Li & Richard Durbin, 2011. "Inference of human population history from individual whole-genome sequences," Nature, Nature, vol. 475(7357), pages 493-496, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Romain Fournier & Zoi Tsangalidou & David Reich & Pier Francesco Palamara, 2023. "Haplotype-based inference of recent effective population size in modern and ancient DNA samples," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Jewett, Ethan M. & Rosenberg, Noah A., 2014. "Theory and applications of a deterministic approximation to the coalescent model," Theoretical Population Biology, Elsevier, vol. 93(C), pages 14-29.
    3. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
    4. Denter, Philipp & Sisak, Dana, 2015. "Do polls create momentum in political competition?," Journal of Public Economics, Elsevier, vol. 130(C), pages 1-14.
    5. Salgado Alfredo, 2018. "Incomplete Information and Costly Signaling in College Admissions," Working Papers 2018-23, Banco de México.
    6. Albrecht, James & Anderson, Axel & Vroman, Susan, 2010. "Search by committee," Journal of Economic Theory, Elsevier, vol. 145(4), pages 1386-1407, July.
    7. Blier-Wong, Christopher & Cossette, Hélène & Marceau, Etienne, 2023. "Risk aggregation with FGM copulas," Insurance: Mathematics and Economics, Elsevier, vol. 111(C), pages 102-120.
    8. Ya-Mei Ding & Xiao-Xu Pang & Yu Cao & Wei-Ping Zhang & Susanne S. Renner & Da-Yong Zhang & Wei-Ning Bai, 2023. "Genome structure-based Juglandaceae phylogenies contradict alignment-based phylogenies and substitution rates vary with DNA repair genes," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    9. Gilles-Philippe Morin & Claudia Moreau & Amadou Barry & Simon L. Girard, 2026. "Fine-scale structure of a whole regional population through genetics and genealogies," Nature Communications, Nature, vol. 17(1), pages 1-8, December.
    10. Wim J. van der Linden, 2019. "Lord’s Equity Theorem Revisited," Journal of Educational and Behavioral Statistics, , vol. 44(4), pages 415-430, August.
    11. Ribeiro, Rafaela & Fanzeres, Bruno, 2026. "Integrated estimate-and-optimize decision trees learning for two-stage linear decision-making problems," European Journal of Operational Research, Elsevier, vol. 329(2), pages 607-628.
    12. Simar, Léopold & Wilson, Paul, 2022. "Modern Tools for Evaluating the Performance of Health-Care Providers," LIDAM Discussion Papers ISBA 2022006, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    13. Baey, Charlotte & Didier, Anne & Lemaire, Sébastien & Maupas, Fabienne & Cournède, Paul-Henry, 2013. "Modelling the interindividual variability of organogenesis in sugar beet populations using a hierarchical segmented model," Ecological Modelling, Elsevier, vol. 263(C), pages 56-63.
    14. Tasche, Dirk, 2013. "Bayesian estimation of probabilities of default for low default portfolios," Journal of Risk Management in Financial Institutions, Henry Stewart Publications, vol. 6(3), pages 302-326, July.
    15. Daniel dos Santos Baptista & Nuno M. Brites & Alfredo D. Egídio dos Reis, 2025. "Stochastic differential equations death rates models: the Portuguese case," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 48(2), pages 1197-1217, December.
    16. Diers, Dorothea & Linde, Marc & Hahn, Lukas, 2016. "Addendum to ‘The multi-year non-life insurance risk in the additive reserving model’ [Insurance Math. Econom. 52(3) (2013) 590–598]: Quantification of multi-year non-life insurance risk in chain ladder reserving models," Insurance: Mathematics and Economics, Elsevier, vol. 67(C), pages 187-199.
    17. Fan, Wai-Tong (Louis) & Wakeley, John, 2024. "Latent mutations in the ancestries of alleles under selection," Theoretical Population Biology, Elsevier, vol. 158(C), pages 1-20.
    18. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m -dependent random variables," LSE Research Online Documents on Economics 83635, London School of Economics and Political Science, LSE Library.
    19. Guangping Huang & Lingyun Song & Xin Du & Xin Huang & Fuwen Wei, 2023. "Evolutionary genomics of camouflage innovation in the orchid mantis," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    20. Legried, Brandon & Terhorst, Jonathan, 2022. "Rates of convergence in the two-island and isolation-with-migration models," Theoretical Population Biology, Elsevier, vol. 147(C), pages 16-27.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:165:y:2025:i:c:p:10-21. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.sciencedirect.com/journal/theoretical-population-biology .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.