IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v134y2020icp106-118.html
   My bibliography  Save this article

Maximum likelihood estimators for scaled mutation rates in an equilibrium mutation–drift model

Author

Listed:
  • Vogl, Claus
  • Mikula, Lynette C.
  • Burden, Conrad J.

Abstract

The stationary sampling distribution of a neutral decoupled Moran or Wright–Fisher diffusion with neutral mutations is known to first order for a general rate matrix with small but otherwise unconstrained mutation rates. Using this distribution as a starting point we derive results for maximum likelihood estimates of scaled mutation rates from site frequency data under three model assumptions: a twelve-parameter general rate matrix, a nine-parameter reversible rate matrix, and a six-parameter strand-symmetric rate matrix. The site frequency spectrum is assumed to be sampled from a fixed size population in equilibrium, and to consist of allele frequency data at a large number of unlinked sites evolving with a common mutation rate matrix without selective bias. We correct an error in a previous treatment of the same problem (Burden and Tang, 2017) affecting the estimators for the general and strand-symmetric rate matrices. The method is applied to a biological dataset consisting of a site frequency spectrum extracted from short autosomal introns in a sample of Drosophila melanogaster individuals.

Suggested Citation

  • Vogl, Claus & Mikula, Lynette C. & Burden, Conrad J., 2020. "Maximum likelihood estimators for scaled mutation rates in an equilibrium mutation–drift model," Theoretical Population Biology, Elsevier, vol. 134(C), pages 106-118.
  • Handle: RePEc:eee:thpobi:v:134:y:2020:i:c:p:106-118
    DOI: 10.1016/j.tpb.2020.06.001
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580920300460
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2020.06.001?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Burden, Conrad J. & Tang, Yurong, 2017. "Rate matrix estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 113(C), pages 23-33.
    2. Burden, Conrad J. & Tang, Yurong, 2016. "An approximate stationary solution for multi-allele neutral diffusion with low mutation rates," Theoretical Population Biology, Elsevier, vol. 112(C), pages 22-32.
    3. Vogl, Claus, 2014. "Estimating the scaled mutation rate and mutation bias with site frequency data," Theoretical Population Biology, Elsevier, vol. 98(C), pages 19-27.
    4. Vogl, Claus & Bergman, Juraj, 2015. "Inference of directional selection and mutation parameters assuming equilibrium," Theoretical Population Biology, Elsevier, vol. 106(C), pages 71-82.
    5. Matthew Stephens & Peter Donnelly, 2000. "Inference in molecular population genetics," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(4), pages 605-635.
    6. Vogl, Claus & Clemente, Florian, 2012. "The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates," Theoretical Population Biology, Elsevier, vol. 81(3), pages 197-209.
    7. Schrempf, Dominik & Hobolth, Asger, 2017. "An alternative derivation of the stationary distribution of the multivariate neutral Wright–Fisher model for low mutation rates with a view to mutation rate estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 114(C), pages 88-94.
    8. RoyChoudhury, Arindam & Wakeley, John, 2010. "Sufficiency of the number of segregating sites in the limit under finite-sites mutation," Theoretical Population Biology, Elsevier, vol. 78(2), pages 118-122.
    9. Etheridge, A.M. & Griffiths, R.C., 2009. "A coalescent dual process in a Moran model with genic selection," Theoretical Population Biology, Elsevier, vol. 75(4), pages 320-330.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Burden, Conrad J. & Tang, Yurong, 2017. "Rate matrix estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 113(C), pages 23-33.
    2. Vogl, Claus & Bergman, Juraj, 2015. "Inference of directional selection and mutation parameters assuming equilibrium," Theoretical Population Biology, Elsevier, vol. 106(C), pages 71-82.
    3. Vogl, Claus & Mikula, Lynette Caitlin, 2021. "A nearly-neutral biallelic Moran model with biased mutation and linear and quadratic selection," Theoretical Population Biology, Elsevier, vol. 139(C), pages 1-17.
    4. Schrempf, Dominik & Hobolth, Asger, 2017. "An alternative derivation of the stationary distribution of the multivariate neutral Wright–Fisher model for low mutation rates with a view to mutation rate estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 114(C), pages 88-94.
    5. Burden, Conrad J. & Griffiths, Robert C., 2018. "Stationary distribution of a 2-island 2-allele Wright–Fisher diffusion model with slow mutation and migration rates," Theoretical Population Biology, Elsevier, vol. 124(C), pages 70-80.
    6. Vogl, Claus & Clemente, Florian, 2012. "The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates," Theoretical Population Biology, Elsevier, vol. 81(3), pages 197-209.
    7. Malaguti, Giulia & Singh, Param Priya & Isambert, Hervé, 2014. "On the retention of gene duplicates prone to dominant deleterious mutations," Theoretical Population Biology, Elsevier, vol. 93(C), pages 38-51.
    8. Vogl, Claus, 2014. "Estimating the scaled mutation rate and mutation bias with site frequency data," Theoretical Population Biology, Elsevier, vol. 98(C), pages 19-27.
    9. Jüri Lember & Chris Watkins, 2022. "An Evolutionary Model that Satisfies Detailed Balance," Methodology and Computing in Applied Probability, Springer, vol. 24(1), pages 1-37, March.
    10. Burden, Conrad J. & Tang, Yurong, 2016. "An approximate stationary solution for multi-allele neutral diffusion with low mutation rates," Theoretical Population Biology, Elsevier, vol. 112(C), pages 22-32.
    11. Griffiths, Robert C. & Tavaré, Simon, 2018. "Ancestral inference from haplotypes and mutations," Theoretical Population Biology, Elsevier, vol. 122(C), pages 12-21.
    12. Steinrücken, Matthias & Paul, Joshua S. & Song, Yun S., 2013. "A sequentially Markov conditional sampling distribution for structured populations with migration and recombination," Theoretical Population Biology, Elsevier, vol. 87(C), pages 51-61.
    13. Desai, Michael M. & Nicolaisen, Lauren E. & Walczak, Aleksandra M. & Plotkin, Joshua B., 2012. "The structure of allelic diversity in the presence of purifying selection," Theoretical Population Biology, Elsevier, vol. 81(2), pages 144-157.
    14. Birkner, Matthias & Blath, Jochen & Steinrücken, Matthias, 2011. "Importance sampling for Lambda-coalescents in the infinitely many sites model," Theoretical Population Biology, Elsevier, vol. 79(4), pages 155-173.
    15. Larribe Fabrice & Lessard Sabin, 2008. "A Composite-Conditional-Likelihood Approach for Gene Mapping Based on Linkage Disequilibrium in Windows of Marker Loci," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-33, August.
    16. Frank Hollander & Shubhamoy Nandan, 2022. "Spatially Inhomogeneous Populations with Seed-Banks: I. Duality, Existence and Clustering," Journal of Theoretical Probability, Springer, vol. 35(3), pages 1795-1841, September.
    17. Wenkai Huang & Feng Zhan, 2023. "A Novel Probabilistic Diffusion Model Based on the Weak Selection Mimicry Theory for the Generation of Hypnotic Songs," Mathematics, MDPI, vol. 11(15), pages 1-26, July.
    18. Blath, Jochen & Buzzoni, Eugenio & Koskela, Jere & Wilke Berenguer, Maite, 2020. "Statistical tools for seed bank detection," Theoretical Population Biology, Elsevier, vol. 132(C), pages 1-15.
    19. Corujo, Josué, 2021. "Dynamics of a Fleming–Viot type particle system on the cycle graph," Stochastic Processes and their Applications, Elsevier, vol. 136(C), pages 57-91.
    20. Der, Ricky & Epstein, Charles L. & Plotkin, Joshua B., 2011. "Generalized population models and the nature of genetic drift," Theoretical Population Biology, Elsevier, vol. 80(2), pages 80-99.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:134:y:2020:i:c:p:106-118. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/intelligence .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.