
Estimating the scaled mutation rate and mutation bias with site frequency data

Author

Listed:
  • Vogl, Claus

Abstract

The distribution of allele frequencies of a large number of biallelic sites is known as “allele-frequency spectrum” or “site-frequency spectrum” (SFS). Without selection and in regions of relatively high recombination rates, sites may be assumed to be independently and identically distributed. With a beta equilibrium distribution of allelic proportions and binomial sampling, a beta–binomial compound likelihood for each site results. The likelihood of the data and the posterior distribution of two parameters, scaled mutation rate θ and mutation bias α, is investigated in the general case and for small scaled mutation rates θ. In the general case, an expectation–maximization (EM) algorithm is derived to obtain maximum likelihood estimates of both parameters. With an appropriate prior distribution, a Markov chain Monte Carlo sampler to integrate the posterior distribution is also derived. As far as I am aware, previous maximum likelihood or Bayesian estimators of θ, explicitly or implicitly assume small scaled mutation rates, i.e., θ≪1. For θ≪1, maximum-likelihood estimators are also derived for both parameters using a Taylor series expansion of the beta–binomial distribution. The estimator of θ is a variant of the Ewens–Watterson estimator and of the maximum likelihood estimator derived with the Poisson Random Field approach. With a conjugate prior distribution, marginal and conditional beta posterior distributions are also derived for both parameters.
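
As a rough illustration of the beta–binomial compound likelihood described in the abstract, the sketch below evaluates the log-likelihood of a site-frequency spectrum and maximizes it over a coarse grid. It is not the paper's method: the abstract derives an EM algorithm and an MCMC sampler, while the grid search here is a simple stand-in. The parametrization Beta(αθ, (1−α)θ) for the equilibrium distribution is an assumption, and all function names are hypothetical.

```python
import math


def log_beta(a, b):
    """Logarithm of the beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def beta_binomial_logpmf(y, n, a, b):
    """Log-probability of y focal alleles in a sample of size n when the
    allelic proportion is Beta(a, b) distributed and sampling is binomial."""
    log_choose = (math.lgamma(n + 1) - math.lgamma(y + 1)
                  - math.lgamma(n - y + 1))
    return log_choose + log_beta(y + a, n - y + b) - log_beta(a, b)


def sfs_log_likelihood(counts, n, theta, alpha):
    """Log-likelihood of an SFS, treating sites as independent.

    counts[y] is the number of sites with y copies of the focal allele.
    ASSUMPTION: equilibrium proportions follow Beta(alpha*theta,
    (1-alpha)*theta); this parametrization is not stated in the abstract.
    """
    if len(counts) != n + 1:
        raise ValueError("counts must have n + 1 entries")
    a = alpha * theta
    b = (1.0 - alpha) * theta
    return sum(c * beta_binomial_logpmf(y, n, a, b)
               for y, c in enumerate(counts) if c)


def grid_mle(counts, n, thetas, alphas):
    """Grid-search stand-in for the EM algorithm: return the (theta, alpha)
    pair on the grid with the highest log-likelihood."""
    best = max((sfs_log_likelihood(counts, n, t, a), t, a)
               for t in thetas for a in alphas)
    return best[1], best[2]
```

Calling grid_mle(counts, n, thetas, alphas) with 0 < α < 1 and θ > 0 on both grids returns the grid point with the highest likelihood; a finer grid or a numerical optimizer would be needed for estimates of practical use.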

Suggested Citation

  • Vogl, Claus, 2014. "Estimating the scaled mutation rate and mutation bias with site frequency data," Theoretical Population Biology, Elsevier, vol. 98(C), pages 19-27.
  • Handle: RePEc:eee:thpobi:v:98:y:2014:i:c:p:19-27
    DOI: 10.1016/j.tpb.2014.10.002

    References listed on IDEAS

    1. Vogl, Claus & Clemente, Florian, 2012. "The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates," Theoretical Population Biology, Elsevier, vol. 81(3), pages 197-209.
    2. RoyChoudhury, Arindam & Wakeley, John, 2010. "Sufficiency of the number of segregating sites in the limit under finite-sites mutation," Theoretical Population Biology, Elsevier, vol. 78(2), pages 118-122.

    Cited by:

    1. Burden, Conrad J. & Tang, Yurong, 2016. "An approximate stationary solution for multi-allele neutral diffusion with low mutation rates," Theoretical Population Biology, Elsevier, vol. 112(C), pages 22-32.
    2. Vogl, Claus & Bergman, Juraj, 2015. "Inference of directional selection and mutation parameters assuming equilibrium," Theoretical Population Biology, Elsevier, vol. 106(C), pages 71-82.
    3. Vogl, Claus & Mikula, Lynette Caitlin, 2021. "A nearly-neutral biallelic Moran model with biased mutation and linear and quadratic selection," Theoretical Population Biology, Elsevier, vol. 139(C), pages 1-17.
    4. Burden, Conrad J. & Tang, Yurong, 2017. "Rate matrix estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 113(C), pages 23-33.
    5. Vogl, Claus & Mikula, Lynette C. & Burden, Conrad J., 2020. "Maximum likelihood estimators for scaled mutation rates in an equilibrium mutation–drift model," Theoretical Population Biology, Elsevier, vol. 134(C), pages 106-118.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vogl, Claus & Bergman, Juraj, 2015. "Inference of directional selection and mutation parameters assuming equilibrium," Theoretical Population Biology, Elsevier, vol. 106(C), pages 71-82.
    2. Vogl, Claus & Mikula, Lynette C. & Burden, Conrad J., 2020. "Maximum likelihood estimators for scaled mutation rates in an equilibrium mutation–drift model," Theoretical Population Biology, Elsevier, vol. 134(C), pages 106-118.
    3. Vogl, Claus & Clemente, Florian, 2012. "The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates," Theoretical Population Biology, Elsevier, vol. 81(3), pages 197-209.
    4. Burden, Conrad J. & Tang, Yurong, 2017. "Rate matrix estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 113(C), pages 23-33.
    5. Malaguti, Giulia & Singh, Param Priya & Isambert, Hervé, 2014. "On the retention of gene duplicates prone to dominant deleterious mutations," Theoretical Population Biology, Elsevier, vol. 93(C), pages 38-51.
    6. Ferretti, Luca & Ramos-Onsins, Sebástian E., 2015. "A generalized Watterson estimator for next-generation sequencing: From trios to autopolyploids," Theoretical Population Biology, Elsevier, vol. 100(C), pages 79-87.
    7. Jüri Lember & Chris Watkins, 2022. "An Evolutionary Model that Satisfies Detailed Balance," Methodology and Computing in Applied Probability, Springer, vol. 24(1), pages 1-37, March.
    8. Vogl, Claus & Mikula, Lynette Caitlin, 2021. "A nearly-neutral biallelic Moran model with biased mutation and linear and quadratic selection," Theoretical Population Biology, Elsevier, vol. 139(C), pages 1-17.
    9. Schrempf, Dominik & Hobolth, Asger, 2017. "An alternative derivation of the stationary distribution of the multivariate neutral Wright–Fisher model for low mutation rates with a view to mutation rate estimation from site frequency data," Theoretical Population Biology, Elsevier, vol. 114(C), pages 88-94.
