IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1002806.html
   My bibliography  Save this article

SnIPRE: Selection Inference Using a Poisson Random Effects Model

Author

Listed:
  • Kirsten E Eilertson
  • James G Booth
  • Carlos D Bustamante

Abstract

We present an approach for identifying genes under natural selection using polymorphism and divergence data from synonymous and non-synonymous sites within genes. A generalized linear mixed model is used to model the genome-wide variability among categories of mutations and estimate its functional consequence. We demonstrate how the model's estimated fixed and random effects can be used to identify genes under selection. The parameter estimates from our generalized linear model can be transformed to yield population genetic parameter estimates for quantities including the average selection coefficient for new mutations at a locus, the synonymous and non-synynomous mutation rates, and species divergence times. Furthermore, our approach incorporates stochastic variation due to the evolutionary process and can be fit using standard statistical software. The model is fit in both the empirical Bayes and Bayesian settings using the lme4 package in R, and Markov chain Monte Carlo methods in WinBUGS. Using simulated data we compare our method to existing approaches for detecting genes under selection: the McDonald-Kreitman test, and two versions of the Poisson random field based method MKprf. Overall, we find our method universally outperforms existing methods for detecting genes subject to selection using polymorphism and divergence data. Author Summary: We present a new methodology, SnIPRE, for identifying genes under natural selection. SnIPRE is a “McDonald-Kreitman” type of analysis, in that it is based on MK table data and has an advantage over other types of statistics because it is robust to demography. Similar to the MKprf method, SnIPRE makes use of genome-wide information to increase power, but is non-parametric in the sense that it makes no assumptions (and does not require estimation) of parameters such as mutation rate and species divergence time in order to identify genes under selection. In simulations SnIPRE outperforms both the MK statistic and the two versions of MKprf considered. We then apply our method to Drosophila and human-chimp data.

Suggested Citation

  • Kirsten E Eilertson & James G Booth & Carlos D Bustamante, 2012. "SnIPRE: Selection Inference Using a Poisson Random Effects Model," PLOS Computational Biology, Public Library of Science, vol. 8(12), pages 1-14, December.
  • Handle: RePEc:plo:pcbi00:1002806
    DOI: 10.1371/journal.pcbi.1002806
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002806
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1002806&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1002806?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hadfield, Jarrod D., 2010. "MCMC Methods for Multi-Response Generalized Linear Mixed Models: The MCMCglmm R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i02).
    2. Katherine S. Pollard & Sofie R. Salama & Nelle Lambert & Marie-Alexandra Lambot & Sandra Coppens & Jakob S. Pedersen & Sol Katzman & Bryan King & Courtney Onodera & Adam Siepel & Andrew D. Kern & Cole, 2006. "An RNA gene expressed during cortical development evolved rapidly in humans," Nature, Nature, vol. 443(7108), pages 167-172, September.
    3. Carlos D. Bustamante & Rasmus Nielsen & Stanley A. Sawyer & Kenneth M. Olsen & Michael D. Purugganan & Daniel L. Hartl, 2002. "The cost of inbreeding in Arabidopsis," Nature, Nature, vol. 416(6880), pages 531-534, April.
    4. Nick G. C. Smith & Adam Eyre-Walker, 2002. "Adaptive protein evolution in Drosophila," Nature, Nature, vol. 415(6875), pages 1022-1024, February.
    5. Gill Bejerano & Craig B. Lowe & Nadav Ahituv & Bryan King & Adam Siepel & Sofie R. Salama & Edward M. Rubin & W. James Kent & David Haussler, 2006. "A distal enhancer and an ultraconserved exon are derived from a novel retroposon," Nature, Nature, vol. 441(7089), pages 87-90, May.
    6. Peter Andolfatto, 2005. "Adaptive evolution of non-coding DNA in Drosophila," Nature, Nature, vol. 437(7062), pages 1149-1152, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xiao Feng & Qipian Chen & Weihong Wu & Jiexin Wang & Guohong Li & Shaohua Xu & Shao Shao & Min Liu & Cairong Zhong & Chung-I Wu & Suhua Shi & Ziwen He, 2024. "Genomic evidence for rediploidization and adaptive evolution following the whole-genome triplication," Nature Communications, Nature, vol. 15(1), pages 1-15, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noah Dukler & Mehreen R. Mughal & Ritika Ramani & Yi-Fei Huang & Adam Siepel, 2022. "Extreme purifying selection against point mutations in the human genome," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    2. Benger, Etam & Sella, Guy, 2013. "Modeling the effect of changing selective pressures on polymorphism and divergence," Theoretical Population Biology, Elsevier, vol. 85(C), pages 73-85.
    3. Sella, Guy, 2009. "An exact steady state solution of Fisher’s geometric model and other models," Theoretical Population Biology, Elsevier, vol. 75(1), pages 30-34.
    4. Xinru Zhang & Bohao Fang & Yi-Fei Huang, 2023. "Transcription factor binding sites are frequently under accelerated evolution in primates," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    5. Wolfgang Goymann & John C. Wingfield, 2014. "Male-to-female testosterone ratios, dimorphism, and life history—what does it really tell us?," Behavioral Ecology, International Society for Behavioral Ecology, vol. 25(4), pages 685-699.
    6. I. Albarrán & P. Alonso-González & J. M. Marin, 2017. "Some criticism to a general model in Solvency II: an explanation from a clustering point of view," Empirical Economics, Springer, vol. 52(4), pages 1289-1308, June.
    7. Andrés López-Sepulcre & Sebastiano De Bona & Janne K. Valkonen & Kate D.L. Umbers & Johanna Mappes, 2015. "Item Response Trees: a recommended method for analyzing categorical data in behavioral studies," Behavioral Ecology, International Society for Behavioral Ecology, vol. 26(5), pages 1268-1273.
    8. Jesse Shore & Ethan Bernstein & David Lazer, 2014. "Facts and Figuring: An Experimental Investigation of Network Structure and Performance in Information and Solution Spaces," Harvard Business School Working Papers 14-075, Harvard Business School, revised Jun 2014.
    9. Weliton Menário & Wendy J King & Timothée Bonnet & Marco Festa-Bianchet & Loeske E B Kruuk, 2023. "Early-life behavior, survival, and maternal personality in a wild marsupial," Behavioral Ecology, International Society for Behavioral Ecology, vol. 34(6), pages 1002-1012.
    10. Bakar, Khandoker Shuvo & Sahu, Sujit K., 2015. "spTimer: Spatio-Temporal Bayesian Modeling Using R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i15).
    11. Parady, Giancarlos & Frei, Andreas & Kowald, Matthias & Guidon, Sergio & Wicki, Michael & van den Berg, Pauline & Carrasco, Juan-Antonio & Arentze, Theo & Timmermans, Harry & Wellman, Barry & Takami, , 2021. "A comparative study of social interaction frequencies among social network members in five countries," Journal of Transport Geography, Elsevier, vol. 90(C).
    12. Amoroso, S., 2013. "Heterogeneity of innovative, collaborative, and productive firm-level processes," Other publications TiSEM f5784a49-7053-401d-855d-1, Tilburg University, School of Economics and Management.
    13. RoyChoudhury, Arindam & Wakeley, John, 2010. "Sufficiency of the number of segregating sites in the limit under finite-sites mutation," Theoretical Population Biology, Elsevier, vol. 78(2), pages 118-122.
    14. Apergis, Nicholas & Aye, Goodness C. & Barros, Carlos Pestana & Gupta, Rangan & Wanke, Peter, 2015. "Energy efficiency of selected OECD countries: A slacks based model with undesirable outputs," Energy Economics, Elsevier, vol. 51(C), pages 45-53.
    15. Francis K. C. Hui & Samuel Müller & Alan H. Welsh, 2021. "Random Effects Misspecification Can Have Severe Consequences for Random Effects Inference in Linear Mixed Models," International Statistical Review, International Statistical Institute, vol. 89(1), pages 186-206, April.
    16. Mikhail V Matz & Rachel M Wright & James G Scott, 2013. "No Control Genes Required: Bayesian Analysis of qRT-PCR Data," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-12, August.
    17. Amei Amei & Stanley Sawyer, 2012. "Statistical Inference of Selection and Divergence from a Time-Dependent Poisson Random Field Model," PLOS ONE, Public Library of Science, vol. 7(4), pages 1-7, April.
    18. Saeede Ajorlou & Issac Shams & Kai Yang, 2015. "An analytics approach to designing patient centered medical homes," Health Care Management Science, Springer, vol. 18(1), pages 3-18, March.
    19. Kandt, Jens & Leak, Alistair, 2019. "Examining inclusive mobility through smartcard data: What shall we make of senior citizens' declining bus patronage in the West Midlands?," Journal of Transport Geography, Elsevier, vol. 79(C), pages 1-1.
    20. Hart, Jordan D. A. & Franks, Daniel Wayne & Brent, Lauren & Weiss, Michael N., 2022. "bisonR - Bayesian Inference of Social Networks with R," OSF Preprints ywu7j, Center for Open Science.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1002806. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.