Printed from https://ideas.repec.org/a/bla/jorssc/v67y2018i1p3-23.html

A Bayesian model selection approach for identifying differentially expressed transcripts from RNA sequencing data

Authors

Listed:
  • Panagiotis Papastamoulis
  • Magnus Rattray

Abstract

Recent advances in molecular biology allow the transcriptome to be quantified and transcripts to be scored as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available inference methods treat them separately: a primary model is used to estimate expression and its output is post-processed using a differential expression model. In this paper, both issues are addressed simultaneously by proposing the joint estimation of expression levels and differential expression: the unknown relative abundance of each transcript can either be equal or not between two conditions. A hierarchical Bayesian model builds on the BitSeq framework, and the posterior distribution of transcript expression and differential expression is inferred using Markov chain Monte Carlo sampling. It is shown that the proposed model enjoys conjugacy for fixed dimension variables; thus the full conditional distributions are analytically derived. Two samplers are constructed, a reversible jump Markov chain Monte Carlo sampler and a collapsed Gibbs sampler, and the latter is found to perform better. A cluster representation of the reads aligned to the transcriptome is introduced, allowing parallel estimation of the marginal posterior distribution of subsets of transcripts within reasonable computing time. Under a fixed prior probability of differential expression the clusterwise sampler has the same marginal posterior distributions as the raw sampler, but a more general prior structure is also employed. The proposed algorithm is benchmarked against alternative methods using synthetic data sets and applied to real RNA sequencing data. Source code is available online from https://github.com/mqbssppe/cjBitSeq.
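
The cluster representation mentioned in the abstract can be illustrated with a minimal sketch. It assumes, which the abstract does not state, that two transcripts fall in the same cluster whenever a chain of shared (multi-mapping) reads connects them, so that clusters are the connected components of the read-to-transcript alignment graph. The function name `cluster_transcripts` and the input format are hypothetical and are not taken from cjBitSeq.

```python
from collections import defaultdict


def cluster_transcripts(read_alignments):
    """Group transcripts into clusters connected by shared reads.

    read_alignments: iterable of lists; each inner list holds the
    transcript ids that one read aligns to (hypothetical input format).
    Returns a list of sorted clusters, ordered by their smallest member.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    for transcripts in read_alignments:
        if not transcripts:
            continue
        first = transcripts[0]
        find(first)  # register transcripts hit only by uniquely mapped reads
        for other in transcripts[1:]:
            union(first, other)  # a shared read links the two transcripts

    groups = defaultdict(list)
    for t in parent:
        groups[find(t)].append(t)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


# Toy data: each row is one read and the transcripts it maps to.
reads = [["t1", "t2"], ["t2", "t3"], ["t4"], ["t5", "t6"], ["t4"]]
clusters = cluster_transcripts(reads)
print(clusters)  # [['t1', 't2', 't3'], ['t4'], ['t5', 't6']]
```

Under this assumption no read is shared between clusters, so each cluster could be handed to a separate worker, which is one way to read the abstract's claim about parallel estimation for subsets of transcripts.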

Suggested Citation

  • Panagiotis Papastamoulis & Magnus Rattray, 2018. "A Bayesian model selection approach for identifying differentially expressed transcripts from RNA sequencing data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(1), pages 3-23, January.
  • Handle: RePEc:bla:jorssc:v:67:y:2018:i:1:p:3-23
    DOI: 10.1111/rssc.12213

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12213
    Download Restriction: no


    Citations


    Cited by:

    1. Vinícius Diniz Mayrink & Flávio B. Gonçalves, 2020. "Identifying atypically expressed chromosome regions using RNA-Seq data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(3), pages 619-649, September.
