
Tweedie Distributions for Biological Sequences Alignments

Author

Listed:
  • Ben Hassen Hanen

    (University of Sfax)

  • Masmoudi Khalil

    (University of Sfax)

  • Masmoudi Afif

    (University of Sfax)

Abstract

An important technique in the study of similarity between biological sequences is the analysis of the distribution of their alignment scores. Estimating this distribution plays a central role in evaluating the statistical significance of the alignments. For amino acid sequence alignments, the scores of ungapped aligned segments are proven to follow the extreme value law asymptotically, while gapped alignment scores are generally fitted with Poisson or Gumbel distributions. To widen the set of candidate distributions, other classes of statistical models can be used. In this paper, we propose the class of exponential dispersion models, which includes several common laws such as the Gaussian, Poisson, and Gamma distributions, among many others. In this context, we introduce a new algorithm for estimating the model parameters. The approach is based on selecting the appropriate distribution and on maximum likelihood estimation. An asymptotic confidence interval is provided for the dispersion parameter. Finally, the performance of the algorithm is evaluated through numerical experiments on random sequences produced by different generation techniques.
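
As a rough illustration of how the laws named in the abstract fit into one family, the sketch below uses the standard Tweedie variance relation Var(Y) = phi * mu^p, where p = 0, 1, 2 correspond to the Gaussian, Poisson, and Gamma cases. It is not the authors' algorithm: the function names are hypothetical, the dispersion estimate is a simple moment-based one rather than the maximum likelihood estimate described above, and the "scores" are simulated Gamma draws standing in for real alignment scores.

```python
import random
import statistics

# Tweedie power parameter p for the three laws named in the abstract.
TWEEDIE_POWER = {"gaussian": 0.0, "poisson": 1.0, "gamma": 2.0}


def tweedie_variance(mu, phi, p):
    """Variance implied by a Tweedie model: Var(Y) = phi * mu**p."""
    return phi * mu ** p


def moment_dispersion(scores, p):
    """Moment-based estimate of phi for a fixed power p.

    Assumption for illustration only: this is a plug-in estimator
    (sample variance divided by mean**p), not the paper's MLE.
    """
    mu = statistics.fmean(scores)
    return statistics.variance(scores) / mu ** p


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical stand-in for alignment scores: Gamma(shape=4, scale=5).
    # Its mean is 20 and its variance is 100, so the true phi is 1/shape = 0.25.
    scores = [rng.gammavariate(4.0, 5.0) for _ in range(20000)]
    phi_hat = moment_dispersion(scores, TWEEDIE_POWER["gamma"])
    print(f"estimated dispersion under the Gamma case: {phi_hat:.3f}")
```

With the simulated Gamma sample, the estimate should land close to 0.25; trying the other entries of `TWEEDIE_POWER` on the same data shows how the implied dispersion changes with the assumed power.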

Suggested Citation

  • Ben Hassen Hanen & Masmoudi Khalil & Masmoudi Afif, 2024. "Tweedie Distributions for Biological Sequences Alignments," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(1), pages 165-184, April.
  • Handle: RePEc:spr:stabio:v:16:y:2024:i:1:d:10.1007_s12561-023-09388-4
    DOI: 10.1007/s12561-023-09388-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-023-09388-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.
