
Approximate String Matching with Non-Overlapping Adjacent Unbalanced Translocations

Author

Listed:
  • Domenico Cantone

    (Department of Mathematics and Computer Science, University of Catania, Viale A. Doria 6, I-95125 Catania, Italy)

  • Simone Faro

    (Department of Mathematics and Computer Science, University of Catania, Viale A. Doria 6, I-95125 Catania, Italy)

  • Arianna Pavone

    (Department of Mathematics and Computer Science, University of Palermo, Via Archirafi 34, I-90123 Palermo, Italy)

Abstract

In this paper, we investigate the approximate string matching problem when the allowed edit operations are non-overlapping unbalanced translocations of adjacent factors. This kind of edit operation takes place when two adjacent substrings of the text swap, resulting in a modified string. The two substrings involved are allowed to have different lengths. Such large-scale modifications of strings have various applications, notably in fields such as computational biology and genomics, where structural rearrangements play a key role. However, despite their central role in many fields of text processing, little attention has been devoted to the problem of matching strings allowing for this kind of edit operation. We present three algorithms for solving the problem, all with $O(nm^3)$ worst-case time complexity and $O(m^2)$ space complexity, where $m$ and $n$ are the lengths of the pattern and of the text, respectively. Specifically, our first algorithm is based on the dynamic programming approach. Our second solution improves on the first by making use of the Directed Acyclic Word Graph of the pattern. Finally, our third algorithm is based on an alignment procedure. We also show that, under the assumptions of equiprobability and independence of characters, our second algorithm has an $O(n \log_\sigma^2 m)$ average time complexity for an alphabet of size $\sigma \geq 4$.
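
To make the edit operation concrete, the following Python sketch swaps two adjacent factors of possibly different lengths and then checks, window by window, whether a pattern matches a text under non-overlapping adjacent translocations. It is a deliberately naive illustration written for this summary, not any of the three algorithms from the paper, and it is slower than the bounds quoted in the abstract. The function names `translocate`, `matches_with_translocations`, and `search` are hypothetical.

```python
from typing import List


def translocate(s: str, start: int, k: int, h: int) -> str:
    """Swap the adjacent factors u = s[start:start+k] and v = s[start+k:start+k+h].

    The factors may have different lengths (an "unbalanced" translocation).
    """
    if k < 1 or h < 1 or start < 0 or start + k + h > len(s):
        raise ValueError("both factors must be non-empty and lie inside s")
    u = s[start:start + k]
    v = s[start + k:start + k + h]
    return s[:start] + v + u + s[start + k + h:]


def matches_with_translocations(x: str, w: str) -> bool:
    """Return True if w can be obtained from x by non-overlapping
    translocations of adjacent factors (brute-force prefix DP)."""
    if len(x) != len(w):
        return False
    m = len(x)
    # ok[i] is True when x[:i] can be turned into w[:i].
    ok = [False] * (m + 1)
    ok[0] = True
    for i in range(1, m + 1):
        # Case 1: the character at position i-1 is left untouched.
        if ok[i - 1] and x[i - 1] == w[i - 1]:
            ok[i] = True
            continue
        # Case 2: a single translocation covers x[i-length:i], so the
        # corresponding piece of w must be a non-trivial rotation v+u of u+v.
        for length in range(2, i + 1):
            if not ok[i - length]:
                continue
            seg_x = x[i - length:i]
            seg_w = w[i - length:i]
            if any(seg_x[k:] + seg_x[:k] == seg_w for k in range(1, length)):
                ok[i] = True
                break
    return ok[m]


def search(pattern: str, text: str) -> List[int]:
    """Return every start position j such that text[j:j+m] matches the pattern."""
    m = len(pattern)
    return [
        j
        for j in range(len(text) - m + 1)
        if matches_with_translocations(pattern, text[j:j + m])
    ]
```

For example, `translocate("abcde", 1, 1, 2)` swaps `b` with `cd` and yields `"acdbe"`, and `search("abcde", "xxacdbeyy")` reports the single occurrence at position 2. Because the translocations must not overlap, `"cba"` does not match `"abc"`: no single rotation of the whole string or of one of its length-2 pieces produces it.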

Suggested Citation

  • Domenico Cantone & Simone Faro & Arianna Pavone, 2025. "Approximate String Matching with Non-Overlapping Adjacent Unbalanced Translocations," Mathematics, MDPI, vol. 13(13), pages 1-28, June.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:13:p:2103-:d:1688477

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/13/2103/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/13/2103/
    Download Restriction: no