Author
Listed:
- Kuanysh Kadirkulov
- Yekaterina Golenko
- Aisulu Ismailova
- Iskander Baizhanov
Abstract
The purpose of this study is to develop a robust methodology for the automated detection and quantitative analysis of tandem repeats in genomic sequences, taking into account mismatches and distances, to enhance primer design and improve the accuracy of genomic research. The approach combines an efficient algorithm for identifying complementary DNA fragments, focusing on the 3' end of primers, and integrates two independent similarity metrics: the Hardy–Weinberg χ² test and cosine similarity. The methodology involves generating similarity matrices, heat maps, 3D surface visualizations, and scatter plots for comprehensive evaluation of sequences. Experimental validation of the complete genome of Lactobacillus brevis ATCC 367 identified 586 tandem repeats, demonstrating high consistency between the two metrics and revealing high similarity among most repeats, while highlighting specific cases with discrepancies that require further investigation. The developed methodology effectively combines statistical and vector analyses, enhancing the reliability of genomic studies and enabling the identification of biologically significant variations. The proposed tool can be widely applied in molecular biology, especially for primer design, genome annotation, and biomarker discovery. It is scalable and adaptable to large genomic datasets, making it suitable for high-throughput bioinformatics analyses.
Suggested Citation
Kuanysh Kadirkulov & Yekaterina Golenko & Aisulu Ismailova & Iskander Baizhanov, 2025.
"Methodology for detection and comparison of tandem repeats in genomic sequences using modern statistical and vector metrics,"
International Journal of Innovative Research and Scientific Studies, Innovative Research Publishing, vol. 8(5), pages 2151-2161.
Handle:
RePEc:aac:ijirss:v:8:y:2025:i:5:p:2151-2161:id:9437
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aac:ijirss:v:8:y:2025:i:5:p:2151-2161:id:9437. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Natalie Jean (email available below). General contact details of provider: https://ijirss.com/index.php/ijirss/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.