Novel graphical representation of genome sequence and its applications in similarity analysis
AbstractIn order to compare different genome sequences, an alignment-free method has been proposed. Considering the essential property of sequence is sequentiality, we define a compound transformation which transforms a genome sequence into a sparse 16 by L−1 matrix M based on 16 kinds of 2-mer (dinucleotides). Furthermore, we found the transformation above-mentioned is an order-preserving transformation (OPT). Based on the theory of matrix analysis, we derive a 16-dimensional vector to characterize a genome sequence via singular value decomposition (SVD) of M. Finally, we analyze the similarities among multiple sequences from 20 eutherian species. The experiment results show that our approach performs well in the field of sequence analysis.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
Bibliographic InfoArticle provided by Elsevier in its journal Physica A: Statistical Mechanics and its Applications.
Volume (Year): 391 (2012)
Issue (Month): 23 ()
Contact details of provider:
Web page: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/
Genome sequence; Order-preserving; Singular value decomposition (SVD); Similarity analysis;
You can help add them by filling out this form.
CitEc Project, subscribe to its RSS feed for this item.
- Ma, Tingting & Liu, Yuxin & Dai, Qi & Yao, Yuhua & He, Ping-an, 2014. "A graphical representation of protein based on a novel iterated function system," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 403(C), pages 21-28.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Zhang, Lei).
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.