Over-represented sequences located on UTRs are potentially involved in regulatory functions
Eukaryotic gene expression must be coordinated for the proper functioning of biological processes. This coordination can be achieved both at the transcriptional and post-transcriptional levels. In both cases, regulatory sequences placed at either promoter regions or on UTRs function as markers recognized by regulators that can then activate or repress different groups of genes according to necessity. While regulatory sequences involved in transcription are quite well documented, there is a lack of information on sequence elements involved in post-transcriptional regulation. We used a statistical over-representation method to identify novel regulatory elements located on UTRs. An exhaustive search approach was used to calculate the frequency of all possible n-mers (short nucleotide sequences) in 16,160 human genes of NCBI RefSeq sequences and to identify any peculiar usage of n-mers on UTRs. After a stringent filtering process, we identified circa 4,000 highly over-represented n-mers on UTRs. We provide evidence that these n-mers are potentially involved in regulatory functions. Identified n-mers overlap with previously identified binding sites for HuR and Tia1 and, AU-rich and GU-rich sequences. We determined also that over-represented n-mers are particularly enriched in a group of 159 genes directly involved in tumor formation. Finally, a method to cluster n-mer groups allowed the identification of putative gene networks.
|Date of creation:|
|Date of revision:|
|Contact details of provider:|| Postal: 6900 North Loop 1604 West, San Antonio, TX 78249-0631|
Web page: http://business.utsa.edu/wps
More information through EDIRC
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Leiva, Ricardo, 2007. "Linear discrimination with equicorrelated training vectors," Journal of Multivariate Analysis, Elsevier, vol. 98(2), pages 384-409, February.
- Ritter, Gunter & Gallegos, María Teresa, 2002. "Bayesian Object Identification: Variants," Journal of Multivariate Analysis, Elsevier, vol. 81(2), pages 301-334, May.
When requesting a correction, please mention this item's handle: RePEc:tsa:wpaper:0053. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Eddie Salinas)
If references are entirely missing, you can add them using this form.