Over-represented sequences located on UTRs are potentially involved in regulatory functions
AbstractEukaryotic gene expression must be coordinated for the proper functioning of biological processes. This coordination can be achieved both at the transcriptional and post-transcriptional levels. In both cases, regulatory sequences placed at either promoter regions or on UTRs function as markers recognized by regulators that can then activate or repress different groups of genes according to necessity. While regulatory sequences involved in transcription are quite well documented, there is a lack of information on sequence elements involved in post-transcriptional regulation. We used a statistical over-representation method to identify novel regulatory elements located on UTRs. An exhaustive search approach was used to calculate the frequency of all possible n-mers (short nucleotide sequences) in 16,160 human genes of NCBI RefSeq sequences and to identify any peculiar usage of n-mers on UTRs. After a stringent filtering process, we identified circa 4,000 highly over-represented n-mers on UTRs. We provide evidence that these n-mers are potentially involved in regulatory functions. Identified n-mers overlap with previously identified binding sites for HuR and Tia1 and, AU-rich and GU-rich sequences. We determined also that over-represented n-mers are particularly enriched in a group of 159 genes directly involved in tumor formation. Finally, a method to cluster n-mer groups allowed the identification of putative gene networks.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by College of Business, University of Texas at San Antonio in its series Working Papers with number 0053.
Length: 39 pages
Date of creation:
Date of revision:
Over-represented sequences; UTRs; regulatory functions;
Find related papers by JEL classification:
- C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Leiva, Ricardo, 2007. "Linear discrimination with equicorrelated training vectors," Journal of Multivariate Analysis, Elsevier, vol. 98(2), pages 384-409, February.
- Ritter, Gunter & Gallegos, María Teresa, 2002. "Bayesian Object Identification: Variants," Journal of Multivariate Analysis, Elsevier, vol. 81(2), pages 301-334, May.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Eddie Salinas).
If references are entirely missing, you can add them using this form.