IDEAS home Printed from https://ideas.repec.org/p/bep/ucbbio/1147.html
   My bibliography  Save this paper

Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data

Author

Listed:
  • Sunduz Keles

    (Dept. of Statistics & Biostatistics & Medical Informatics, University of Wisconsin, Madison)

  • Mark van der Laan

    (Division of Biostatistics, School of Public Health, University of California, Berkeley)

  • Sandrine Dudoit

    (Division of Biostatistics, School of Public Health, University of California, Berkeley)

  • Simon Cawley

    (Affymetrix, 3380 Central Expressway, Santa Clara, CA 95051)

Abstract

Cawley et al. (2004) have recently mapped the locations of binding sites for three transcription factors along human chromosomes 21 and 22 using ChIP-Chip experiments. ChIP-Chip experiments are a new approach to the genome-wide identification of transcription factor binding sites and consist of chromatin (Ch) immunoprecipitation (IP) of transcription factor-bound genomic DNA followed by high density oligonucleotide hybridization (Chip) of the IP-enriched DNA. We investigate the ChIP-Chip data structure and propose methods for inferring the location of transcription factor binding sites from these data. The proposed methods involve testing for each probe whether it is part of a bound sequence or not using a scan statistic that takes into account the spatial structure of the data. Different multiple testing procedures are considered for controlling the family-wise error rate and false discovery rate. A nested-Bonferroni adjustment, that is more powerful than the traditional Bonferroni adjustment when the test statistics are dependent, is discussed. Simulation studies show that taking into account the spatial structure of the data substantially improves the sensitivity of the multiple testing procedures. Application of the proposed methods to ChIP-Chip data for transcription factor p53 identified many potential target binding regions along human chromosomes 21 and 22. Among these identified regions, 18% fall within a 3kb vicinity of the 5'UTR of a known gene or CpG island, 31% fall between the codon start site and the codon end site of a known gene but not inside an exon. More than half of these potential target sequences contain the p53 consensus binding site or very close matches to it. Moreover, these target segments include the 13 experimentally verified p53 binding regions of Cawley et al. (2004), as well as 49 additional regions that show higher hybridization signal than these 13 experimentally verified regions.

Suggested Citation

  • Sunduz Keles & Mark van der Laan & Sandrine Dudoit & Simon Cawley, 2004. "Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data," U.C. Berkeley Division of Biostatistics Working Paper Series 1147, Berkeley Electronic Press.
  • Handle: RePEc:bep:ucbbio:1147
    Note: oai:bepress.com:ucbbiostat-1147
    as

    Download full text from publisher

    File URL: http://www.bepress.com/cgi/viewcontent.cgi?article=1147&context=ucbbiostat
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Olbricht Gayla R. & Craig Bruce A. & Doerge Rebecca W., 2012. "Incorporating Genomic Annotation into a Hidden Markov Model for DNA Methylation Tiling Array Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-37, November.
    2. Hongkai Ji & Wing Hung Wong, 2006. "Computational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes," Biometrics, The International Biometric Society, vol. 62(3), pages 645-663, September.
    3. Anat Reiner-Benaim, 2016. "Scan Statistic Tail Probability Assessment Based on Process Covariance and Window Size," Methodology and Computing in Applied Probability, Springer, vol. 18(3), pages 717-745, September.
    4. Jonathan A. L. Gelfond & Mayetri Gupta & Joseph G. Ibrahim, 2009. "A Bayesian Hidden Markov Model for Motif Discovery Through Joint Modeling of Genomic Sequence and ChIP-Chip Data," Biometrics, The International Biometric Society, vol. 65(4), pages 1087-1095, December.
    5. Bérard Caroline & Martin-Magniette Marie-Laure & Brunaud Véronique & Aubourg Sébastien & Robin Stéphane, 2011. "Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-22, November.
    6. Raphael Gottardo & Wei Li & W. Evan Johnson & X. Shirley Liu, 2008. "A Flexible and Powerful Bayesian Hierarchical Model for ChIP–Chip Experiments," Biometrics, The International Biometric Society, vol. 64(2), pages 468-478, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bep:ucbbio:1147. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.bepress.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.