To how many simultaneous hypothesis tests can normal student's t or bootstrap calibrations be applied
In the analysis of microarray data, and in some other contemporary statistical problems, it is not uncommon to apply hypothesis tests in a highly simultaneous way. The number, N say, of tests used can be much larger than the sample sizes, n, to which the tests are applied, yet we wish to calibrate the tests so that the overall level of the simultaneous test is accurate. Often the sampling distribution is quite different for each test, so there may not be an opportunity to combine data across samples. In this setting, how large can N be, as a function of n, before level accuracy becomes poor? Here we answer this question in cases where the statistic under test is of Student's t type. We show that if either the normal or Student t distribution is used for calibration, then the level of the simultaneous test is accurate provided that log N increases at a strictly slower rate than n1/3 as n diverges. On the other hand, if bootstrap methods are used for calibration, then we may choose log N almost as large as n1/2 and still achieve asymptotic-level accuracy. The implications of these results are explored both theoretically and numerically.
|Date of creation:||Dec 2007|
|Publication status:||Published in Journal of the American Statistical Association, December, 2007, 102(480), pp. 1282-1288. ISSN: 0162-1459|
|Contact details of provider:|| Postal: LSE Library Portugal Street London, WC2A 2HD, U.K.|
Phone: +44 (020) 7405 7686
Web page: http://www.lse.ac.uk/
More information through EDIRC
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Efron, Bradley, 2004. "Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 96-104, January.
- Fan, Jianqing & Peng, Heng & Huang, Tao, 2005. "Semilinear High-Dimensional Model for Normalization of Microarray Data: A Theoretical Analysis and Partial Consistency," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 781-796, September.
When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:5399. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (LSERO Manager)
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.