In a non-parametric regression, the heteroscedasticity (dependence of the variance of the regression error on the predictor) can be a serious complication in estimation or visualization of an underlying regression function. If a controlled sampling is permitted, then the statistician can choose the design of predictors which attenuates the effect of heteroscedasticity. It is proposed to use a design which minimizes the mean integrated squared error of the regression function estimation. Then the corresponding optimal design density is proportional to the standard deviation of the regression error (the so-called scale function). Because in general the statistician does not know an underlying scale function, the natural question is as follows: is it possible to suggest a sequential design which performs as well as an oracle that knows the underlying scale function? The answer is 'yes', and a corresponding sequential procedure is developed. It is proved, for the first time in the literature, that a data-driven sequential design, together with an adaptive regression estimator, can mimic the oracle and be sharp minimax. Further, it is shown that the suggested method is feasible for small samples. Copyright (c) Board of the Foundation of the Scandinavian Journal of Statistics 2008.
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
Publisher Info
Article provided by Danish Society for Theoretical Statistics, Finnish Statistical Society, Norwegian Statistical Association and Swedish Statistical Association in its journal Scandinavian Journal of Statistics.
Did you know? All full texts are decentralized with the publishers, none reside on this server, thus making it possible to offer this service for free to all parties.