We develop mathematical models for high-dimensional binary distributions, and apply them to the study of smoothing methods for sparse binary data. Specifically, we treat the kernel-type estimator developed by Aitchison and Aitken (Biometrika63 (1976), 413-420). Our analysis is of an asymptotic nature. It permits a concise account of the way in which data dimension, data sparseness, and distribution smoothness interact to determine the over-all performance of smoothing methods. Previous work on this problem has been hampered by the requirement that the data dimension be fixed. Our approach allows dimension to increase with sample size, so that the theoretical model may accurately reflect the situations encountered in practice; e.g., approximately 20 dimensions and 40 data points. We compare the performance of kernel estimators with that of the cell frequency estimator, and describe the effectiveness of cross-validation.
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
Volume (Year): 44 (1993) Issue (Month): 2 (February) Pages: 321-344 Download reference. The following formats are available: HTML
(with abstract),
plain text
(with abstract),
BibTeX,
RIS (EndNote, RefMan, ProCite),
ReDIF
For technical questions regarding this item, or to correct its listing, contact: (Heidi Boesdal).
Related research
Keywords:
Cited by: (explanations, Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.)