An imputation method for categorical variables with application to nonlinear principal component analysis
The problem of missing data in building multidimensional composite indicators is a delicate problem which is often underrated. An imputation method particularly suitable for categorical data is proposed. This method is discussed in detail in the framework of nonlinear principal component analysis and compared to other missing data treatments which are commonly used in this analysis. Its performance vs. these other methods is evaluated throughout a simulation procedure performed on both an artificial case, varying the experimental conditions, and a real case. The proposed procedure is implemented using R1.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- James R. Carpenter & Michael G. Kenward & Stijn Vansteelandt, 2006. "A comparison of multiple imputation and doubly robust estimation for analyses with missing data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(3), pages 571-584.
- Pier Ferrari & Paola Annoni & Giancarlo Manzi, 2010.
"Evaluation and comparison of European countries: public opinion on services,"
Quality & Quantity: International Journal of Methodology,
Springer, vol. 44(6), pages 1191-1205, October.
- Pier Alda Ferrari & Paola Annoni & Giancarlo Manzi, 2007. "Evaluation and comparison of European countries: public opinion on services," UNIMI - Research Papers in Economics, Business, and Statistics unimi-1058, Universitá degli Studi di Milano.
- Siddique, Juned & Belin, Thomas R., 2008. "Using an Approximate Bayesian Bootstrap to multiply impute nonignorable missing data," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 405-415, December.
- Serneels, Sven & Verdonck, Tim, 2009. "Principal component regression for data containing outliers and missing elements," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3855-3863, September.
- White, Ian R. & Daniel, Rhian & Royston, Patrick, 2010. "Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2267-2275, October.
- Christopher Paul & William Mason & Daniel McCaffrey & Sarah Fox, 2008. "A cautionary case study of approaches to the treatment of missing data," Statistical Methods and Applications, Springer, vol. 17(3), pages 351-372, July.
When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:7:p:2410-2420. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Zhang, Lei)
If references are entirely missing, you can add them using this form.