Privacy preservation in data mining using hybrid perturbation methods: an application to bankruptcy prediction in banks
Today, the data related to business, finance and healthcare pose problems for Privacy-Preserving Data Mining (PPDM). Privacy regulations and concerns prevent data owners from sharing data for mining purposes. To circumvent this problem, data owners must design strategies to meet privacy requirements and ensure valid data mining results. This paper proposes the hybridisation of the random projection and random rotation methods for privacy-preserving classification. The hybrid method is tested on six benchmark data sets and four bank bankruptcy data sets. These methods ensure the privacy and secrecy of bank data and the resulting data set is mined without a considerable loss of accuracy. A multilayer perceptron, decision tree J48 and logistic regression are used as classifiers. The results of a tenfold cross-validation and t-test indicate improved average accuracies for the hybrid privacy preservation method compared to when random projection is used alone. The reasons for the superior performance of the hybrid privacy preservation method are also highlighted.
Volume (Year): 1 (2009)
Issue (Month): 4 ()
|Contact details of provider:|| Web page: http://www.inderscience.com/browse/index.php?journalID=282|
When requesting a correction, please mention this item's handle: RePEc:ids:injdan:v:1:y:2009:i:4:p:313-331. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Graham Langley)
If references are entirely missing, you can add them using this form.