This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Generalized power method for sparse principal component analysis

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
JournŽe, Michel (---)
Nesterov, Yurii (UniversitŽ catholique de Louvain (UCL). Center for Operations Research and Econometrics (CORE))
Richtarik, Peter (UniversitŽ catholique de Louvain (UCL). Center for Operations Research and Econometrics (CORE))
Sepulchre, Rodolphe
Abstract

In this paper we develop a new approach to sparse principal component analysis (sparse PCA). We propose two single-unit and two block optimization formulations of the sparse PCA problem, aimed at extracting a single sparse dominant principal component of a data matrix, or more components at once, respectively. While the initial formulations involve nonconvex functions, and are therefore computationally intractable, we rewrite them into the form of an optimization program involving maximization of a convex function on a compact set. The dimension of the search space is decreased enormously if the data matrix has many more columns (variables) than rows. We then propose and analyze a simple gradient method suited for the task. It appears that our algorithm has best convergence properties in the case when either the objective function or the feasible set are strongly convex, which is the case with our single-unit formulations and can be enforced in the block case. Finally, we demonstrate numerically on a set of random and gene expression test problems that our approach outperforms existing algorithms both in quality of the obtained solution and in computational speed.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.uclouvain.be/cps/ucl/doc/core/documents/coredp2008_70.pdf
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Université catholique de Louvain, Center for Operations Research and Econometrics (CORE) in its series CORE Discussion Papers with number 2008070.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 01 Nov 2008
Date of revision:
Handle: RePEc:cor:louvco:2008070

Contact details of provider:
Postal: Voie du Roman Pays 34, 1348 Louvain-la-Neuve (Belgium)
Phone: 32(10)474321
Fax: +32 10474301
Email:
Web page: http://www.uclouvain.be/core
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Alain GILLIS).

Related research
Keywords: sparse PCA; power method; gradient ascent; strongly convex sets; block algorithms.;

This paper has been announced in the following NEP Reports:

Statistics
Access and download statistics

Did you know? All RePEc services are meant to be be free forever, as they are all run by volunteers.

This page was last updated on 2009-11-19.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.