This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Finding cancer subtypes in microarray data using random projections

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Debashis Ghosh (University of Michigan)
Abstract

One of the benefits of profiling of cancer samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Such subgroups have typically been found in microarray data using hierarchical clustering. A major problem in interpretation of the output is determining the number of clusters. We approach the problem of determining disease subtypes using mixture models. A novel estimation procedure of the parameters in the mixture model is developed based on a combination of random projections and the expectation-maximization algorithm. Because the approach is probabilistic, our approach provides a measure for the number of true clusters in a given dataset. We illustrate our approach with applications to both simulated and real microarray data.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.bepress.com/cgi/viewcontent.cgi?article=1045&context=umichbiostat
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Berkeley Electronic Press in its series The University of Michigan Department of Biostatistics Working Paper Series with number 1045.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 20 Oct 2004
Date of revision:
Handle: RePEc:bep:mchbio:1045

Note: oai:bepress.com:umichbiostat-1045
Contact details of provider:
Web page: http://www.bepress.com

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords: expectation-maximization algorithm; gene expression data; high-dimensional data; mixture models;

Statistics
Access and download statistics

Did you know? You can use IDEAS to provide links to papers and articles in your course syllabus.

This page was last updated on 2009-12-15.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.