Wim van Putten () (Department of Statistics, Erasmus MC - Daniel den Hoed Cancer Center, Rotterdam, Netherlands)
Abstract
Classification and Regression Tree analysis can be applied for the identification and assessment of prognostic factors in clinical research. It involves repeated subdivisions of a group of subjects on the basis of the choice of optimal cutpoints of binary, ordinal or continuous covariates, that maximizes a certain split criterion. I will describe a specific implementation of CART as Stata ado file cart.ado for failure time data with as split criterion an adjusted P-value. The P-value is associated with the chisquare logrank statistic based on residuals. The adjustment is for the multiple testing associated with the search for the optimal cutpoint with the maximum chisquare value (Lausen, 1997). Examples of applications are given. CART has a serious risk of overfitting. However it can be a useful exploratory tool in addition to more standard regression type techniques. [Reference: Lausen B. et al, The regression tree method and its application in nutritional epidemiology. Informatik, Biometrie und Epidemiologie in Medizin und Biologie 28 (1), 1-13, 1997.]
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.