IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v80y2010i17-18p1313-1319.html

An integrative pathway-based clinical-genomic model for cancer survival prediction

Author

Listed:
  • Chen, Xi
  • Wang, Lily
  • Ishwaran, Hemant

Abstract

Prediction models that use gene expression levels are now being proposed for personalized treatment of cancer, but building accurate models that are easy to interpret remains a challenge. In this paper, we describe an integrative clinical-genomic approach that combines both genomic pathway and clinical information. First, we summarize information from genes in each pathway using Supervised Principal Components (SPCA) to obtain pathway-based genomic predictors. Next, we build a prediction model based on clinical variables and pathway-based genomic predictors using Random Survival Forests (RSF). Our rationale for this two-stage procedure is that the underlying disease process may be influenced by environmental exposure (measured by clinical variables) and perturbations in different pathways (measured by pathway-based genomic variables), as well as their interactions. Using two cancer microarray datasets, we show that the pathway-based clinical-genomic model outperforms gene-based clinical-genomic models, with improved prediction accuracy and interpretability.

Suggested Citation

  • Chen, Xi & Wang, Lily & Ishwaran, Hemant, 2010. "An integrative pathway-based clinical-genomic model for cancer survival prediction," Statistics & Probability Letters, Elsevier, vol. 80(17-18), pages 1313-1319, September.
  • Handle: RePEc:eee:stapro:v:80:y:2010:i:17-18:p:1313-1319
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-7152(10)00120-3
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Lily Wang & Bing Zhang & Russell D Wolfinger & Xi Chen, 2008. "An Integrated Approach for the Analysis of Biological Pathways using Mixed Models," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-9, July.
    2. Robert Tibshirani & Guenther Walther & Trevor Hastie, 2001. "Estimating the number of clusters in a data set via the gap statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(2), pages 411-423.
    3. Susmita Datta & Jennifer Le-Rademacher & Somnath Datta, 2007. "Predicting Patient Survival from Microarray Data by Accelerated Failure Time Modeling Using Partial Least Squares and LASSO," Biometrics, The International Biometric Society, vol. 63(1), pages 259-271, March.
    4. Charles M. Perou & Therese Sørlie & Michael B. Eisen & Matt van de Rijn & Stefanie S. Jeffrey & Christian A. Rees & Jonathan R. Pollack & Douglas T. Ross & Hilde Johnsen & Lars A. Akslen & Øystein Flu, 2000. "Molecular portraits of human breast tumours," Nature, Nature, vol. 406(6797), pages 747-752, August.
    5. Bair, Eric & Hastie, Trevor & Paul, Debashis & Tibshirani, Robert, 2006. "Prediction by Supervised Principal Components," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 119-137, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang Zhu & Wang C.Y., 2010. "Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-33, June.
    2. Zhiguang Huo & Li Zhu & Tianzhou Ma & Hongcheng Liu & Song Han & Daiqing Liao & Jinying Zhao & George Tseng, 2020. "Two-Way Horizontal and Vertical Omics Integration for Disease Subtype Discovery," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(1), pages 1-22, April.
    3. Gaynor, Sheila & Bair, Eric, 2017. "Identification of relevant subtypes via preweighted sparse clustering," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 139-154.
    4. Zhiguang Huo & Ying Ding & Silvia Liu & Steffi Oesterreich & George Tseng, 2016. "Meta-Analytic Framework for Sparse K -Means to Identify Disease Subtypes in Multiple Transcriptomic Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 27-42, March.
    5. Lingsong Meng & Dorina Avram & George Tseng & Zhiguang Huo, 2022. "Outcome‐guided sparse K‐means for disease subtype discovery via integrating phenotypic data with high‐dimensional transcriptomic data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(2), pages 352-375, March.
    6. Erika S. Helgeson & David M. Vock & Eric Bair, 2021. "Nonparametric cluster significance testing with reference to a unimodal null distribution," Biometrics, The International Biometric Society, vol. 77(4), pages 1215-1226, December.
    7. Eric Hillebrand & Huiyu Huang & Tae-Hwy Lee & Canlin Li, 2018. "Using the Entire Yield Curve in Forecasting Output and Inflation," Econometrics, MDPI, vol. 6(3), pages 1-27, August.
    8. Yang, Xi & Hoadley, Katherine A. & Hannig, Jan & Marron, J.S., 2023. "Jackstraw inference for AJIVE data integration," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    9. Qin Yu & Xin Zhou & Jia Zhou & Zemin Zheng, 2026. "Confidence intervals for high-dimensional accelerated failure time models under measurement errors," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 32(1), pages 1-18, March.
    10. Tomohiro Ando & Ruey S. Tsay, 2009. "Model selection for generalized linear models with factor‐augmented predictors," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 25(3), pages 207-235, May.
    11. Thiemo Fetzer & Samuel Marden, 2017. "Take What You Can: Property Rights, Contestability and Conflict," Economic Journal, Royal Economic Society, vol. 0(601), pages 757-783, May.
    12. Jan Radovan & Igor Masten, 2025. "Nowcasting economic activity in a small open CESEE economy using mixed frequency data," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 52(4), pages 721-776, November.
    13. Khanh Duong, 2024. "Is meritocracy just? New evidence from Boolean analysis and Machine learning," Journal of Computational Social Science, Springer, vol. 7(2), pages 1795-1821, October.
    14. Daniel Agness & Travis Baseler & Sylvain Chassang & Pascaline Dupas & Erik Snowberg, 2025. "Valuing the Time of the Self-Employed," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 92(6), pages 3471-3503.
    15. Batool, Fatima & Hennig, Christian, 2021. "Clustering with the Average Silhouette Width," Computational Statistics & Data Analysis, Elsevier, vol. 158(C).
    16. Nicoleta Serban & Huijing Jiang, 2012. "Multilevel Functional Clustering Analysis," Biometrics, The International Biometric Society, vol. 68(3), pages 805-814, September.
    17. Manish G & Anil Kumar Badana & Rama Rao Malla, 2017. "Emerging Diagnostic and Prognostic Biomarkers of Triple Negative Breast Cancer," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 1(3), pages 561-565, August.
    18. Jacob Elnaggar & Fern Tsien & Lucio Miele & Chindo Hicks & Clayton Yates & Melisa Davis, 2019. "An Integrative Genomics Approach for Associating Genetic Susceptibility with the Tumor Immune Microenvironment in Triple Negative Breast Cancer," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 15(1), pages 1-12, February.
    19. Egashira, Kento & Yata, Kazuyoshi & Aoshima, Makoto, 2024. "Asymptotic properties of hierarchical clustering in high-dimensional settings," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    20. Patrick J. Cunniff & Nicole Sivetz & Damianos Skopelitis & Olaf Klingbeil & Daniel Toobian & Diogo Maia-Silva & Mikala Egeblad & Christopher R. Vakoc, 2025. "KLF5 enables dichotomous lineage programs in pancreatic cancer via the AAA+ ATPase coactivators RUVBL1 and RUVBL2," Nature Communications, Nature, vol. 16(1), pages 1-20, December.

    More about this item

    Keywords

    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:80:y:2010:i:17-18:p:1313-1319. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.