IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v78y2022i3p1045-1055.html
   My bibliography  Save this article

Addressing patient heterogeneity in disease predictive model development

Author

Listed:
  • Xu Gao
  • Weining Shen
  • Jing Ning
  • Ziding Feng
  • Jianhua Hu

Abstract

This paper addresses patient heterogeneity associated with prediction problems in biomedical applications. We propose a systematic hypothesis testing approach to determine the existence of patient subgroup structure and the number of subgroups in patient population if subgroups exist. A mixture of generalized linear models is considered to model the relationship between the disease outcome and patient characteristics and clinical factors, including targeted biomarker profiles. We construct a test statistic based on expectation maximization (EM) algorithm and derive its asymptotic distribution under the null hypothesis. An important computational advantage of the test is that the involved parameter estimates under the complex alternative hypothesis can be obtained through a small number of EM iterations, rather than optimizing the objective function. We demonstrate the finite sample performance of the proposed test in terms of type‐I error rate and power, using extensive simulation studies. The applicability of the proposed method is illustrated through an application to a multicenter prostate cancer study.

Suggested Citation

  • Xu Gao & Weining Shen & Jing Ning & Ziding Feng & Jianhua Hu, 2022. "Addressing patient heterogeneity in disease predictive model development," Biometrics, The International Biometric Society, vol. 78(3), pages 1045-1055, September.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:1045-1055
    DOI: 10.1111/biom.13514
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13514
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13514?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    2. Ratkovic, Marc & Tingley, Dustin, 2017. "Sparse Estimation and Uncertainty with Application to Subgroup Analysis," Political Analysis, Cambridge University Press, vol. 25(1), pages 1-40, January.
    3. Jiahua Chen & Pengfei Li & Yuejiao Fu, 2012. "Inference on the Order of a Normal Mixture," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1096-1105, September.
    4. Li, Pengfei & Chen, Jiahua, 2010. "Testing the Order of a Finite Mixture," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1084-1092.
    5. Xiao Song & Margaret Pepe, 2004. "Evaluating Markers for Selecting a Patient's Treatment," UW Biostatistics Working Paper Series 1029, Berkeley Electronic Press.
    6. Ailin Fan & Rui Song & Wenbin Lu, 2017. "Change-Plane Analysis for Subgroup Detection and Sample Size Calculation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 769-778, April.
    7. Lihui Zhao & Lu Tian & Tianxi Cai & Brian Claggett & L. J. Wei, 2013. "Effectively Selecting a Target Population for a Future Comparative Study," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 527-539, June.
    8. Shujie Ma & Jian Huang, 2017. "A Concave Pairwise Fusion Approach to Subgroup Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 410-423, January.
    9. Xiao Song & Margaret Sullivan Pepe, 2004. "Evaluating Markers for Selecting a Patient's Treatment," Biometrics, The International Biometric Society, vol. 60(4), pages 874-883, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ying Huang & Juhee Cho & Youyi Fong, 2021. "Threshold‐based subgroup testing in logistic regression models in two‐phase sampling designs," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 291-311, March.
    2. Peng Jin & Wenbin Lu & Yu Chen & Mengling Liu, 2023. "Change‐plane analysis for subgroup detection with a continuous treatment," Biometrics, The International Biometric Society, vol. 79(3), pages 1920-1933, September.
    3. Ailin Fan & Rui Song & Wenbin Lu, 2017. "Change-Plane Analysis for Subgroup Detection and Sample Size Calculation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 769-778, April.
    4. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    5. Ruo-fan Wu & Ming Zheng & Wen Yu, 2016. "Subgroup Analysis with Time-to-Event Data Under a Logistic-Cox Mixture Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(3), pages 863-878, September.
    6. Shengli An & Peter Zhang & Hong-Bin Fang, 2023. "Subgroup Identification in Survival Outcome Data Based on Concordance Probability Measurement," Mathematics, MDPI, vol. 11(13), pages 1-10, June.
    7. Cai, Tingting & Li, Jianbo & Zhou, Qin & Yin, Songlou & Zhang, Riquan, 2024. "Subgroup detection based on partially linear additive individualized model with missing data in response," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    8. Roland A. Matsouaka & Junlong Li & Tianxi Cai, 2014. "Evaluating marker-guided treatment selection strategies," Biometrics, The International Biometric Society, vol. 70(3), pages 489-499, September.
    9. Wichitchan, Supawadee & Yao, Weixin & Yang, Guangren, 2019. "Hypothesis testing for finite mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 180-189.
    10. Dana Johnson & Wenbin Lu & Marie Davidian, 2023. "A general framework for subgroup detection via one‐step value difference estimation," Biometrics, The International Biometric Society, vol. 79(3), pages 2116-2126, September.
    11. Ying Huang & Eric Laber, 2016. "Personalized Evaluation of Biomarker Value: A Cost-Benefit Perspective," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 8(1), pages 43-65, June.
    12. Veronika Skrivankova & Patrick J. Heagerty, 2018. "Single index methods for evaluation of marker†guided treatment rules based on multivariate marker panels," Biometrics, The International Biometric Society, vol. 74(2), pages 663-672, June.
    13. Baosheng Liang & Peng Wu & Xingwei Tong & Yanping Qiu, 2020. "Regression and subgroup detection for heterogeneous samples," Computational Statistics, Springer, vol. 35(4), pages 1853-1878, December.
    14. Jingxiang Chen & Yufeng Liu & Donglin Zeng & Rui Song & Yingqi Zhao & Michael R. Kosorok, 2016. "Comment," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 942-947, July.
    15. Ying Huang & Youyi Fong, 2014. "Identifying optimal biomarker combinations for treatment selection via a robust kernel method," Biometrics, The International Biometric Society, vol. 70(4), pages 891-901, December.
    16. Wentian Guo & Yuan Ji & Daniel V. T. Catenacci, 2017. "A subgroup cluster-based Bayesian adaptive design for precision medicine," Biometrics, The International Biometric Society, vol. 73(2), pages 367-377, June.
    17. Holzmann, Hajo & Schwaiger, Florian, 2016. "Testing for the number of states in hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 318-330.
    18. Huicong Yu & Jiaqi Wu & Weiping Zhang, 2024. "Simultaneous subgroup identification and variable selection for high dimensional data," Computational Statistics, Springer, vol. 39(6), pages 3181-3205, September.
    19. James Y. Dai & C. Jason Liang & Michael LeBlanc & Ross L. Prentice & Holly Janes, 2018. "Case†only approach to identifying markers predicting treatment effects on the relative risk scale," Biometrics, The International Biometric Society, vol. 74(2), pages 753-763, June.
    20. Zhang, Xiaochen & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2022. "Subgroup analysis for high-dimensional functional regression," Journal of Multivariate Analysis, Elsevier, vol. 192(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:1045-1055. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.