Printed from https://ideas.repec.org/a/eee/csdana/v88y2015icp15-27.html

Two simple algorithms on linear combination of multiple biomarkers to maximize partial area under the ROC curve

Author

Listed:
  • Yu, Wenbao
  • Park, Taesung

Abstract

In clinical practice, several biomarkers are often related to a specific disease, yet no single marker has enough diagnostic power on its own. An effective way to improve diagnostic accuracy is to combine multiple markers. The area under the receiver operating characteristic curve (AUC) is a widely used measure for evaluating a diagnostic tool. Su and Liu (1993) derived the best linear combination that maximizes the AUC when the markers follow a multivariate normal distribution. However, many applications do not operate over the entire range of the curve but only in particular regions of it, for example, high-specificity regions. In these cases, it is more practical to analyze the partial area under the curve (pAUC). In this paper, we propose two easily implemented algorithms to find the best linear combination of multiple biomarkers that optimizes the pAUC for a given range of specificity. Analysis of synthesized and real datasets shows that the proposed algorithms achieve larger predictive pAUC values on future observations than existing methods, such as Su and Liu’s method, logistic regression and others.
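To make the quantities in the abstract concrete, the following is a minimal sketch, not the two algorithms proposed in the paper. It computes the Su and Liu (1993) linear combination, whose coefficients are proportional to the inverse of the summed case and control covariance matrices times the difference in means, and then evaluates an empirical pAUC over a high-specificity range. The function names, the simulated data, and the step-function pAUC estimator (which assumes no tied scores) are illustrative assumptions and do not come from the paper.

```python
import numpy as np


def su_liu_coefficients(cases, controls):
    """Su and Liu (1993) direction: (S_cases + S_controls)^{-1} (mean_cases - mean_controls).

    Rows are subjects, columns are biomarkers. Sample means and covariances
    stand in for the population values (an assumption of this sketch).
    """
    cases = np.asarray(cases, dtype=float)
    controls = np.asarray(controls, dtype=float)
    mean_diff = cases.mean(axis=0) - controls.mean(axis=0)
    summed_cov = np.cov(cases, rowvar=False) + np.cov(controls, rowvar=False)
    return np.linalg.solve(summed_cov, mean_diff)


def empirical_pauc(case_scores, control_scores, max_fpr):
    """Area under the empirical step ROC curve for false positive rates in [0, max_fpr].

    A false positive rate of at most max_fpr corresponds to a specificity of at
    least 1 - max_fpr. Only the highest-scoring controls contribute.
    """
    case_scores = np.asarray(case_scores, dtype=float)
    control_scores = np.asarray(control_scores, dtype=float)
    n_controls = len(control_scores)
    # Number of controls allowed to be false positives (small tolerance for rounding).
    k = int(np.floor(max_fpr * n_controls + 1e-9))
    top_controls = np.sort(control_scores)[::-1][:k]
    # True positive rate when the threshold sits at each of those controls.
    tpr = [(case_scores > c).mean() for c in top_controls]
    return float(np.sum(tpr)) / n_controls


# Hypothetical two-marker data, simulated only for illustration.
rng = np.random.default_rng(0)
cov = [[1.0, 0.3], [0.3, 1.0]]
controls = rng.multivariate_normal([0.0, 0.0], cov, size=200)
cases = rng.multivariate_normal([1.0, 0.5], cov, size=200)

coef = su_liu_coefficients(cases, controls)
pauc = empirical_pauc(cases @ coef, controls @ coef, max_fpr=0.2)
print("coefficients:", coef)
print("pAUC for specificity >= 0.8:", pauc)
```

The pAUC over false positive rates in [0, max_fpr] can never exceed max_fpr, so values are often compared relative to that bound. Because the Su and Liu combination targets the full AUC, it need not be the best choice on a restricted specificity range, which is the gap the paper addresses.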

Suggested Citation

  • Yu, Wenbao & Park, Taesung, 2015. "Two simple algorithms on linear combination of multiple biomarkers to maximize partial area under the ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 88(C), pages 15-27.
  • Handle: RePEc:eee:csdana:v:88:y:2015:i:c:p:15-27
    DOI: 10.1016/j.csda.2014.12.002

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947314003405
    Download Restriction: Full text for ScienceDirect subscribers only.


    References listed on IDEAS

    1. Yan, Jun, 2007. "Enjoy the Joy of Copulas: With a Package copula," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 21(i04).
    2. Lori E. Dodd & Margaret S. Pepe, 2003. "Partial AUC Estimation and Regression," Biometrics, The International Biometric Society, vol. 59(3), pages 614-623, September.
    3. Margaret Sullivan Pepe & Gary Longton & Garnet L. Anderson & Michel Schummer, 2003. "Selecting Differentially Expressed Genes from Microarray Experiments," Biometrics, The International Biometric Society, vol. 59(1), pages 133-142, March.
    4. Donna Katzman McClish, 1989. "Analyzing a Portion of the ROC Curve," Medical Decision Making, vol. 9(3), pages 190-195, August.
    5. Kelly Zou & W. J. Hall, 2002. "Semiparametric and parametric transformation models for comparing diagnostic markers with paired design," Journal of Applied Statistics, Taylor & Francis Journals, vol. 29(6), pages 803-816.
    6. Jin, Hua & Lu, Ying, 2009. "The optimal linear combination of multiple predictors under the generalized linear models," Statistics & Probability Letters, Elsevier, vol. 79(22), pages 2321-2327, November.
    7. Man-Jen Hsu & Huey-Miin Hsueh, 2013. "The linear combinations of biomarkers which maximize the partial area under the ROC curves," Computational Statistics, Springer, vol. 28(2), pages 647-666, April.

    Citations


    Cited by:

    1. Rocío Aznar-Gimeno & Luis M. Esteban & Rafael del-Hoyo-Alonso & Ángel Borque-Fernando & Gerardo Sanz, 2022. "A Stepwise Algorithm for Linearly Combining Biomarkers under Youden Index Maximization," Mathematics, MDPI, vol. 10(8), pages 1-26, April.
    2. Yusuf Yıldırım & Anirban Sanyal, 2022. "Evaluating the Effectiveness of Early Warning Indicators: An Application of Receiver Operating Characteristic Curve Approach to Panel Data," Scientific Annals of Economics and Business (continues Analele Stiintifice), Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, vol. 69(4), pages 557-597, December.
    3. Rocío Aznar-Gimeno & Luis M. Esteban & Gerardo Sanz & Rafael del-Hoyo-Alonso & Ricardo Savirón-Cornudella, 2021. "Incorporating a New Summary Statistic into the Min–Max Approach: A Min–Max–Median, Min–Max–IQR Combination of Biomarkers for Maximising the Youden Index," Mathematics, MDPI, vol. 9(19), pages 1-17, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jialiang Li & Jason P. Fine, 2010. "Weighted area under the receiver operating characteristic curve and its application to gene selection," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(4), pages 673-692, August.
    2. Man-Jen Hsu & Huey-Miin Hsueh, 2013. "The linear combinations of biomarkers which maximize the partial area under the ROC curves," Computational Statistics, Springer, vol. 28(2), pages 647-666, April.
    3. Schmid Matthias & Hothorn Torsten & Krause Friedemann & Rabe Christina, 2012. "A PAUC-based Estimation Technique for Disease Classification and Biomarker Selection," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-26, October.
    4. Gigliarano, Chiara & Figini, Silvia & Muliere, Pietro, 2014. "Making classifier performance comparisons when ROC curves intersect," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 300-312.
    5. Yousef, Waleed A., 2013. "Assessing classifiers in terms of the partial area under the ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 51-70.
    6. M.L. Nores & M.P. Díaz, 2016. "Bootstrap hypothesis testing in generalized additive models for comparing curves of treatments in longitudinal studies," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(5), pages 810-826, April.
    7. Chen, Zhelun & O’Neill, Zheng & Wen, Jin & Pradhan, Ojas & Yang, Tao & Lu, Xing & Lin, Guanjing & Miyata, Shohei & Lee, Seungjae & Shen, Chou & Chiosa, Roberto & Piscitelli, Marco Savino & Capozzoli, , 2023. "A review of data-driven fault detection and diagnostics for building HVAC systems," Applied Energy, Elsevier, vol. 339(C).
    8. Eunhee Kim & Zheng Zhang & Youdan Wang & Donglin Zeng, 2014. "Power calculation for comparing diagnostic accuracies in a multi-reader, multi-test design," Biometrics, The International Biometric Society, vol. 70(4), pages 1033-1041, December.
    9. Göran Kauermann & Renate Meyer, 2014. "Penalized marginal likelihood estimation of finite mixtures of Archimedean copulas," Computational Statistics, Springer, vol. 29(1), pages 283-306, February.
    10. Li-Xuan Qin & Steven G. Self, 2006. "The Clustering of Regression Models Method with Applications in Gene Expression Data," Biometrics, The International Biometric Society, vol. 62(2), pages 526-533, June.
    11. Peterson, A. Townsend & Papeş, Monica & Soberón, Jorge, 2008. "Rethinking receiver operating characteristic analysis applications in ecological niche modeling," Ecological Modelling, Elsevier, vol. 213(1), pages 63-72.
    12. Merve Basol & Dincer Goksuluk & Ergun Karaagaoglu, 2023. "Comparing the diagnostic performance of methods used in a full-factorial design multi-reader multi-case studies," Computational Statistics, Springer, vol. 38(3), pages 1537-1553, September.
    13. Pisit Leeahtam & Chukiat Chaiboonsri & Kanchana Chokethaworn & Prasert Chaitip & Songsak Sriboonchitta, 2011. "The Appropriate Model and Dependence Measures of Thailand’s Exchange Rate and Malaysia’s Exchange Rate: Linear, Nonlinear and Copulas Approach," Journal of Knowledge Management, Economics and Information Technology, ScientificPapers.org, vol. 1(6), pages 1-14, October.
    14. Majeed, Fahd & Khanna, Madhu & Miao, Ruiqing & Betes, Elena Blanc & Hudiburg, Tara & DeLucia, Evan, 2022. "Payment for carbon mitigation reduces riskiness of bioenergy crop production," 2022 Annual Meeting, July 31-August 2, Anaheim, California 322277, Agricultural and Applied Economics Association.
    15. Margaret Sullivan Pepe & Tianxi Cai, 2004. "The Analysis of Placement Values for Evaluating Discriminatory Measures," Biometrics, The International Biometric Society, vol. 60(2), pages 528-535, June.
    16. Gong Chen & Qing Zhou, 2010. "Heterogeneity in DNA Multiple Alignments: Modeling, Inference, and Applications in Motif Finding," Biometrics, The International Biometric Society, vol. 66(3), pages 694-704, September.
    17. Ozonder, Gozde & Miller, Eric J., 2021. "Longitudinal investigation of skeletal activity episode timing decisions – A copula approach," Journal of choice modelling, Elsevier, vol. 40(C).
    18. Debashis Ghosh & Arul Chinnaiyan, 2004. "Covariate adjustment in the analysis of microarray data from clinical studies," The University of Michigan Department of Biostatistics Working Paper Series 1030, Berkeley Electronic Press.
    19. Marius Galabe Sampid & Haslifah M Hasim & Hongsheng Dai, 2018. "Refining value-at-risk estimates using a Bayesian Markov-switching GJR-GARCH copula-EVT model," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-33, June.
    20. Ángel Beade & Manuel Rodríguez & José Santos, 2024. "Multiperiod Bankruptcy Prediction Models with Interpretable Single Models," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1357-1390, September.
