
Prognostic Modeling with Logistic Regression Analysis

Author

Listed:
  • Ewout W. Steyerberg

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

  • Marinus J. C. Eijkemans

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

  • Frank E. Harrell Jr

    (Division of Biostatistics and Epidemiology, Department of Health Evaluation Sciences, University of Virginia, Charlottesville, Virginia)

  • J. Dik F. Habbema

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

Abstract

Clinical decision making often requires estimates of the likelihood of a dichotomous outcome in individual patients. When empirical data are available, these estimates may well be obtained from a logistic regression model. Several strategies may be followed in the development of such a model. In this study, the authors compare alternative strategies in 23 small subsamples from a large data set of patients with an acute myocardial infarction, where they developed predictive models for 30-day mortality. Evaluations were performed in an independent part of the data set. Specifically, the authors studied the effect of coding of covariables and stepwise selection on discriminative ability of the resulting model, and the effect of statistical “shrinkage” techniques on calibration. As expected, dichotomization of continuous covariables implied a loss of information. Remarkably, stepwise selection resulted in less discriminating models compared to full models including all available covariables, even when more than half of these were randomly associated with the outcome. Using qualitative information on the sign of the effect of predictors slightly improved the predictive ability. Calibration improved when shrinkage was applied on the standard maximum likelihood estimates of the regression coefficients. In conclusion, a sensible strategy in small data sets is to apply shrinkage methods in full models that include well-coded predictors that are selected based on external information.
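
The abstract does not say which shrinkage technique was used, so the following is only a minimal sketch of one common choice, the heuristic uniform shrinkage factor s = (model chi-square - df) / model chi-square. It assumes the slopes come from an already fitted maximum likelihood logistic model. The function names and all numbers are hypothetical and are not taken from the article.

```python
import math


def heuristic_shrinkage_factor(model_chi2, df):
    """Uniform shrinkage factor s = (chi2 - df) / chi2 (assumed technique).

    model_chi2: likelihood ratio chi-square of the fitted model.
    df: degrees of freedom spent on the covariables.
    """
    if model_chi2 <= 0:
        raise ValueError("model_chi2 must be positive")
    # Clamp at zero: a model with chi2 <= df carries no usable signal.
    return max(0.0, (model_chi2 - df) / model_chi2)


def shrink_slopes(slopes, s):
    """Multiply every maximum likelihood slope by the shrinkage factor."""
    return [s * b for b in slopes]


def predicted_risk(intercept, slopes, x):
    """Logistic model: risk = 1 / (1 + exp(-linear predictor))."""
    lp = intercept + sum(b * xi for b, xi in zip(slopes, x))
    return 1.0 / (1.0 + math.exp(-lp))


def refit_intercept(slopes, rows, target_rate, lo=-20.0, hi=20.0):
    """Find the intercept whose mean predicted risk equals target_rate.

    Mean risk increases monotonically with the intercept, so plain
    bisection on [lo, hi] is sufficient.
    """
    for _ in range(100):
        mid = (lo + hi) / 2.0
        mean_risk = sum(predicted_risk(mid, slopes, x) for x in rows) / len(rows)
        if mean_risk < target_rate:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical illustration: model chi-square of 40 on 8 degrees of freedom.
s = heuristic_shrinkage_factor(40.0, 8)
shrunk = shrink_slopes([0.9, -0.4], s)
rows = [[0.0, 1.0], [1.0, 0.0], [2.0, 1.0]]
intercept = refit_intercept(shrunk, rows, target_rate=0.3)
```

Shrinking the slopes pulls predictions toward the average risk, which is why the intercept is re-estimated afterwards so that the mean predicted risk still matches the observed event rate.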

Suggested Citation

  • Ewout W. Steyerberg & Marinus J. C. Eijkemans & Frank E. Harrell Jr & J. Dik F. Habbema, 2001. "Prognostic Modeling with Logistic Regression Analysis," Medical Decision Making, vol. 21(1), pages 45-56, February.
  • Handle: RePEc:sae:medema:v:21:y:2001:i:1:p:45-56
    DOI: 10.1177/0272989X0102100106

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X0102100106
    Download Restriction: no


    References listed on IDEAS

    1. Chris Chatfield, 1995. "Model Uncertainty, Data Mining and Statistical Inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 158(3), pages 419-444, May.

    Citations

    Cited by:

    1. Lisa Cherry & Darren Mollendor & Bill Eisenstein & Terri S. Hogue & Katharyn Peterman & John E. McCray, 2019. "Predicting Parcel-Scale Redevelopment Using Linear and Logistic Regression—the Berkeley Neighborhood Denver, Colorado Case Study," Sustainability, MDPI, vol. 11(7), pages 1-16, March.
    2. Areti Kontogianni & Dimitris Damigos & Michail Skourtos & Christos Tourkolias & Eleanor Denny & Ibon Galarraga & Steffen Kallbekken & Edin Lakić, 2021. "Model Validity and Transferability Informing Behavioral Energy Policies," Energies, MDPI, vol. 14(11), pages 1-20, May.
    3. Haridarshan Patel & Robert J DiDomenico & Katie J Suda & Glen T Schumock & Gregory S Calip & Todd A Lee, 2020. "Risk of cardiac events with azithromycin—A prediction model," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-11, October.
    4. Mohammad Mojtahedi & Sidney Newton & Jason Meding, 2017. "Predicting the resilience of transport infrastructure to a natural disaster using Cox’s proportional hazards regression model," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 85(2), pages 1119-1133, January.
    5. Tri-Long Nguyen & Géraldine Leguelinel-Blache & Jean-Marie Kinowski & Clarisse Roux-Marson & Marion Rougier & Jessica Spence & Yannick Le Manach & Paul Landais, 2017. "Improving medication safety: Development and impact of a multivariate model-based strategy to target high-risk patients," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-13, February.
    6. Phung Khanh Lam & Dong Thi Hoai Tam & Nguyen Minh Dung & Nguyen Thi Hanh Tien & Nguyen Tan Thanh Kieu & Cameron Simmons & Jeremy Farrar & Bridget Wills & Marcel Wolbers, 2015. "A Prognostic Model for Development of Profound Shock among Children Presenting with Dengue Shock Syndrome," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-13, May.
    7. Beatrice Asenso Barnieh & Li Jia & Massimo Menenti & Min Jiang & Jie Zhou & Yelong Zeng & Ali Bennour, 2021. "Modeling the Underlying Drivers of Natural Vegetation Occurrence in West Africa with Binary Logistic Regression Method," Sustainability, MDPI, vol. 13(9), pages 1-37, April.
    8. Luuk Wieske & Esther Witteveen & Camiel Verhamme & Daniela S Dettling-Ihnenfeldt & Marike van der Schaaf & Marcus J Schultz & Ivo N van Schaik & Janneke Horn, 2014. "Early Prediction of Intensive Care Unit–Acquired Weakness Using Easily Available Parameters: A Prospective Observational Study," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-8, October.
    9. Layla Khoja & Maxwell Chipulu & Ranadeva Jayasekera, 2016. "Analysing corporate insolvency in the Gulf Cooperation Council using logistic regression and multidimensional scaling," Review of Quantitative Finance and Accounting, Springer, vol. 46(3), pages 483-518, April.
    10. Fraser S Brown & Stella A Glasmacher & Patrick K A Kearns & Niall MacDougall & David Hunt & Peter Connick & Siddharthan Chandran, 2020. "Systematic review of prediction models in relapsing remitting multiple sclerosis," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-13, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Claudia García-García & Catalina B. García-García & Román Salmerón, 2021. "Confronting collinearity in environmental regression models: evidence from world data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 895-926, September.
    2. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    3. Sai Ding & John Knight, 2011. "Why has China Grown So Fast? The Role of Physical and Human Capital Formation," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 73(2), pages 141-174, April.
    4. Riccardo (Jack) Lucchetti & Luca Pedini, 2020. "ParMA: Parallelised Bayesian Model Averaging for Generalised Linear Models," Working Papers 2020:28, Department of Economics, University of Venice "Ca' Foscari".
    5. Robert Lehmann & Antje Weyh, 2016. "Forecasting Employment in Europe: Are Survey Results Helpful?," Journal of Business Cycle Research, Springer;Centre for International Research on Economic Tendency Surveys (CIRET), vol. 12(1), pages 81-117, September.
    6. Castle Jennifer L. & Doornik Jurgen A & Hendry David F., 2011. "Evaluating Automatic Model Selection," Journal of Time Series Econometrics, De Gruyter, vol. 3(1), pages 1-33, February.
    7. Lee, Yun Shin & Scholtes, Stefan, 2014. "Empirical prediction intervals revisited," International Journal of Forecasting, Elsevier, vol. 30(2), pages 217-234.
    8. Johan Verbeeck & Martin Geroldinger & Konstantin Thiel & Andrew Craig Hooker & Sebastian Ueckert & Mats Karlsson & Arne Cornelius Bathke & Johann Wolfgang Bauer & Geert Molenberghs & Georg Zimmermann, 2023. "How to analyze continuous and discrete repeated measures in small‐sample cross‐over trials?," Biometrics, The International Biometric Society, vol. 79(4), pages 3998-4011, December.
    9. Coleman, Stephen, 2005. "Testing Theories with Qualitative and Quantitative Predictions," MPRA Paper 105171, University Library of Munich, Germany.
    10. Ewout W. Steyerberg, 2005. "Local Applicability of Clinical and Model-Based Probability Estimates," Medical Decision Making, vol. 25(6), pages 678-680, November.
    11. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    12. Brooks, Jeremy S., 2010. "The Buddha mushroom: Conservation behavior and the development of institutions in Bhutan," Ecological Economics, Elsevier, vol. 69(4), pages 779-795, February.
    13. Ebersberger, Bernd & Galia, Fabrice & Laursen, Keld & Salter, Ammon, 2021. "Inbound Open Innovation and Innovation Performance: A Robustness Study," Research Policy, Elsevier, vol. 50(7).
    14. Brian Knaeble & Seth Dutter, 2017. "Reversals of Least-Square Estimates and Model-Invariant Estimation for Directions of Unique Effects," The American Statistician, Taylor & Francis Journals, vol. 71(2), pages 97-105, April.
    15. John Knight & Sai Ding, 2008. "Why has China Grown so Fast? The Role of Structural Change," Economics Series Working Papers 415, University of Oxford, Department of Economics.
    16. Pritularga, Kandrika F. & Svetunkov, Ivan & Kourentzes, Nikolaos, 2021. "Stochastic coherency in forecast reconciliation," International Journal of Production Economics, Elsevier, vol. 240(C).
    17. Steven M. Shugan, 2002. "In Search of Data: An Editorial," Marketing Science, INFORMS, vol. 21(4), pages 369-377.
    18. Fletcher, David & Dillingham, Peter W., 2011. "Model-averaged confidence intervals for factorial experiments," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 3041-3048, November.
    19. Liu, Min & He, Honglin & Ren, Xiaoli & Sun, Xiaomin & Yu, Guirui & Han, Shijie & Wang, Huimin & Zhou, Guoyi, 2015. "The effects of constraining variables on parameter optimization in carbon and water flux modeling over different forest ecosystems," Ecological Modelling, Elsevier, vol. 303(C), pages 30-41.
    20. Fang, Jiali & Jacobsen, Ben & Qin, Yafeng, 2014. "Predictability of the simple technical trading rules: An out-of-sample test," Review of Financial Economics, Elsevier, vol. 23(1), pages 30-45.
