IDEAS home Printed from https://ideas.repec.org/p/boc/dsug05/01.html
   My bibliography  Save this paper

Multivariable regression models with continuous covariates, with a practical emphasis on fractional polynomials and applications in clinical epidemiology

Author

Listed:
  • Patrick Royston

    (MRC Clinical Trials Unit, London)

Abstract

Regression models play a central role in epidemiology and clinical studies. In epidemiology the emphasis is typically either on determining whether a given risk factor affects the outcome of interest (adjusted for confounders), or on estimating a dose/response curve for a given factor, again adjusting for confounders. An important class of clinical studies is the so-called prognostic factors studies, in which the outcome for patients with chronic diseases such as cancer is predicted from various clinical features. In both application areas, it is almost always necessary to build a multivariable model incorporating known or suspected influential variables while eliminating those found to be unimportant. It is commonplace for risk or prognostic factors to be measured on a continuous scale, an obvious example being a person's age. Conventionally, such factors are either modelled as linear functions or are converted into categories according to some chosen set of cut-points. However, categorisation and use of the resulting estimates is a procedure known to be fraught with difficulty. A linear function may fit the data badly and give misleading estimates of risk. Therefore, reliable approaches for representing the effects of continuous factors in multivariable models are urgently needed. Building multivariable regression models by selecting influential covariates and determining the functional form of the relationship between a continuous covariate and the outcome when analysing data from clinical and epidemiological studies is the main concern of this talk. Systematic procedures which combine selection of influential variables with determination of functional form for continuous factors are rare. Analysts may apply their individual subjective preferences for each part of the model-building process, estimate parameters for several models and then decide on the final strategy according to the results they find. By contrast, we will present here the multivariable fractional polynomial (MFP) approach as a systematic way to determine a multivariable regression model. The MFP approach was made generally available to Stata users in version 8 as the -mfp- command. Major concerns will be discussed, including robustness and possible model instability. Regarding determination of the functional form, we will also discuss some alternatives with more emphasis on local estimation of the function (e.g. splines). The MFP procedure may be used for various types of regression models (linear regression model, logistic model, Cox model, and many more). Examples with real data will be used as illustrations.

Suggested Citation

  • Patrick Royston, 2005. "Multivariable regression models with continuous covariates, with a practical emphasis on fractional polynomials and applications in clinical epidemiology," German Stata Users' Group Meetings 2005 01, Stata Users Group.
  • Handle: RePEc:boc:dsug05:01
    as

    Download full text from publisher

    File URL: http://repec.org/dsug2005/royston_berlin.ppt
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhongheng Zhang & Hongying Ni, 2015. "Prediction Model for Critically Ill Patients with Acute Respiratory Distress Syndrome," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-14, March.
    2. Thomas A. Gerds & Martin Schumacher, 2007. "Efron-Type Measures of Prediction Error for Survival Analysis," Biometrics, The International Biometric Society, vol. 63(4), pages 1283-1287, December.
    3. Sauerbrei, W. & Meier-Hirmer, C. & Benner, A. & Royston, P., 2006. "Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs," Computational Statistics & Data Analysis, Elsevier, vol. 50(12), pages 3464-3485, August.
    4. Sauerbrei, Willi & Royston, Patrick & Zapien, Karina, 2007. "Detecting an interaction between treatment and a continuous covariate: A comparison of two approaches," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 4054-4063, May.
    5. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    6. Nahila Justo & Jonas Nilsson & Beata Korytowsky & Johan Dalen & Terri Madison & Alistair McGuire, 2020. "Retrospective observational cohort study on innovation in oncology and progress in survival: How far have we gotten in the two decades of treating patients with advanced non-small cell lung cancer as ," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-12, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:boc:dsug05:01. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F Baum (email available below). General contact details of provider: https://edirc.repec.org/data/stataea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.