IDEAS home Printed from
   My bibliography  Save this paper

Homoskedastic adjustment inflation factors in model selection


  • Roger Newson

    () (National Heart and Lung Institute, Imperial College London)


Insufficient confounder adjustment is viewed as a common source of "false discoveries", especially in the epidemiology sector. However, adjustment for "confounders" that are correlated with the exposure, but which do not independently predict the outcome, may cause loss of power to detect the exposure effect. On the other hand, choosing confounders based on "stepwise" methods is subject to many hazards, which imply that the confidence interval eventually published is likely not to have the advertized coverage probability for the effect that we wanted to know. We would like to be able to find a model in the data on exposures and confounders, and then to estimate the parameters of that model from the conditional distribution of the outcome, given the exposures and confounders. The haif package, downloadable from SSC, calculates the homoskedastic adjustment inflation factors (HAIFs), by which the variances and standard errors of coeffcients for a matrix of X-variables are scaled (or inflated), if a matrix of unnecessary confounders A is also included in a regression model, assuming equal variances (homoskedasticity). These can be calculated from the A- and X-variables alone, and can be used to inform the choice of a set of models eventually fitted to the outcome data, together with the usual criteria involving causality and prior opinion. Examples are given of the use of HAIFs and their ratios.

Suggested Citation

  • Roger Newson, 2009. "Homoskedastic adjustment inflation factors in model selection," United Kingdom Stata Users' Group Meetings 2009 15, Stata Users Group.
  • Handle: RePEc:boc:usug09:15

    Download full text from publisher

    File URL:
    File Function: presentation slides
    Download Restriction: no

    References listed on IDEAS

    1. Roger Newson, 2006. "Confidence intervals for rank statistics: Somers' D and extensions," Stata Journal, StataCorp LP, vol. 6(3), pages 309-334, September.
    Full references (including those not matched with items on IDEAS)


    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

    Cited by:

    1. Roger Newson, 2016. "The role of Somers's D in propensity modeling," United Kingdom Stata Users' Group Meetings 2016 01, Stata Users Group.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ichiro Kunitsugu & Masayuki Okuda & Shinichi Sugiyama & Norikazu Yoshitake & Yukio Tanizawa & Satoshi Sasaki & Tatsuya Hobara, 2012. "Meat intake frequency and anemia in Japanese children and adolescents," Nursing & Health Sciences, John Wiley & Sons, vol. 14(2), pages 197-203, June.
    2. Roger Newson, 2016. "The role of Somers's D in propensity modeling," United Kingdom Stata Users' Group Meetings 2016 01, Stata Users Group.
    3. Orth, Walter, 2012. "The predictive accuracy of credit ratings: Measurement and statistical inference," International Journal of Forecasting, Elsevier, vol. 28(1), pages 288-296.
    4. Marszalec, Daniel, 2018. "Fear not the simplicity - An experimental analysis of auctions for complements," Journal of Economic Behavior & Organization, Elsevier, vol. 152(C), pages 81-97.
    5. Jeannette Brosig‐Koch & Heike Hennig‐Schmidt & Nadja Kairies‐Schwarz & Daniel Wiesen, 2017. "The Effects of Introducing Mixed Payment Systems for Physicians: Experimental Evidence," Health Economics, John Wiley & Sons, Ltd., vol. 26(2), pages 243-262, February.
    6. Löschel, Andreas & Sturm, Bodo & Uehleke, Reinhard, 2013. "Revealed preferences for climate protection when the purely individual perspective is relaxed: Evidence from a framed field experiment," ZEW Discussion Papers 13-006, ZEW - Leibniz Centre for European Economic Research.
    7. Roger Newson, 2019. "Bland–Altman plots, rank parameters, and calibration ridit splines," London Stata Conference 2019 01, Stata Users Group.
    8. Dirk Tasche, 2009. "Estimating discriminatory power and PD curves when the number of defaults is small," Papers 0905.3928,, revised Mar 2010.
    9. Orth, Walter, 2010. "The predictive accuracy of credit ratings: Measurement and statistical inference," MPRA Paper 30148, University Library of Munich, Germany, revised 16 Feb 2011.
    10. Roger Newson, 2014. "Easy-to-use packages for estimating rank and spline parameters," United Kingdom Stata Users' Group Meetings 2014 01, Stata Users Group.
    11. Navarro, Noemí & Veszteg, Róbert F., 2020. "On the empirical validity of axioms in unstructured bargaining," Games and Economic Behavior, Elsevier, vol. 121(C), pages 117-145.
    12. Hanna Karolina Szymborska, 2018. "Household wealth structures and position in the income distribution – econometric analysis for the USA, 1989-2013," Working Papers PKWP1806, Post Keynesian Economics Society (PKES).
    13. Roger Newson, 2006. "On the central role of Somers' D," United Kingdom Stata Users' Group Meetings 2006 01, Stata Users Group.
    14. Ahdesmäki Miika & Lancashire Lee & Proutski Vitali & Wilson Claire & Davison Timothy S. & Harkin D. Paul & Kennedy Richard D., 2013. "Model selection for prognostic time-to-event gene signature discovery with applications in early breast cancer data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(5), pages 619-635, October.
    15. Roger Newson, 2015. "Somers' D: A common currency for associations," United Kingdom Stata Users' Group Meetings 2015 01, Stata Users Group.
    16. Graevenitz, Georg von & Weber, Richard, 2011. "How to Educate Entrepreneurs?," Discussion Papers in Business Administration 12280, University of Munich, Munich School of Management.
    17. Löschel, Andreas & Sturm, Bodo & Uehleke, Reinhard, 2017. "Revealed preferences for voluntary climate change mitigation when the purely individual perspective is relaxed – evidence from a framed field experiment," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 67(C), pages 149-160.
    18. Alena MINÃ ROVÃ, 2012. "Evaluation Of Dependence Of Occurrence Of Risk Events In Logistics On Risk Factors By Means Of Somers' D Coefficient," Journal of Applied Economic Sciences, Spiru Haret University, Faculty of Financial Management and Accounting Craiova, vol. 7(1(19)/ Sp), pages 73-86.
    19. Peter, Eckley, 2015. "Measuring economic uncertainty using news-media textual data," MPRA Paper 64874, University Library of Munich, Germany, revised 01 May 2015.
    20. Stephen F Weng & Jenna Reps & Joe Kai & Jonathan M Garibaldi & Nadeem Qureshi, 2017. "Can machine-learning improve cardiovascular risk prediction using routine clinical data?," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-14, April.

    More about this item


    Access and download statistics


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:boc:usug09:15. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Christopher F Baum). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.