
A note on bias due to fitting prospective multivariate generalized linear models to categorical outcomes ignoring retrospective sampling schemes

Author

Listed:
  • Mukherjee, Bhramar
  • Liu, Ivy

Abstract

Outcome-dependent sampling designs are commonly used in economics, market research and epidemiological studies. The case-control design is a classic example of outcome-dependent sampling, in which exposure information is collected on subjects conditional on their disease status. In many situations, the outcome under consideration has multiple categories rather than a simple dichotomy. For example, in a case-control study, the "cases" may be sub-classified by progression of the disease, or by other histological and morphological characteristics of the disease. In this note, we investigate the issue of fitting prospective multivariate generalized linear models to such multiple-category outcome data while ignoring the retrospective nature of the sampling design. We first provide a set of necessary and sufficient conditions on the link functions that allow for equivalence of prospective and retrospective inference for the parameters of interest. We show that for categorical outcomes, prospective-retrospective equivalence does not hold beyond the generalized multinomial logit link. We then derive an approximate expression for the bias incurred when link functions outside this class are used. Most popular models for ordinal response fall outside the multiplicative intercept class, so one should be cautious when performing a naive prospective analysis of such data, as the bias could be substantial. We illustrate the extent of the bias through a real data example based on the ongoing Prostate, Lung, Colorectal and Ovarian (PLCO) cancer screening trial conducted by the National Cancer Institute. Simulations based on this study illustrate that the bias approximations work well in practice.
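
The contrast the abstract draws can be illustrated with a small numerical sketch. This is not the authors' derivation or their bias approximation. It simply applies Bayes' rule to two hypothetical three-category models, assuming that category k is sampled with a fixed fraction s_k that does not depend on the covariate. All parameter values, sampling fractions and function names are made up for illustration, and the cumulative probit model stands in as one example of an ordinal model outside the multinomial logit family.

```python
import numpy as np
from statistics import NormalDist

PHI = NormalDist()  # standard normal, used for the probit link

def multinomial_logit_probs(x, alphas, betas):
    """P(Y = k | x) under a multinomial logit model, category 0 as baseline."""
    eta = np.concatenate(([0.0], alphas + betas * x))
    w = np.exp(eta - eta.max())
    return w / w.sum()

def cumulative_probit_probs(x, cutpoints, beta):
    """P(Y = k | x) when P(Y <= j | x) = Phi(cutpoint_j - beta * x)."""
    cum = [PHI.cdf(c - beta * x) for c in cutpoints] + [1.0]
    return np.diff(np.concatenate(([0.0], cum)))

def sampled_probs(probs, fractions):
    """P(Y = k | x, sampled), proportional to P(Y = k | x) * s_k."""
    w = probs * fractions
    return w / w.sum()

xs = [0.0, 1.0, 2.0]                    # covariate grid with unit spacing
fractions = np.array([0.02, 0.5, 1.0])  # hypothetical s_0, s_1, s_2

# Multinomial logit: log-odds against the baseline in the sampled data.
alphas, betas = np.array([-2.0, -3.0]), np.array([0.5, 1.0])
q = np.array([sampled_probs(multinomial_logit_probs(x, alphas, betas), fractions)
              for x in xs])
log_odds = np.log(q[:, 1:] / q[:, [0]])
logit_slopes = np.diff(log_odds, axis=0)  # one row per unit step in x

# Cumulative probit: probit-scale cumulative probabilities in the sampled data.
cutpoints, beta = [1.0, 2.0], 0.5
q = np.array([sampled_probs(cumulative_probit_probs(x, cutpoints, beta), fractions)
              for x in xs])
z = np.vectorize(PHI.inv_cdf)(np.cumsum(q, axis=1)[:, :-1])
probit_slopes = -np.diff(z, axis=0)       # would equal beta without selection

print("multinomial logit slopes under sampling:\n", logit_slopes)
print("cumulative probit implied slopes under sampling:\n", probit_slopes)
```

Under these assumptions the sampled-data log-odds keep the population slopes exactly, and only the intercepts shift by log(s_k / s_0). The probit-scale slopes, in contrast, differ from the population value of beta and vary across both the covariate grid and the cutpoints, so no single cumulative probit model with the original slope describes the sampled data.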

Suggested Citation

  • Mukherjee, Bhramar & Liu, Ivy, 2009. "A note on bias due to fitting prospective multivariate generalized linear models to categorical outcomes ignoring retrospective sampling schemes," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 459-472, March.
  • Handle: RePEc:eee:jmvana:v:100:y:2009:i:3:p:459-472

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047-259X(08)00149-8
    Download Restriction: Full text for ScienceDirect subscribers only


