IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v64y2008i3p685-694.html
   My bibliography  Save this article

Exploiting Gene‐Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes‐Type Shrinkage Estimator to Trade‐Off between Bias and Efficiency

Author

Listed:
  • Bhramar Mukherjee
  • Nilanjan Chatterjee

Abstract

Summary Standard prospective logistic regression analysis of case–control data often leads to very imprecise estimates of gene‐environment interactions due to small numbers of cases or controls in cells of crossing genotype and exposure. In contrast, under the assumption of gene‐environment independence, modern “retrospective” methods, including the “case‐only” approach, can estimate the interaction parameters much more precisely, but they can be seriously biased when the underlying assumption of gene‐environment independence is violated. In this article, we propose a novel empirical Bayes‐type shrinkage estimator to analyze case–control data that can relax the gene‐environment independence assumption in a data‐adaptive fashion. In the special case, involving a binary gene and a binary exposure, the method leads to an estimator of the interaction log odds ratio parameter in a simple closed form that corresponds to an weighted average of the standard case‐only and case–control estimators. We also describe a general approach for deriving the new shrinkage estimator and its variance within the retrospective maximum‐likelihood framework developed by Chatterjee and Carroll (2005, Biometrika92, 399–418). Both simulated and real data examples suggest that the proposed estimator strikes a balance between bias and efficiency depending on the true nature of the gene‐environment association and the sample size for a given study.

Suggested Citation

  • Bhramar Mukherjee & Nilanjan Chatterjee, 2008. "Exploiting Gene‐Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes‐Type Shrinkage Estimator to Trade‐Off between Bias and Efficiency," Biometrics, The International Biometric Society, vol. 64(3), pages 685-694, September.
  • Handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:685-694
    DOI: 10.1111/j.1541-0420.2007.00953.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1541-0420.2007.00953.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1541-0420.2007.00953.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nilanjan Chatterjee & Raymond J. Carroll, 2005. "Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies," Biometrika, Biometrika Trust, vol. 92(2), pages 399-418, June.
    2. Lin, D.Y. & Zeng, D., 2006. "Likelihood-Based Inference on Haplotype Effects in Genetic Association Studies," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 89-104, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jinbo Chen & Dongyu Lin & Hagit Hochner, 2012. "Semiparametric Maximum Likelihood Methods for Analyzing Genetic and Environmental Effects with Case-Control Mother–Child Pair Data," Biometrics, The International Biometric Society, vol. 68(3), pages 869-877, September.
    2. Tianying Wang & Alex Asher, 2021. "Improved Semiparametric Analysis of Polygenic Gene–Environment Interactions in Case–Control Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 386-401, December.
    3. Bhramar Mukherjee & Jaeil Ahn & Stephen B. Gruber & Malay Ghosh & Nilanjan Chatterjee, 2010. "Case–Control Studies of Gene–Environment Interaction: Bayesian Design and Analysis," Biometrics, The International Biometric Society, vol. 66(3), pages 934-948, September.
    4. Brisa N. Sánchez & Shan Kang & Bhramar Mukherjee, 2012. "A Latent Variable Approach to Study Gene–Environment Interactions in the Presence of Multiple Correlated Exposures," Biometrics, The International Biometric Society, vol. 68(2), pages 466-476, June.
    5. Yuan Zhang & Shili Lin & Swati Biswas, 2017. "Detecting rare and common haplotype–environment interaction under uncertainty of gene–environment independence assumption," Biometrics, The International Biometric Society, vol. 73(1), pages 344-355, March.
    6. Jason P. Estes & Bhramar Mukherjee & Jeremy M. G. Taylor, 2018. "Empirical Bayes Estimation and Prediction Using Summary-Level Information From External Big Data Sources Adjusting for Violations of Transportability," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 10(3), pages 568-586, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jinbo Chen & Dongyu Lin & Hagit Hochner, 2012. "Semiparametric Maximum Likelihood Methods for Analyzing Genetic and Environmental Effects with Case-Control Mother–Child Pair Data," Biometrics, The International Biometric Society, vol. 68(3), pages 869-877, September.
    2. Yulia V. Marchenko & Raymond K. Carroll & Danyu Y. Lin & Christopher I. Amos & Roberto G. Gutierrez, 2008. "Semiparametric analysis of case–control genetic data in the presence of environmental factors," Stata Journal, StataCorp LP, vol. 8(3), pages 305-333, September.
    3. Hua Yun Chen & Daniel E. Rader & Mingyao Li, 2015. "Likelihood Inferences on Semiparametric Odds Ratio Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1125-1135, September.
    4. Jinbo Chen & Carmen Rodriguez, 2007. "Conditional Likelihood Methods for Haplotype-Based Association Analysis Using Matched Case–Control Data," Biometrics, The International Biometric Society, vol. 63(4), pages 1099-1107, December.
    5. Brisa N. Sánchez & Shan Kang & Bhramar Mukherjee, 2012. "A Latent Variable Approach to Study Gene–Environment Interactions in the Presence of Multiple Correlated Exposures," Biometrics, The International Biometric Society, vol. 68(2), pages 466-476, June.
    6. Iryna Lobach & Raymond J. Carroll & Christine Spinka & Mitchell H. Gail & Nilanjan Chatterjee, 2008. "Haplotype‐Based Regression Analysis and Inference of Case–Control Studies with Unphased Genotypes and Measurement Errors in Environmental Exposures," Biometrics, The International Biometric Society, vol. 64(3), pages 673-684, September.
    7. Bhramar Mukherjee & Jaeil Ahn & Stephen B. Gruber & Malay Ghosh & Nilanjan Chatterjee, 2010. "Case–Control Studies of Gene–Environment Interaction: Bayesian Design and Analysis," Biometrics, The International Biometric Society, vol. 66(3), pages 934-948, September.
    8. Summer S. Han & Philip S. Rosenberg & Nilanjan Chatterjee, 2012. "Testing for Gene--Environment and Gene--Gene Interactions Under Monotonicity Constraints," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1441-1452, December.
    9. Wu Song & Yang Jie & Wu Rongling, 2010. "Mapping Quantitative Trait Loci in a Non-Equilibrium Population," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-21, August.
    10. French Benjamin & Lumley Thomas & Cappola Thomas P. & Mitra Nandita, 2012. "Non-Iterative, Regression-Based Estimation of Haplotype Associations with Censored Survival Outcomes," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-24, February.
    11. Colin O. Wu & Gang Zheng & Minjung Kwak, 2013. "A Joint Regression Analysis for Genetic Association Studies with Outcome Stratified Samples," Biometrics, The International Biometric Society, vol. 69(2), pages 417-426, June.
    12. Bhramar Mukherjee & Li Zhang & Malay Ghosh & Samiran Sinha, 2007. "Semiparametric Bayesian Analysis of Case–Control Data under Conditional Gene-Environment Independence," Biometrics, The International Biometric Society, vol. 63(3), pages 834-844, September.
    13. Tina Tsz-Ting Chui & Wen-Chung Lee, 2014. "Estimating Risks and Relative Risks in Case-Base Studies under the Assumptions of Gene-Environment Independence and Hardy-Weinberg Equilibrium," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-5, August.
    14. Eric J. Tchetgen Tchetgen & James Robins, 2010. "The Semiparametric Case-Only Estimator," Biometrics, The International Biometric Society, vol. 66(4), pages 1138-1144, December.
    15. Summer S. Han & Philip S. Rosenberg & Arpita Ghosh & Maria Teresa Landi & Neil E. Caporaso & Nilanjan Chatterjee, 2015. "An exposure‐weighted score test for genetic associations integrating environmental risk factors," Biometrics, The International Biometric Society, vol. 71(3), pages 596-605, September.
    16. Sinha Samiran & Gruber Stephen B & Mukherjee Bhramar & Rennert Gad, 2008. "Inference of the Haplotype Effect in a Matched Case-Control Study Using Unphased Genotype Data," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-26, May.
    17. X. Li & B. N. Thomas & S. M. Rich & D. Ecker & J. K. Tumwine & A. S. Foulkes, 2009. "Estimating and testing haplotype–trait associations in non‐diploid populations," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 58(5), pages 663-678, December.
    18. Gustafson Paul, 2010. "Bayesian Inference for Partially Identified Models," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-20, March.
    19. Liang, Liang & Ma, Yanyuan & Carroll, Raymond J., 2019. "A semiparametric efficient estimator in case-control studies for gene–environment independent models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 38-50.
    20. Tianying Wang & Alex Asher, 2021. "Improved Semiparametric Analysis of Polygenic Gene–Environment Interactions in Case–Control Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 386-401, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:685-694. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.