IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v64y2008i3p673-684.html
   My bibliography  Save this article

Haplotype‐Based Regression Analysis and Inference of Case–Control Studies with Unphased Genotypes and Measurement Errors in Environmental Exposures

Author

Listed:
  • Iryna Lobach
  • Raymond J. Carroll
  • Christine Spinka
  • Mitchell H. Gail
  • Nilanjan Chatterjee

Abstract

Summary It is widely believed that risks of many complex diseases are determined by genetic susceptibilities, environmental exposures, and their interaction. Chatterjee and Carroll (2005, Biometrika92, 399–418) developed an efficient retrospective maximum‐likelihood method for analysis of case–control studies that exploits an assumption of gene–environment independence and leaves the distribution of the environmental covariates to be completely nonparametric. Spinka, Carroll, and Chatterjee (2005, Genetic Epidemiology29, 108–127) extended this approach to studies where certain types of genetic information, such as haplotype phases, may be missing on some subjects. We further extend this approach to situations when some of the environmental exposures are measured with error. Using a polychotomous logistic regression model, we allow disease status to have K+ 1 levels. We propose use of a pseudolikelihood and a related EM algorithm for parameter estimation. We prove consistency and derive the resulting asymptotic covariance matrix of parameter estimates when the variance of the measurement error is known and when it is estimated using replications. Inferences with measurement error corrections are complicated by the fact that the Wald test often behaves poorly in the presence of large amounts of measurement error. The likelihood‐ratio (LR) techniques are known to be a good alternative. However, the LR tests are not technically correct in this setting because the likelihood function is based on an incorrect model, i.e., a prospective model in a retrospective sampling scheme. We corrected standard asymptotic results to account for the fact that the LR test is based on a likelihood‐type function. The performance of the proposed method is illustrated using simulation studies emphasizing the case when genetic information is in the form of haplotypes and missing data arises from haplotype‐phase ambiguity. An application of our method is illustrated using a population‐based case–control study of the association between calcium intake and the risk of colorectal adenoma.

Suggested Citation

  • Iryna Lobach & Raymond J. Carroll & Christine Spinka & Mitchell H. Gail & Nilanjan Chatterjee, 2008. "Haplotype‐Based Regression Analysis and Inference of Case–Control Studies with Unphased Genotypes and Measurement Errors in Environmental Exposures," Biometrics, The International Biometric Society, vol. 64(3), pages 673-684, September.
  • Handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:673-684
    DOI: 10.1111/j.1541-0420.2007.00930.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1541-0420.2007.00930.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1541-0420.2007.00930.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Daowen Zhang & Marie Davidian, 2001. "Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data," Biometrics, The International Biometric Society, vol. 57(3), pages 795-802, September.
    2. Lin, D.Y. & Zeng, D., 2006. "Likelihood-Based Inference on Haplotype Effects in Genetic Association Studies," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 89-104, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yanyuan Ma & Raymond J. Carroll, 2016. "Semiparametric estimation in the secondary analysis of case–control studies," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 127-151, January.
    2. Jun Zhang & Zhenghui Feng & Peirong Xu & Hua Liang, 2017. "Generalized varying coefficient partially linear measurement errors models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 97-120, February.
    3. Tianying Wang & Alex Asher, 2021. "Improved Semiparametric Analysis of Polygenic Gene–Environment Interactions in Case–Control Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 386-401, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fabienne Comte & Adeline Samson, 2012. "Nonparametric estimation of random-effects densities in linear mixed-effects model," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(4), pages 951-975, December.
    2. Jinbo Chen & Dongyu Lin & Hagit Hochner, 2012. "Semiparametric Maximum Likelihood Methods for Analyzing Genetic and Environmental Effects with Case-Control Mother–Child Pair Data," Biometrics, The International Biometric Society, vol. 68(3), pages 869-877, September.
    3. Peng Zhang & Peter X.-K. Song & Annie Qu & Tom Greene, 2008. "Efficient Estimation for Patient-Specific Rates of Disease Progression Using Nonnormal Linear Mixed Models," Biometrics, The International Biometric Society, vol. 64(1), pages 29-38, March.
    4. M. Teimourian & T. Baghfalaki & M. Ganjali & D. Berridge, 2015. "Joint modeling of mixed skewed continuous and ordinal longitudinal responses: a Bayesian approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(10), pages 2233-2256, October.
    5. Jinbo Chen & Carmen Rodriguez, 2007. "Conditional Likelihood Methods for Haplotype-Based Association Analysis Using Matched Case–Control Data," Biometrics, The International Biometric Society, vol. 63(4), pages 1099-1107, December.
    6. Ye, Rendao & Wang, Tonghui & Gupta, Arjun K., 2014. "Distribution of matrix quadratic forms under skew-normal settings," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 229-239.
    7. Lourdes Montenegro & Víctor Lachos & Heleno Bolfarine, 2010. "Inference for a skew extension of the Grubbs model," Statistical Papers, Springer, vol. 51(3), pages 701-715, September.
    8. Manuel Arellano & Stéphane Bonhomme, 2012. "Identifying Distributional Characteristics in Random Coefficients Panel Data Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 987-1020.
    9. Francis K. C. Hui & Samuel Müller & Alan H. Welsh, 2021. "Random Effects Misspecification Can Have Severe Consequences for Random Effects Inference in Linear Mixed Models," International Statistical Review, International Statistical Institute, vol. 89(1), pages 186-206, April.
    10. Warrington Nicole M. & Tilling Kate & Howe Laura D. & Paternoster Lavinia & Pennell Craig E. & Wu Yan Yan & Briollais Laurent, 2014. "Robustness of the linear mixed effects model to error distribution assumptions and the consequences for genome-wide association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(5), pages 1-21, October.
    11. Roger Tovar-Falón & Guillermo Martínez-Flórez & Heleno Bolfarine, 2022. "Modelling Asymmetric Data by Using the Log-Gamma-Normal Regression Model," Mathematics, MDPI, vol. 10(7), pages 1-16, April.
    12. Daniel McNeish & Jeffrey R. Harring & Denis Dumas, 2023. "A multilevel structured latent curve model for disaggregating student and school contributions to learning," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(2), pages 545-575, June.
    13. Jacqmin-Gadda, Helene & Sibillot, Solenne & Proust, Cecile & Molina, Jean-Michel & Thiebaut, Rodolphe, 2007. "Robustness of the linear mixed model to misspecified error distribution," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 5142-5154, June.
    14. Staudenmayer, John & Ruppert, David & Buonaccorsi, John P., 2008. "Density Estimation in the Presence of Heteroscedastic Measurement Error," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 726-736, June.
    15. Li, Erning & Pourahmadi, Mohsen, 2013. "An alternative REML estimation of covariance matrices in linear mixed models," Statistics & Probability Letters, Elsevier, vol. 83(4), pages 1071-1077.
    16. Brent A Coull, 2011. "A Random Intercepts–Functional Slopes Model for Flexible Assessment of Susceptibility in Longitudinal Designs," Biometrics, The International Biometric Society, vol. 67(2), pages 486-494, June.
    17. Wu Song & Yang Jie & Wu Rongling, 2010. "Mapping Quantitative Trait Loci in a Non-Equilibrium Population," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-21, August.
    18. F. Kahrari & C. S. Ferreira & R. B. Arellano-Valle, 2019. "Skew-Normal-Cauchy Linear Mixed Models," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 185-202, December.
    19. Kheradmandi, Ameneh & Rasekh, Abdolrahman, 2015. "Estimation in skew-normal linear mixed measurement error models," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 1-11.
    20. French Benjamin & Lumley Thomas & Cappola Thomas P. & Mitra Nandita, 2012. "Non-Iterative, Regression-Based Estimation of Haplotype Associations with Censored Survival Outcomes," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-24, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:673-684. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.