IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v16y2017i3p173-187n1.html
   My bibliography  Save this article

Mixture model-based association analysis with case-control data in genome wide association studies

Author

Listed:
  • Ali Fadhaa

    (Department of Statistics, College of Administration and Economics, University of Baghdad, Baghdad, Iraq)

  • Zhang Jian

    (School of Mathematics, Statistics and Actuarial Science, University of Kent, Canterbury, Kent CT2 7NF, UK)

Abstract

Multilocus haplotype analysis of candidate variants with genome wide association studies (GWAS) data may provide evidence of association with disease, even when the individual loci themselves do not. Unfortunately, when a large number of candidate variants are investigated, identifying risk haplotypes can be very difficult. To meet the challenge, a number of approaches have been put forward in recent years. However, most of them are not directly linked to the disease-penetrances of haplotypes and thus may not be efficient. To fill this gap, we propose a mixture model-based approach for detecting risk haplotypes. Under the mixture model, haplotypes are clustered directly according to their estimated disease penetrances. A theoretical justification of the above model is provided. Furthermore, we introduce a hypothesis test for haplotype inheritance patterns which underpin this model. The performance of the proposed approach is evaluated by simulations and real data analysis. The results show that the proposed approach outperforms an existing multiple testing method.

Suggested Citation

  • Ali Fadhaa & Zhang Jian, 2017. "Mixture model-based association analysis with case-control data in genome wide association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(3), pages 173-187, August.
  • Handle: RePEc:bpj:sagmbi:v:16:y:2017:i:3:p:173-187:n:1
    DOI: 10.1515/sagmb-2016-0022
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2016-0022
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/sagmb-2016-0022?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Alan Agresti, 1999. "On Logit Confidence Intervals for the Odds Ratio with Small Samples," Biometrics, The International Biometric Society, vol. 55(2), pages 597-602, June.
    2. Karlis, Dimitris & Xekalaki, Evdokia, 2003. "Choosing initial values for the EM algorithm for finite mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 577-590, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Adrian O’Hagan & Arthur White, 2019. "Improved model-based clustering performance using Bayesian initialization averaging," Computational Statistics, Springer, vol. 34(1), pages 201-231, March.
    2. Arielle Marks‐Anglin & Chongliang Luo & Jin Piao & Mary Beth Connolly Gibbons & Christopher H. Schmid & Jing Ning & Yong Chen, 2022. "EMBRACE: An EM‐based bias reduction approach through Copas‐model estimation for quantifying the evidence of selective publishing in network meta‐analysis," Biometrics, The International Biometric Society, vol. 78(2), pages 754-765, June.
    3. Weber, Frank & Knapp, Guido & Ickstadt, Katja & Kundt, Günther & Glass, Anne, 2020. "Zero-cell corrections in random-effects meta-analyses," OSF Preprints qjh5f, Center for Open Science.
    4. Pietro Coretto & Christian Hennig, 2010. "A simulation study to compare robust clustering methods based on mixtures," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(2), pages 111-135, September.
    5. Hung Tong & Cristina Tortora, 2022. "Model-based clustering and outlier detection with missing data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(1), pages 5-30, March.
    6. Salvatore Ingrassia & Antonio Punzo & Giorgio Vittadini & Simona Minotti, 2015. "Erratum to: The Generalized Linear Mixed Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 32(2), pages 327-355, July.
    7. Paolo Berta & Salvatore Ingrassia & Antonio Punzo & Giorgio Vittadini, 2016. "Multilevel cluster-weighted models for the evaluation of hospitals," METRON, Springer;Sapienza Università di Roma, vol. 74(3), pages 275-292, December.
    8. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    9. Agresti, Alan & Gottard, Anna, 2007. "Nonconservative exact small-sample inference for discrete data," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6447-6458, August.
    10. repec:jss:jstsof:28:i04 is not listed on IDEAS
    11. Saif Eddin Jabari & Nikolaos M. Freris & Deepthi Mary Dilip, 2020. "Sparse Travel Time Estimation from Streaming Data," Transportation Science, INFORMS, vol. 54(1), pages 1-20, January.
    12. Mahdi Teimouri & Saralees Nadarajah, 2022. "Maximum Likelihood Estimation for the Asymmetric Exponential Power Distribution," Computational Economics, Springer;Society for Computational Economics, vol. 60(2), pages 665-692, August.
    13. Papastamoulis, Panagiotis & Martin-Magniette, Marie-Laure & Maugis-Rabusseau, Cathy, 2016. "On the estimation of mixtures of Poisson regression models with large number of components," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 97-106.
    14. Derek S. Young & Xi Chen & Dilrukshi C. Hewage & Ricardo Nilo-Poyanco, 2019. "Finite mixture-of-gamma distributions: estimation, inference, and model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1053-1082, December.
    15. Kerekes, Monika, 2012. "Growth miracles and failures in a Markov switching classification model of growth," Journal of Development Economics, Elsevier, vol. 98(2), pages 167-177.
    16. Salvatore Ingrassia & Antonio Punzo, 2020. "Cluster Validation for Mixtures of Regressions via the Total Sum of Squares Decomposition," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 526-547, July.
    17. Masahiro Kuroda & Zhi Geng & Michio Sakakihara, 2015. "Improving the vector $$\varepsilon $$ ε acceleration for the EM algorithm using a re-starting procedure," Computational Statistics, Springer, vol. 30(4), pages 1051-1077, December.
    18. Nicola Loperfido, 2019. "Finite mixtures, projection pursuit and tensor rank: a triangulation," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 145-173, March.
    19. Morten W. Fagerland, 2012. "Exact and mid-p confidence intervals for the odds ratio," Stata Journal, StataCorp LP, vol. 12(3), pages 505-524, September.
    20. Kerekes, Monika, 2009. "Growth miracles and failures in a Markov switching classification model of growth," Discussion Papers 2009/11, Free University Berlin, School of Business & Economics.
    21. Andrews, Jeffrey L., 2018. "Addressing overfitting and underfitting in Gaussian model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 160-171.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:16:y:2017:i:3:p:173-187:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.