IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v11y2012i4n12.html
   My bibliography  Save this article

How to analyze many contingency tables simultaneously in genetic association studies

Author

Listed:
  • Dickhaus Thorsten

    (Humboldt-University, Berlin)

  • Straßburger Klaus

    (German Diabetes Center, Düsseldorf)

  • Schunk Daniel

    (Johannes Gutenberg-Universität Mainz and University of Zurich)

  • Morcillo-Suarez Carlos

    (Universitat Pompeu Fabra, Barcelona)

  • Illig Thomas

    (Helmholtz Zentrum München)

  • Navarro Arcadi

    (ICREA and Universitat Pompeu Fabra, Barcelona)

Abstract

We study exact tests for (2 x 2) and (2 x 3) contingency tables, in particular exact chi-squared tests and exact tests of Fisher type. In practice, these tests are typically carried out without randomization, leading to reproducible results but not exhausting the significance level. We discuss that this can lead to methodological and practical issues in a multiple testing framework when many tables are simultaneously under consideration as in genetic association studies.Realized randomized p-values are proposed as a solution which is especially useful for data-adaptive (plug-in) procedures. These p-values allow to estimate the proportion of true null hypotheses much more accurately than their non-randomized counterparts. Moreover, we address the problem of positively correlated p-values for association by considering techniques to reduce multiplicity by estimating the "effective number of tests" from the correlation structure.An algorithm is provided that bundles all these aspects, efficient computer implementations are made available, a small-scale simulation study is presented and two real data examples are shown.

Suggested Citation

  • Dickhaus Thorsten & Straßburger Klaus & Schunk Daniel & Morcillo-Suarez Carlos & Illig Thomas & Navarro Arcadi, 2012. "How to analyze many contingency tables simultaneously in genetic association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-33, July.
  • Handle: RePEc:bpj:sagmbi:v:11:y:2012:i:4:n:12
    DOI: 10.1515/1544-6115.1776
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/1544-6115.1776
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/1544-6115.1776?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Laurent Barras & Olivier Scaillet & Russ Wermers, 2010. "False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas," Journal of Finance, American Finance Association, vol. 65(1), pages 179-216, February.
    2. Helmut Finner & Veronika Gontscharuk, 2009. "Controlling the familywise error rate with plug‐in estimator for the proportion of true null hypotheses," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 1031-1048, November.
    3. John D. Storey & Jonathan E. Taylor & David Siegmund, 2004. "Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(1), pages 187-205, February.
    4. Mette Langaas & Bo Henry Lindqvist & Egil Ferkingstad, 2005. "Estimating the proportion of true null hypotheses, with application to DNA microarray data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(4), pages 555-572, September.
    5. Bryan N Howie & Peter Donnelly & Jonathan Marchini, 2009. "A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies," PLOS Genetics, Public Library of Science, vol. 5(6), pages 1-15, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marina Bogomolov & Ruth Heller, 2018. "Assessing replicability of findings across two studies of multiple features," Biometrika, Biometrika Trust, vol. 105(3), pages 505-516.
    2. Stange, Jens & Dickhaus, Thorsten & Navarro, Arcadi & Schunk, Daniel, 2016. "Multiplicity- and dependency-adjusted p-values for control of the family-wise error rate," Statistics & Probability Letters, Elsevier, vol. 111(C), pages 32-40.
    3. Dickhaus Thorsten, 2015. "Simultaneous Bayesian analysis of contingency tables in genetic association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(4), pages 347-360, August.
    4. Marta Cousido‐Rocha & Jacobo de Uña‐Álvarez & Sebastian Döhler, 2022. "Multiple comparison procedures for discrete uniform and homogeneous tests," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(1), pages 219-243, January.
    5. Anh-Tuan Hoang & Thorsten Dickhaus, 2022. "On the usage of randomized p-values in the Schweder–Spjøtvoll estimator," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 289-319, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bryan D. MacGregor & Rainer Schulz & Yuan Zhao, 2021. "Performance and Market Maturity in Mutual Funds: Is Real Estate Different?," The Journal of Real Estate Finance and Economics, Springer, vol. 63(3), pages 437-492, October.
    2. Bajgrowicz, Pierre & Scaillet, Olivier, 2012. "Technical trading revisited: False discoveries, persistence tests, and transaction costs," Journal of Financial Economics, Elsevier, vol. 106(3), pages 473-491.
    3. Shigeyuki Matsui & Hisashi Noma, 2011. "Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample-Size Assessments in Microarray Experiments," Biometrics, The International Biometric Society, vol. 67(4), pages 1225-1235, December.
    4. Axel Gandy & Georg Hahn, 2016. "A Framework for Monte Carlo based Multiple Testing," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(4), pages 1046-1063, December.
    5. Moon, H.R. & Perron, B., 2012. "Beyond panel unit root tests: Using multiple testing to determine the nonstationarity properties of individual series in a panel," Journal of Econometrics, Elsevier, vol. 169(1), pages 29-33.
    6. Sermpinis, Georgios & Hassanniakalager, Arman & Stasinakis, Charalampos & Psaradellis, Ioannis, 2021. "Technical analysis profitability and Persistence: A discrete false discovery approach on MSCI indices," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 73(C).
    7. Psaradellis, Ioannis & Laws, Jason & Pantelous, Athanasios A. & Sermpinis, Georgios, 2023. "Technical analysis, spread trading, and data snooping control," International Journal of Forecasting, Elsevier, vol. 39(1), pages 178-191.
    8. Ferreira José A. & Berkhof Johannes & Souverein Olga & Zwinderman Koos, 2009. "A Multiple Testing Approach to High-Dimensional Association Studies with an Application to the Detection of Associations between Risk Factors of Heart Disease and Genetic Polymorphisms," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-56, January.
    9. Huang, Rong & Pilbeam, Keith & Pouliot, William, 2021. "Do actively managed US mutual funds produce positive alpha?," Journal of Economic Behavior & Organization, Elsevier, vol. 182(C), pages 472-492.
    10. Bredin, Don & Cuthbertson, Keith & Nitzsche, Dirk & Thomas, Dylan C., 2014. "Performance and performance persistence of UK closed-end equity funds," International Review of Financial Analysis, Elsevier, vol. 34(C), pages 189-199.
    11. Chen, Xiongzhi, 2019. "Uniformly consistently estimating the proportion of false null hypotheses via Lebesgue–Stieltjes integral equations," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 724-744.
    12. Georgios Sermpinis & Arman Hassanniakalager & Charalampos Stasinakis & Ioannis Psaradellis, 2018. "Technical Analysis and Discrete False Discovery Rate: Evidence from MSCI Indices," Papers 1811.06766, arXiv.org, revised Jun 2019.
    13. Zehetmayer Sonja & Graf Alexandra C. & Posch Martin, 2015. "Sample size reassessment for a two-stage design controlling the false discovery rate," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(5), pages 429-442, November.
    14. Nik Tuzov & Frederi Viens, 2011. "Mutual fund performance: false discoveries, bias, and power," Annals of Finance, Springer, vol. 7(2), pages 137-169, May.
    15. Habiger, Joshua D. & Adekpedjou, Akim, 2014. "Optimal rejection curves for exact false discovery rate control," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 21-28.
    16. Xiaoye Jin, 2022. "Evaluating the predictive power of intraday technical trading in China's crude oil market," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(7), pages 1416-1432, November.
    17. Zarrabi, Nima & Snaith, Stuart & Coakley, Jerry, 2017. "FX technical trading rules can be profitable sometimes!," International Review of Financial Analysis, Elsevier, vol. 49(C), pages 113-127.
    18. Andrew Y. Chen, 2022. "Do t-Statistic Hurdles Need to be Raised?," Papers 2204.10275, arXiv.org, revised Apr 2024.
    19. Cuthbertson, Keith & Nitzsche, Dirk, 2013. "Performance, stock selection and market timing of the German equity mutual fund industry," Journal of Empirical Finance, Elsevier, vol. 21(C), pages 86-101.
    20. Tim Bancroft & Chuanlong Du & Dan Nettleton, 2013. "Estimation of False Discovery Rate Using Sequential Permutation p-Values," Biometrics, The International Biometric Society, vol. 69(1), pages 1-7, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:11:y:2012:i:4:n:12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.