
Using Copula to Model Dependence When Testing Multiple Hypotheses in DNA Microarray Experiments: A Bayesian Approximation

Authors
  • Elisa C. J. Maria

    (Departamento de Matemática e Estatística, Faculdade de Ciências Naturais, Matemática e Estatística, Universidade Rovuma, 3100 Nampula, Mozambique)

  • Isabel Salazar

    (Departamento de Producción Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, 28040 Madrid, Spain)

  • Luis Sanz

    (Departamento de Estadística e IO, Facultad de Ciencias Matemáticas, Universidad Complutense de Madrid, 28040 Madrid, Spain)

  • Miguel A. Gómez-Villegas

    (Departamento de Estadística e IO, Facultad de Ciencias Matemáticas, Universidad Complutense de Madrid, 28040 Madrid, Spain)

Abstract

Many experiments require testing many hypotheses simultaneously. This is particularly relevant in DNA microarray experiments, where it is common to analyze many genes to determine which of them are differentially expressed under two conditions. Another important problem in this context is how to model the dependence at the level of gene expression. In this paper, we propose a Bayesian procedure for simultaneously testing multiple hypotheses that models the dependence through copula functions and can use all available information, both objective and subjective. The approach has the advantage that it can be used with different dependence structures. Simulated data were analyzed to examine the performance of the proposed approach. The results show that our procedure captures the dependence appropriately and correctly classifies a high percentage of true and false null hypotheses when a right-skewed beta prior distribution is chosen for the initial probability of each null hypothesis, resulting in a very powerful procedure. The procedure is also illustrated with real data.
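
As a rough illustration of what "modeling the dependence through copula functions" can mean, the sketch below draws dependent Uniform(0, 1) variates from an equicorrelated Gaussian copula. It is not the authors' procedure: the choice of a Gaussian copula, the value rho = 0.6, and the helper names `gaussian_copula_sample` and `pearson` are all assumptions made for this example. It only shows that each margin stays uniform while the variates remain correlated.

```python
import math
import random


def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def gaussian_copula_sample(m, rho, rng):
    """Draw m dependent Uniform(0, 1) variates (hypothetical helper).

    Assumes an equicorrelated Gaussian copula with 0 <= rho < 1:
    each latent normal mixes one shared factor with its own noise.
    """
    shared = rng.gauss(0.0, 1.0)
    uniforms = []
    for _ in range(m):
        own = rng.gauss(0.0, 1.0)
        # Latent normal with unit variance and pairwise correlation rho.
        z = math.sqrt(rho) * shared + math.sqrt(1.0 - rho) * own
        # The probability integral transform gives a uniform margin.
        uniforms.append(normal_cdf(z))
    return uniforms


def pearson(xs, ys):
    """Sample Pearson correlation of two equal-length lists."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


rng = random.Random(0)  # fixed seed, chosen arbitrarily for repeatability
pairs = [gaussian_copula_sample(2, 0.6, rng) for _ in range(5000)]
u1 = [p[0] for p in pairs]
u2 = [p[1] for p in pairs]
mean_u1 = sum(u1) / len(u1)   # should be close to 0.5 for a uniform margin
corr_u = pearson(u1, u2)      # clearly positive when rho > 0
print(round(mean_u1, 2), round(corr_u, 2))
```

Swapping in a different copula family would change only how the latent variates are coupled, which is one way to read the abstract's remark that the approach works with different dependence structures.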

Suggested Citation

  • Elisa C. J. Maria & Isabel Salazar & Luis Sanz & Miguel A. Gómez-Villegas, 2020. "Using Copula to Model Dependence When Testing Multiple Hypotheses in DNA Microarray Experiments: A Bayesian Approximation," Mathematics, MDPI, vol. 8(9), pages 1-22, September.
  • Handle: RePEc:gam:jmathe:v:8:y:2020:i:9:p:1514-:d:409073

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/8/9/1514/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/8/9/1514/
    Download Restriction: no


    Citations

    Cited by:

    1. Ricardo López-Ruiz, 2022. "Mathematical Biology: Modeling, Analysis, and Simulations," Mathematics, MDPI, vol. 10(20), pages 1-2, October.
    2. Kexin Li & Jianxu Liu & Yuting Xue & Sanzidur Rahman & Songsak Sriboonchitta, 2022. "Consequences of Ignoring Dependent Error Components and Heterogeneity in a Stochastic Frontier Model: An Application to Rice Producers in Northern Thailand," Agriculture, MDPI, vol. 12(8), pages 1-17, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. E. M. Conlon & B. L. Postier & B. A. Methe & K. P. Nevin & D. R. Lovley, 2009. "Hierarchical Bayesian meta-analysis models for cross-platform microarray studies," Journal of Applied Statistics, Taylor & Francis Journals, vol. 36(10), pages 1067-1085.
    2. Hong, Zhaoping & Lian, Heng, 2012. "BOPA: A Bayesian hierarchical model for outlier expression detection," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4146-4156.
    3. Rossell David & Guerra Rudy & Scott Clayton, 2008. "Semi-Parametric Differential Expression Analysis via Partial Mixture Estimation," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-29, April.
    4. Wang, Xia & Shojaie, Ali & Zou, Jian, 2019. "Bayesian hidden Markov models for dependent large-scale multiple testing," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 123-136.
    5. Ghosh Debashis, 2012. "Incorporating the Empirical Null Hypothesis into the Benjamini-Hochberg Procedure," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-21, July.
    6. Ruth Heller & Saharon Rosset, 2021. "Optimal control of false discovery criteria in the two‐group model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(1), pages 133-155, February.
    7. Gómez-Villegas Miguel A. & Sanz Luis & Salazar Isabel, 2014. "A Bayesian decision procedure for testing multiple hypotheses in DNA microarray experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(1), pages 49-65, February.
    8. Qingyun Cai & Hock Peng Chan, 2017. "A Double Application of the Benjamini-Hochberg Procedure for Testing Batched Hypotheses," Methodology and Computing in Applied Probability, Springer, vol. 19(2), pages 429-443, June.
    9. Noirrit Kiran Chandra & Sourabh Bhattacharya, 2021. "Asymptotic theory of dependent Bayesian multiple testing procedures under possible model misspecification," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(5), pages 891-920, October.
    10. Zhigen Zhao, 2022. "Where to find needles in a haystack?," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 148-174, March.
    11. Izmirlian, Grant, 2020. "Strong consistency and asymptotic normality for quantities related to the Benjamini–Hochberg false discovery rate procedure," Statistics & Probability Letters, Elsevier, vol. 160(C).
    12. Cipolli III, William & Hanson, Timothy & McLain, Alexander C., 2016. "Bayesian nonparametric multiple testing," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 64-79.
    13. Joaquim Casellas & Luis Varona, 2012. "Modeling Skewness in Human Transcriptomes," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-5, June.
    14. Pei Fen Kuan & Derek Y. Chiang, 2012. "Integrating Prior Knowledge in Multiple Testing under Dependence with Applications to Detecting Differential DNA Methylation," Biometrics, The International Biometric Society, vol. 68(3), pages 774-783, September.
    15. Wang, Jiangzhou & Cui, Tingting & Zhu, Wensheng & Wang, Pengfei, 2023. "Covariate-modulated large-scale multiple testing under dependence," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    16. Michele Guindani & Wesley O. Johnson, 2018. "More nonparametric Bayesian inference in applications," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 239-251, June.
    17. Hironori Fujisawa & Takayuki Sakaguchi, 2012. "Optimal significance analysis of microarray data in a class of tests whose null statistic can be constructed," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 21(2), pages 280-300, June.
    18. Hai Shu & Bin Nan & Robert Koeppe, 2015. "Multiple testing for neuroimaging via hidden Markov random field," Biometrics, The International Biometric Society, vol. 71(3), pages 741-750, September.
    19. Shotwell Matthew S & Slate Elizabeth H, 2010. "Bayesian Modeling of Footrace Finishing Times," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 6(3), pages 1-21, July.
    20. David I. Ohlssen & Linda D. Sharples & David J. Spiegelhalter, 2007. "A hierarchical modelling framework for identifying unusual performance in health care providers," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 865-890, October.
