High dimensional extension of the growth curve model and its application in genetics

Author

Listed:
  • Sayantee Jana

    (McMaster University)

  • Narayanaswamy Balakrishnan

    (McMaster University)

  • Dietrich Rosen

    (Swedish Agricultural University
    Linköping University)

  • Jemila Seid Hamid

    (McMaster University
    St. Michael’s Hospital
    McMaster University)

Abstract

Recent advances in technology have allowed researchers to collect large-scale, complex biological data simultaneously, often in matrix format. In genomic studies, for instance, measurements from tens to hundreds of thousands of genes are taken from individuals across several experimental groups. In time course microarray experiments, gene expression is measured at several time points for each individual across the whole genome, resulting in a high-dimensional matrix for each gene. In such experiments, researchers are faced with high-dimensional longitudinal data. Unfortunately, traditional methods for longitudinal data are not appropriate for high-dimensional situations. In this paper, we use the growth curve model, introduce a test suited to high-dimensional longitudinal data, and evaluate its performance using simulations. We also show how our approach can be used to filter genes in time course genomic experiments. We illustrate this using publicly available genomic data from experiments comparing normal human lung tissue with vanadium pentoxide treated human lung tissue, designed to understand the susceptibility of individuals working in petro-chemical factories to airway re-modelling. Using our method, we identified 1053 (about 5%) of the 22,277 genes as non-noise genes. Although our focus is on hypothesis testing, we also provide a modified maximum likelihood estimator for the mean parameter of the growth curve model and assess its performance through bias and mean squared error.
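
The abstract does not write out the model, so the following is only a minimal sketch of why the classical approach fails in high dimensions, not the authors' test or their modified estimator. It assumes the standard growth curve model X = A B C + E (X is p x n, A is the p x q within-individual design, C is the k x n between-individual design) and the classical maximum likelihood estimator of B, which requires inverting the p x p residual matrix S. All function and variable names here are hypothetical.

```python
import numpy as np

def residual_matrix(X, C):
    """S = X (I - C'(CC')^{-1} C) X', a p x p matrix of rank at most n - k."""
    n = X.shape[1]
    P = C.T @ np.linalg.solve(C @ C.T, C)   # projection onto the row space of C
    return X @ (np.eye(n) - P) @ X.T

def gcm_mle(X, A, C):
    """Classical MLE of B in X = A B C + E; only valid when S is invertible."""
    S = residual_matrix(X, C)
    S_inv_A = np.linalg.solve(S, A)          # S^{-1} A
    S_inv_X = np.linalg.solve(S, X)          # S^{-1} X
    # (A' S^{-1} A)^{-1} A' S^{-1} X, a q x n matrix
    left = np.linalg.solve(A.T @ S_inv_A, A.T @ S_inv_X)
    # Right-multiply by C' (CC')^{-1} to obtain the q x k estimate
    return left @ C.T @ np.linalg.inv(C @ C.T)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Low-dimensional case: 4 time points, linear growth, 2 groups of 20
    A = np.column_stack([np.ones(4), np.arange(1, 5)])
    C = np.kron(np.eye(2), np.ones((1, 20)))
    B = np.array([[1.0, 2.0], [0.5, -0.5]])
    X = A @ B @ C + 0.1 * rng.standard_normal((4, 40))
    print(gcm_mle(X, A, C))
    # High-dimensional case: 10 time points but only 6 individuals
    X_hd = rng.standard_normal((10, 6))
    C_hd = np.kron(np.eye(2), np.ones((1, 3)))
    print(np.linalg.matrix_rank(residual_matrix(X_hd, C_hd)))
```

When p exceeds n - k, S is rank deficient and cannot be inverted, so the classical estimator and tests built on it are undefined. That is the setting in which a modified estimator and test are needed.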

Suggested Citation

  • Sayantee Jana & Narayanaswamy Balakrishnan & Dietrich Rosen & Jemila Seid Hamid, 2017. "High dimensional extension of the growth curve model and its application in genetics," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(2), pages 273-292, June.
  • Handle: RePEc:spr:stmapp:v:26:y:2017:i:2:d:10.1007_s10260-016-0369-4
    DOI: 10.1007/s10260-016-0369-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-016-0369-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Yu Chuan Tai & Terence P. Speed, 2009. "On Gene Ranking Using Replicated Microarray Time Course Data," Biometrics, The International Biometric Society, vol. 65(1), pages 40-51, March.
    2. von Rosen, Dietrich, 1989. "Maximum likelihood estimators in multivariate linear normal models," Journal of Multivariate Analysis, Elsevier, vol. 31(2), pages 187-200, November.
    3. Smyth Gordon K, 2004. "Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-28, February.
    4. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    5. Yuan, Ming & Kendziorski, Christina, 2006. "Hidden Markov Models for Microarray Time Course Data in Multiple Biological Conditions," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1323-1332, December.
    6. Hamid, Jemila S. & Beyene, Joseph & von Rosen, Dietrich, 2011. "A novel trace test for the mean parameters in a multivariate growth curve model," Journal of Multivariate Analysis, Elsevier, vol. 102(2), pages 238-251, February.
    7. Hamid Jemila S & Beyene Joseph, 2009. "A Multivariate Growth Curve Model for Ranking Genes in Replicated Time Course Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-26, July.

    Citations

    Cited by:

    1. Jana, Sayantee & Balakrishnan, Narayanaswamy & Hamid, Jemila S., 2018. "Estimation of the parameters of the extended growth curve model under multivariate skew normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 111-128.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sayantee Jana & Narayanaswamy Balakrishnan & Jemila S. Hamid, 2020. "Inference in the Growth Curve Model under Multivariate Skew Normal Distribution," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(1), pages 34-69, May.
    2. Hamid Jemila S & Beyene Joseph, 2009. "A Multivariate Growth Curve Model for Ranking Genes in Replicated Time Course Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-26, July.
    3. Jana, Sayantee & Balakrishnan, Narayanaswamy & Hamid, Jemila S., 2018. "Estimation of the parameters of the extended growth curve model under multivariate skew normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 111-128.
    4. Xiao Min & Chen Ting & Ming Ruixing & Huang Kunpeng, 2020. "Optimal Estimation for Power of Variance with Application to Gene-Set Testing," Journal of Systems Science and Information, De Gruyter, vol. 8(6), pages 549-564, December.
    5. Aaron C Ericsson & J Wade Davis & William Spollen & Nathan Bivens & Scott Givan & Catherine E Hagan & Mark McIntosh & Craig L Franklin, 2015. "Effects of Vendor and Genetic Background on the Composition of the Fecal Microbiota of Inbred Mice," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-19, February.
    6. Cabana Garceran del Vall, Elisa & Laniado Rodas, Henry & Lillo Rodríguez, Rosa Elvira, 2017. "Multivariate outlier detection based on a robust Mahalanobis distance with shrinkage estimators," DES - Working Papers. Statistics and Econometrics. WS 24613, Universidad Carlos III de Madrid. Departamento de Estadística.
    7. Pan, Jianxin, 2004. "Discordant outlier detection in the growth curve model with Rao's simple covariance structure," Statistics & Probability Letters, Elsevier, vol. 69(2), pages 135-142, August.
    8. Li, Yang & Wang, Zhaojun & Zou, Changliang, 2016. "A simpler spatial-sign-based two-sample test for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 192-198.
    9. Yata, Kazuyoshi & Aoshima, Makoto, 2013. "PCA consistency for the power spiked model in high-dimensional settings," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 334-354.
    10. Hossain, Ahmed & Beyene, Joseph & Willan, Andrew R. & Hu, Pingzhao, 2009. "A flexible approximate likelihood ratio test for detecting differential expression in microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3685-3695, August.
    11. Ley, Christophe & Paindaveine, Davy & Verdebout, Thomas, 2015. "High-dimensional tests for spherical location and spiked covariance," Journal of Multivariate Analysis, Elsevier, vol. 139(C), pages 79-91.
    12. Xiaohong Li & Guy N Brock & Eric C Rouchka & Nigel G F Cooper & Dongfeng Wu & Timothy E O’Toole & Ryan S Gill & Abdallah M Eteleeb & Liz O’Brien & Shesh N Rai, 2017. "A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-22, May.
    13. Tzviel Frostig & Yoav Benjamini, 2022. "Testing the equality of multivariate means when $p > n$ by combining the Hotelling and Simes tests," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 390-415, June.
    14. Kerr Kathleen F., 2012. "Optimality Criteria for the Design of 2-Color Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-9, January.
    15. Zhou, Bu & Guo, Jia, 2017. "A note on the unbiased estimator of Σ²," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 141-146.
    16. Li, Weiming & Xu, Yangchang, 2022. "Asymptotic properties of high-dimensional spatial median in elliptical distributions with application," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    17. Ambroise Jérôme & Bearzatto Bertrand & Robert Annie & Macq Benoit & Gala Jean-Luc, 2012. "Combining Multiple Laser Scans of Spotted Microarrays by Means of a Two-Way ANOVA Model," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-20, February.
    18. J. McClatchy & R. Strogantsev & E. Wolfe & H. Y. Lin & M. Mohammadhosseini & B. A. Davis & C. Eden & D. Goldman & W. H. Fleming & P. Conley & G. Wu & L. Cimmino & H. Mohammed & A. Agarwal, 2023. "Clonal hematopoiesis related TET2 loss-of-function impedes IL1β-mediated epigenetic reprogramming in hematopoietic stem and progenitor cells," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    19. Alexandra Gyurdieva & Stefan Zajic & Ya-Fang Chang & E. Andres Houseman & Shan Zhong & Jaegil Kim & Michael Nathenson & Thomas Faitg & Mary Woessner & David C. Turner & Aisha N. Hasan & John Glod & Ro, 2022. "Biomarker correlates with response to NY-ESO-1 TCR T cells in patients with synovial sarcoma," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    20. Sora Yoon & Seon-Young Kim & Dougu Nam, 2016. "Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-16, November.
