IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v26y2017i2d10.1007_s10260-016-0369-4.html
   My bibliography  Save this article

High dimensional extension of the growth curve model and its application in genetics

Author

Listed:
  • Sayantee Jana

    (McMaster University)

  • Narayanaswamy Balakrishnan

    (McMaster University)

  • Dietrich Rosen

    (Swedish Agricultural University
    Linköping University)

  • Jemila Seid Hamid

    (McMaster University
    St. Michael’s Hospital
    McMaster University)

Abstract

Recent advances in technology have allowed researchers to collect large scale complex biological data, simultaneously, often in matrix format. In genomic studies, for instance, measurements from tens to hundreds of thousands of genes are taken from individuals across several experimental groups. In time course microarray experiments, gene expression is measured at several time points for each individual across the whole genome resulting in a high-dimensional matrix for each gene. In such experiments, researchers are faced with high-dimensional longitudinal data. Unfortunately, traditional methods for longitudinal data are not appropriate for high-dimensional situations. In this paper, we use the growth curve model and introduce test useful for high-dimensional longitudinal data and evaluate its performance using simulations. We also show how our approach can be used to filter genes in time course genomic experiments. We illustrate this using publicly available genomic data, involving experiments comparing normal human lung tissue with vanadium pentoxide treated human lung tissue, designed with the aim of understanding the susceptibility of individuals working in petro-chemical factories to airway re-modelling. Using our method, we were able to filter out 1053 (about 5 %) genes as non-noise genes from a pool of 22,277. Although our focus is on hypothesis testing, we also provided modified maximum likelihood estimator for the mean parameter of the growth curve model and assessed its performance through bias and mean squared error.

Suggested Citation

  • Sayantee Jana & Narayanaswamy Balakrishnan & Dietrich Rosen & Jemila Seid Hamid, 2017. "High dimensional extension of the growth curve model and its application in genetics," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(2), pages 273-292, June.
  • Handle: RePEc:spr:stmapp:v:26:y:2017:i:2:d:10.1007_s10260-016-0369-4
    DOI: 10.1007/s10260-016-0369-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-016-0369-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-016-0369-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    2. Yuan, Ming & Kendziorski, Christina, 2006. "Hidden Markov Models for Microarray Time Course Data in Multiple Biological Conditions," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1323-1332, December.
    3. Hamid Jemila S & Beyene Joseph, 2009. "A Multivariate Growth Curve Model for Ranking Genes in Replicated Time Course Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-28, July.
    4. Yu Chuan Tai & Terence P. Speed, 2009. "On Gene Ranking Using Replicated Microarray Time Course Data," Biometrics, The International Biometric Society, vol. 65(1), pages 40-51, March.
    5. von Rosen, Dietrich, 1989. "Maximum likelihood estimators in multivariate linear normal models," Journal of Multivariate Analysis, Elsevier, vol. 31(2), pages 187-200, November.
    6. Smyth Gordon K, 2004. "Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-28, February.
    7. Hamid, Jemila S. & Beyene, Joseph & von Rosen, Dietrich, 2011. "A novel trace test for the mean parameters in a multivariate growth curve model," Journal of Multivariate Analysis, Elsevier, vol. 102(2), pages 238-251, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jana, Sayantee & Balakrishnan, Narayanaswamy & Hamid, Jemila S., 2018. "Estimation of the parameters of the extended growth curve model under multivariate skew normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 111-128.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sayantee Jana & Narayanaswamy Balakrishnan & Jemila S. Hamid, 2020. "Inference in the Growth Curve Model under Multivariate Skew Normal Distribution," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(1), pages 34-69, May.
    2. Jana, Sayantee & Balakrishnan, Narayanaswamy & Hamid, Jemila S., 2018. "Estimation of the parameters of the extended growth curve model under multivariate skew normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 111-128.
    3. Hamid Jemila S & Beyene Joseph, 2009. "A Multivariate Growth Curve Model for Ranking Genes in Replicated Time Course Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-28, July.
    4. Xiao Min & Chen Ting & Huang Kunpeng & Ming Ruixing, 2020. "Optimal Estimation for Power of Variance with Application to Gene-Set Testing," Journal of Systems Science and Information, De Gruyter, vol. 8(6), pages 549-564, December.
    5. Aaron C Ericsson & J Wade Davis & William Spollen & Nathan Bivens & Scott Givan & Catherine E Hagan & Mark McIntosh & Craig L Franklin, 2015. "Effects of Vendor and Genetic Background on the Composition of the Fecal Microbiota of Inbred Mice," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-19, February.
    6. Yata, Kazuyoshi & Aoshima, Makoto, 2013. "PCA consistency for the power spiked model in high-dimensional settings," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 334-354.
    7. Hossain, Ahmed & Beyene, Joseph & Willan, Andrew R. & Hu, Pingzhao, 2009. "A flexible approximate likelihood ratio test for detecting differential expression in microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3685-3695, August.
    8. repec:plo:pone00:0103514 is not listed on IDEAS
    9. Xiaohong Li & Guy N Brock & Eric C Rouchka & Nigel G F Cooper & Dongfeng Wu & Timothy E O’Toole & Ryan S Gill & Abdallah M Eteleeb & Liz O’Brien & Shesh N Rai, 2017. "A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-22, May.
    10. Ambroise Jérôme & Bearzatto Bertrand & Robert Annie & Macq Benoit & Gala Jean-Luc, 2012. "Combining Multiple Laser Scans of Spotted Microarrays by Means of a Two-Way ANOVA Model," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-20, February.
    11. J. McClatchy & R. Strogantsev & E. Wolfe & H. Y. Lin & M. Mohammadhosseini & B. A. Davis & C. Eden & D. Goldman & W. H. Fleming & P. Conley & G. Wu & L. Cimmino & H. Mohammed & A. Agarwal, 2023. "Clonal hematopoiesis related TET2 loss-of-function impedes IL1β-mediated epigenetic reprogramming in hematopoietic stem and progenitor cells," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Alexandra Gyurdieva & Stefan Zajic & Ya-Fang Chang & E. Andres Houseman & Shan Zhong & Jaegil Kim & Michael Nathenson & Thomas Faitg & Mary Woessner & David C. Turner & Aisha N. Hasan & John Glod & Ro, 2022. "Biomarker correlates with response to NY-ESO-1 TCR T cells in patients with synovial sarcoma," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    13. Yu Lianbo & Gulati Parul & Fernandez Soledad & Pennell Michael & Kirschner Lawrence & Jarjoura David, 2011. "Fully Moderated T-statistic for Small Sample Size Gene Expression Arrays," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-22, September.
    14. Sinan Xiong & Jianbiao Zhou & Tze King Tan & Tae-Hoon Chung & Tuan Zea Tan & Sabrina Hui-Min Toh & Nicole Xin Ning Tang & Yunlu Jia & Yi Xiang See & Melissa Jane Fullwood & Takaomi Sanda & Wee-Joo Chn, 2024. "Super enhancer acquisition drives expression of oncogenic PPP1R15B that regulates protein homeostasis in multiple myeloma," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    15. Saha, Enakshi & Sarkar, Soham & Ghosh, Anil K., 2017. "Some high-dimensional one-sample tests based on functions of interpoint distances," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 83-95.
    16. Jianghao Li & Shizhe Hong & Zhenzhen Niu & Zhidong Bai, 2025. "Test for high-dimensional linear hypothesis of mean vectors via random integration," Statistical Papers, Springer, vol. 66(1), pages 1-34, January.
    17. Arturas Juodis & Simon Reese, 2018. "The Incidental Parameters Problem in Testing for Remaining Cross-section Correlation," Papers 1810.03715, arXiv.org, revised Feb 2021.
    18. Chaofeng Yuan & Wensheng Zhu & Xuming He & Jianhua Guo, 2019. "A mixture factor model with applications to microarray data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(1), pages 60-76, March.
    19. Nott, David J. & Yu, Zeming & Chan, Eva & Cotsapas, Chris & Cowley, Mark J. & Pulvers, Jeremy & Williams, Rohan & Little, Peter, 2007. "Hierarchical Bayes variable selection and microarray experiments," Journal of Multivariate Analysis, Elsevier, vol. 98(4), pages 852-872, April.
    20. Alexander Kaever & Manuel Landesfeind & Kirstin Feussner & Burkhard Morgenstern & Ivo Feussner & Peter Meinicke, 2014. "Meta-Analysis of Pathway Enrichment: Combining Independent and Dependent Omics Data Sets," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-12, February.
    21. Liu, Shuangzhe & Leiva, Víctor & Zhuang, Dan & Ma, Tiefeng & Figueroa-Zúñiga, Jorge I., 2022. "Matrix differential calculus with applications in the multivariate linear model and its diagnostics," Journal of Multivariate Analysis, Elsevier, vol. 188(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:26:y:2017:i:2:d:10.1007_s10260-016-0369-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.