IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v82y2020i4p933-963.html
   My bibliography  Save this article

A unified data‐adaptive framework for high dimensional change point detection

Author

Listed:
  • Bin Liu
  • Cheng Zhou
  • Xinsheng Zhang
  • Yufeng Liu

Abstract

In recent years, change point detection for a high dimensional data sequence has become increasingly important in many scientific fields such as biology and finance. The existing literature develops a variety of methods designed for either a specified parameter (e.g. the mean or covariance) or a particular alternative pattern (sparse or dense), but not for both scenarios simultaneously. To overcome this limitation, we provide a general framework for developing tests that are suitable for a large class of parameters, and also adaptive to various alternative scenarios. In particular, by generalizing the classical cumulative sum statistic, we construct the U‐statistic‐based cumulative sum matrix C. Two cases corresponding to common or different change point locations across the components are considered. We then propose two types of individual test statistics by aggregating C on the basis of the adjusted Lp‐norm with p ∈ {1,…,∞}. Combining the corresponding individual tests, we construct two types of data‐adaptive tests for the two cases, which are both powerful under various alternative patterns. A multiplier bootstrap method is introduced for approximating the proposed test statistics’ limiting distributions. With flexible dependence structure across co‐ordinates and mild moment conditions, we show the optimality of our methods theoretically in terms of size and power by allowing the dimension d and the number of parameters q to be much larger than the sample size n. An R package called AdaptiveCpt is developed to implement our algorithms. Extensive simulation studies provide further support for our theory. An application to a comparative genomic hybridization data set also demonstrates the usefulness of our proposed methods.

Suggested Citation

  • Bin Liu & Cheng Zhou & Xinsheng Zhang & Yufeng Liu, 2020. "A unified data‐adaptive framework for high dimensional change point detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(4), pages 933-963, September.
  • Handle: RePEc:bla:jorssb:v:82:y:2020:i:4:p:933-963
    DOI: 10.1111/rssb.12375
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12375
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12375?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Haeran Cho & Piotr Fryzlewicz, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(2), pages 475-507, March.
    2. Nancy R. Zhang & David O. Siegmund & Hanlee Ji & Jun Z. Li, 2010. "Detecting simultaneous changepoints in multiple sequences," Biometrika, Biometrika Trust, vol. 97(3), pages 631-645.
    3. Cho, Haeran & Fryzlewicz, Piotr, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," LSE Research Online Documents on Economics 57147, London School of Economics and Political Science, LSE Library.
    4. Tengyao Wang & Richard J. Samworth, 2018. "High dimensional change point estimation via sparse projection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 57-83, January.
    5. Horváth, Lajos & Kokoszka, Piotr & Steinebach, Josef, 1999. "Testing for Changes in Multivariate Dependent Observations with an Application to Temperature Changes," Journal of Multivariate Analysis, Elsevier, vol. 68(1), pages 96-119, January.
    6. David S. Matteson & Nicholas A. James, 2014. "A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 334-345, March.
    7. Shao, Xiaofeng & Zhang, Xianyang, 2010. "Testing for Change Points in Time Series," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1228-1240.
    8. Marie Hušková & Simos Meintanis, 2006. "Change Point Analysis based on Empirical Characteristic Functions," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 63(2), pages 145-168, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiang, Feiyu & Wang, Runmin & Shao, Xiaofeng, 2023. "Robust inference for change points in high dimension," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    2. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    3. Liu, Bin & Zhang, Xinsheng & Liu, Yufeng, 2022. "High dimensional change point inference: Recent developments and extensions," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. Junwei Hu & Lihong Wang, 2023. "A weighted U-statistic based change point test for multivariate time series," Statistical Papers, Springer, vol. 64(3), pages 753-778, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Bin & Zhang, Xinsheng & Liu, Yufeng, 2022. "High dimensional change point inference: Recent developments and extensions," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Fryzlewicz, Piotr, 2020. "Detecting possibly frequent change-points: Wild Binary Segmentation 2 and steepest-drop model selection," LSE Research Online Documents on Economics 103430, London School of Economics and Political Science, LSE Library.
    3. Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
    4. Hajra Siddiqa & Sajid Ali & Ismail Shah, 2021. "Most recent changepoint detection in censored panel data," Computational Statistics, Springer, vol. 36(1), pages 515-540, March.
    5. Chen, Cathy Yi-hsuan & Okhrin, Yarema & Wang, Tengyao, 2022. "Monitoring network changes in social media," LSE Research Online Documents on Economics 113742, London School of Economics and Political Science, LSE Library.
    6. Steland, Ansgar, 2020. "Testing and estimating change-points in the covariance matrix of a high-dimensional time series," Journal of Multivariate Analysis, Elsevier, vol. 177(C).
    7. Brault, Vincent & Ouadah, Sarah & Sansonnet, Laure & Lévy-Leduc, Céline, 2018. "Nonparametric multiple change-point estimation for analyzing large Hi-C data matrices," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 143-165.
    8. Chen, Likai & Wang, Weining & Wu, Wei Biao, 2019. "Inference of Break-Points in High-Dimensional Time Series," IRTG 1792 Discussion Papers 2019-013, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    9. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    10. Cho, Haeran & Korkas, Karolos K., 2022. "High-dimensional GARCH process segmentation with an application to Value-at-Risk," Econometrics and Statistics, Elsevier, vol. 23(C), pages 187-203.
    11. Jiang, Feiyu & Zhao, Zifeng & Shao, Xiaofeng, 2023. "Time series analysis of COVID-19 infection curve: A change-point perspective," Journal of Econometrics, Elsevier, vol. 232(1), pages 1-17.
    12. Pang, Tianxiao & Du, Lingjie & Chong, Terence Tai-Leung, 2021. "Estimating multiple breaks in nonstationary autoregressive models," Journal of Econometrics, Elsevier, vol. 221(1), pages 277-311.
    13. Neil Hwang & Jiarui Xu & Shirshendu Chatterjee & Sharmodeep Bhattacharyya, 2022. "The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 283-320, June.
    14. B. Cooper Boniece & Lajos Horv'ath & Lorenzo Trapani, 2023. "On changepoint detection in functional data using empirical energy distance," Papers 2310.04853, arXiv.org.
    15. Aaron Paul Lowther & Rebecca Killick & Idris Arthur Eckley, 2023. "Detecting changes in mixed‐sampling rate data sequences," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    16. Ping‐Shou Zhong & Jun Li & Piotr Kokoszka, 2021. "Multivariate analysis of variance and change points estimation for high‐dimensional longitudinal data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 375-405, June.
    17. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    18. Holger Dette & Theresa Eckle & Mathias Vetter, 2020. "Multiscale change point detection for dependent data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1243-1274, December.
    19. S. O. Tickle & I. A. Eckley & P. Fearnhead, 2021. "A computationally efficient, high‐dimensional multiple changepoint procedure with application to global terrorism incidence," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1303-1325, October.
    20. Hoga, Yannick, 2017. "Monitoring multivariate time series," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 105-121.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:82:y:2020:i:4:p:933-963. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.