IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v83y2021i2p247-270.html
   My bibliography  Save this article

Finite sample change point inference and identification for high‐dimensional mean vectors

Author

Listed:
  • Mengjia Yu
  • Xiaohui Chen

Abstract

Cumulative sum (CUSUM) statistics are widely used in the change point inference and identification. For the problem of testing for existence of a change point in an independent sample generated from the mean‐shift model, we introduce a Gaussian multiplier bootstrap to calibrate critical values of the CUSUM test statistics in high dimensions. The proposed bootstrap CUSUM test is fully data dependent and it has strong theoretical guarantees under arbitrary dependence structures and mild moment conditions. Specifically, we show that with a boundary removal parameter the bootstrap CUSUM test enjoys the uniform validity in size under the null and it achieves the minimax separation rate under the sparse alternatives when the dimension p can be larger than the sample size n. Once a change point is detected, we estimate the change point location by maximising the ℓ∞‐norm of the generalised CUSUM statistics at two different weighting scales corresponding to covariance stationary and non‐stationary CUSUM statistics. For both estimators, we derive their rates of convergence and show that dimension impacts the rates only through logarithmic factors, which implies that consistency of the CUSUM estimators is possible when p is much larger than n. In the presence of multiple change points, we propose a principled bootstrap‐assisted binary segmentation (BABS) algorithm to dynamically adjust the change point detection rule and recursively estimate their locations. We derive its rate of convergence under suitable signal separation and strength conditions. The results derived in this paper are non‐asymptotic and we provide extensive simulation studies to assess the finite sample performance. The empirical evidence shows an encouraging agreement with our theoretical results.

Suggested Citation

  • Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
  • Handle: RePEc:bla:jorssb:v:83:y:2021:i:2:p:247-270
    DOI: 10.1111/rssb.12406
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12406
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12406?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Klaus Frick & Axel Munk & Hannes Sieling, 2014. "Multiscale change point inference," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(3), pages 495-580, June.
    2. István Berkes & Robertas Gabrys & Lajos Horváth & Piotr Kokoszka, 2009. "Detecting changes in the mean of functional observations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 927-946, November.
    3. Harchaoui, Z. & Lévy-Leduc, C., 2010. "Multiple Change-Point Estimation With a Total Variation Penalty," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1480-1493.
    4. Haeran Cho & Piotr Fryzlewicz, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(2), pages 475-507, March.
    5. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    6. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    7. Jushan Bai, 1997. "Estimation Of A Change Point In Multiple Regression Models," The Review of Economics and Statistics, MIT Press, vol. 79(4), pages 551-563, November.
    8. Nancy R. Zhang & David O. Siegmund & Hanlee Ji & Jun Z. Li, 2010. "Detecting simultaneous changepoints in multiple sequences," Biometrika, Biometrika Trust, vol. 97(3), pages 631-645.
    9. Cho, Haeran & Fryzlewicz, Piotr, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," LSE Research Online Documents on Economics 57147, London School of Economics and Political Science, LSE Library.
    10. James, Nicholas A. & Matteson, David S., 2015. "ecp: An R Package for Nonparametric Multiple Change Point Analysis of Multivariate Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i07).
    11. Tengyao Wang & Richard J. Samworth, 2018. "High dimensional change point estimation via sparse projection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 57-83, January.
    12. Ombao, Hernando & von Sachs, Rainer & Guo, Wensheng, 2005. "SLEX Analysis of Multivariate Nonstationary Time Series," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 519-531, June.
    13. Fryzlewicz, Piotr, 2014. "Wild binary segmentation for multiple change-point detection," LSE Research Online Documents on Economics 57146, London School of Economics and Political Science, LSE Library.
    14. Bhattacharya, P.K., 1987. "Maximum likelihood estimation of a change-point in the distribution of independent random variables: General multiparameter case," Journal of Multivariate Analysis, Elsevier, vol. 23(2), pages 183-208, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiang, Feiyu & Wang, Runmin & Shao, Xiaofeng, 2023. "Robust inference for change points in high dimension," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    2. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    3. Liu, Bin & Zhang, Xinsheng & Liu, Yufeng, 2022. "High dimensional change point inference: Recent developments and extensions," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. Xu, Haotian & Wang, Daren & Zhao, Zifeng & Yu, Yi, 2022. "Change point inference in high-dimensional regression models under temporal dependence," LIDAM Discussion Papers ISBA 2022027, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    5. Chang, Jinyuan & Jiang, Qing & Shao, Xiaofeng, 2023. "Testing the martingale difference hypothesis in high dimension," Journal of Econometrics, Elsevier, vol. 235(2), pages 972-1000.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fryzlewicz, Piotr, 2020. "Detecting possibly frequent change-points: Wild Binary Segmentation 2 and steepest-drop model selection," LSE Research Online Documents on Economics 103430, London School of Economics and Political Science, LSE Library.
    2. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    3. Liu, Bin & Zhang, Xinsheng & Liu, Yufeng, 2022. "High dimensional change point inference: Recent developments and extensions," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    5. Holger Dette & Theresa Eckle & Mathias Vetter, 2020. "Multiscale change point detection for dependent data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1243-1274, December.
    6. Bill Russell & Dooruj Rambaccussing, 2019. "Breaks and the statistical process of inflation: the case of estimating the ‘modern’ long-run Phillips curve," Empirical Economics, Springer, vol. 56(5), pages 1455-1475, May.
    7. Oleksandr Gromenko & Piotr Kokoszka & Matthew Reimherr, 2017. "Detection of change in the spatiotemporal mean function," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 29-50, January.
    8. Wu Wang & Xuming He & Zhongyi Zhu, 2020. "Statistical inference for multiple change‐point models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1149-1170, December.
    9. Qing Yang & Yu-Ning Li & Yi Zhang, 2020. "Change point detection for nonparametric regression under strongly mixing process," Statistical Papers, Springer, vol. 61(4), pages 1465-1506, August.
    10. Chen, Likai & Wang, Weining & Wu, Wei Biao, 2019. "Inference of Break-Points in High-Dimensional Time Series," IRTG 1792 Discussion Papers 2019-013, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    11. V. Brault & C. Lévy-Leduc & A. Mathieu & A. Jullien, 2018. "Change-Point Estimation in the Multivariate Model Taking into Account the Dependence: Application to the Vegetative Development of Oilseed Rape," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(3), pages 374-389, September.
    12. Cho, Haeran & Korkas, Karolos K., 2022. "High-dimensional GARCH process segmentation with an application to Value-at-Risk," Econometrics and Statistics, Elsevier, vol. 23(C), pages 187-203.
    13. Jiang, Feiyu & Zhao, Zifeng & Shao, Xiaofeng, 2023. "Time series analysis of COVID-19 infection curve: A change-point perspective," Journal of Econometrics, Elsevier, vol. 232(1), pages 1-17.
    14. Bin Liu & Cheng Zhou & Xinsheng Zhang & Yufeng Liu, 2020. "A unified data‐adaptive framework for high dimensional change point detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(4), pages 933-963, September.
    15. Ma, Chenchen & Tu, Yundong, 2023. "Group fused Lasso for large factor models with multiple structural breaks," Journal of Econometrics, Elsevier, vol. 233(1), pages 132-154.
    16. Hajra Siddiqa & Sajid Ali & Ismail Shah, 2021. "Most recent changepoint detection in censored panel data," Computational Statistics, Springer, vol. 36(1), pages 515-540, March.
    17. Chen, Cathy Yi-hsuan & Okhrin, Yarema & Wang, Tengyao, 2022. "Monitoring network changes in social media," LSE Research Online Documents on Economics 113742, London School of Economics and Political Science, LSE Library.
    18. S. O. Tickle & I. A. Eckley & P. Fearnhead, 2021. "A computationally efficient, high‐dimensional multiple changepoint procedure with application to global terrorism incidence," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1303-1325, October.
    19. Rice, Gregory & Zhang, Chi, 2022. "Consistency of binary segmentation for multiple change-point estimation with functional data," Statistics & Probability Letters, Elsevier, vol. 180(C).
    20. Simon Bussy & Mokhtar Z. Alaya & Anne‐Sophie Jannot & Agathe Guilloux, 2022. "Binacox: automatic cut‐point detection in high‐dimensional Cox model with applications in genetics," Biometrics, The International Biometric Society, vol. 78(4), pages 1414-1426, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:83:y:2021:i:2:p:247-270. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.