IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v182y2023ics0167947323000166.html
   My bibliography  Save this article

Change-point testing for parallel data sets with FDR control

Author

Listed:
  • Cui, Junfeng
  • Wang, Guanghui
  • Zou, Changliang
  • Wang, Zhaojun

Abstract

Large parallel data sets—consisting of paired measurements of responses and covariates—collected over time from numerous sources are ubiquitous. It is of great interest to identify the data sources where the underlying regression relationship of each data set has shifted. To be specific, the regression coefficient is changed to another one at some time point for each data set. Borrowing the strength of recent developments of multiple testing procedures, a residual-aggregated testing (RAT) method is proposed for recovering such data sources by controlling the false discovery rate (FDR). The proposed method can effectively incorporate the dependence structure among different data sets, and is more robust than the conventional Benjamini-Hochberg method based on asymptotic p-values or numerical approximations. Under mild conditions, the asymptotic validity for both the false discovery proportion and FDR control is established. Extensive numerical results further confirm the effectiveness and robustness of the proposed method.

Suggested Citation

  • Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
  • Handle: RePEc:eee:csdana:v:182:y:2023:i:c:s0167947323000166
    DOI: 10.1016/j.csda.2023.107705
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947323000166
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2023.107705?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    2. Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
    3. Haeran Cho & Piotr Fryzlewicz, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(2), pages 475-507, March.
    4. Chen, Liang & Dolado, Juan J. & Gonzalo, Jesús, 2014. "Detecting big structural breaks in large factor models," Journal of Econometrics, Elsevier, vol. 180(1), pages 30-48.
    5. Jushan Bai & Pierre Perron, 1998. "Estimating and Testing Linear Models with Multiple Structural Changes," Econometrica, Econometric Society, vol. 66(1), pages 47-78, January.
    6. Rafal Baranowski & Yining Chen & Piotr Fryzlewicz, 2019. "Narrowest‐over‐threshold detection of multiple change points and change‐point‐like features," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(3), pages 649-672, July.
    7. Sokbae Lee & Myung Hwan Seo & Youngki Shin, 2016. "The lasso for high dimensional regression with a possible change point," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 193-210, January.
    8. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    9. Xu Cheng & Zhipeng Liao & Frank Schorfheide, 2016. "Shrinkage Estimation of High-Dimensional Factor Models with Structural Instabilities," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(4), pages 1511-1543.
    10. Andrews, Donald W K, 1993. "Tests for Parameter Instability and Structural Change with Unknown Change Point," Econometrica, Econometric Society, vol. 61(4), pages 821-856, July.
    11. Shujie Ma & Wei Lan & Liangjun Su & Chih-Ling Tsai, 2020. "Testing Alphas in Conditional Time-Varying Factor Models With High-Dimensional Assets," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(1), pages 214-227, January.
    12. Bai, Jushan, 2010. "Common breaks in means and variances for panel data," Journal of Econometrics, Elsevier, vol. 157(1), pages 78-92, July.
    13. Bin Liu & Cheng Zhou & Xinsheng Zhang & Yufeng Liu, 2020. "A unified data‐adaptive framework for high dimensional change point detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(4), pages 933-963, September.
    14. Gombay, Edit & Horváth, Lajos, 1994. "Limit theorems for change in linear regression," Journal of Multivariate Analysis, Elsevier, vol. 48(1), pages 43-69, January.
    15. Cho, Haeran & Fryzlewicz, Piotr, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," LSE Research Online Documents on Economics 57147, London School of Economics and Political Science, LSE Library.
    16. Tengyao Wang & Richard J. Samworth, 2018. "High dimensional change point estimation via sparse projection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 57-83, January.
    17. Yangfan Zhang & Runmin Wang & Xiaofeng Shao, 2022. "Adaptive Inference for Change Points in High-Dimensional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(540), pages 1751-1762, October.
    18. Y. Mei, 2010. "Efficient scalable schemes for monitoring a large number of data streams," Biometrika, Biometrika Trust, vol. 97(2), pages 419-433.
    19. Fryzlewicz, Piotr, 2014. "Wild binary segmentation for multiple change-point detection," LSE Research Online Documents on Economics 57146, London School of Economics and Political Science, LSE Library.
    20. Wenguang Sun & T. Tony Cai, 2009. "Large‐scale multiple testing under dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 393-424, April.
    21. Bai, Jushan & Han, Xu & Shi, Yutang, 2020. "Estimation and inference of change points in high-dimensional factor models," Journal of Econometrics, Elsevier, vol. 219(1), pages 66-100.
    22. Efron, Bradley, 2007. "Correlation and Large-Scale Simultaneous Significance Testing," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 93-103, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Bin & Zhang, Xinsheng & Liu, Yufeng, 2022. "High dimensional change point inference: Recent developments and extensions," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    3. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    4. Bai, Jushan & Duan, Jiangtao & Han, Xu, 2024. "The likelihood ratio test for structural changes in factor models," Journal of Econometrics, Elsevier, vol. 238(2).
    5. Ma, Chenchen & Tu, Yundong, 2023. "Group fused Lasso for large factor models with multiple structural breaks," Journal of Econometrics, Elsevier, vol. 233(1), pages 132-154.
    6. Duan, Jiangtao & Bai, Jushan & Han, Xu, 2023. "Quasi-maximum likelihood estimation of break point in high-dimensional factor models," Journal of Econometrics, Elsevier, vol. 233(1), pages 209-236.
    7. Chen, Likai & Wang, Weining & Wu, Wei Biao, 2019. "Inference of Break-Points in High-Dimensional Time Series," IRTG 1792 Discussion Papers 2019-013, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    8. Jiang, Feiyu & Zhao, Zifeng & Shao, Xiaofeng, 2023. "Time series analysis of COVID-19 infection curve: A change-point perspective," Journal of Econometrics, Elsevier, vol. 232(1), pages 1-17.
    9. Wang, Lu & Wu, Jianhong, 2022. "Estimation of high-dimensional factor models with multiple structural changes," Economic Modelling, Elsevier, vol. 108(C).
    10. Alessandro Casini & Pierre Perron, 2018. "Structural Breaks in Time Series," Boston University - Department of Economics - Working Papers Series WP2019-02, Boston University - Department of Economics.
    11. Jiang, Feiyu & Wang, Runmin & Shao, Xiaofeng, 2023. "Robust inference for change points in high dimension," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    12. Thomas Despois & Catherine Doz, 2021. "Identifying and interpreting the factors in factor models via sparsity: Different approaches," Working Papers halshs-02235543, HAL.
    13. Pang, Tianxiao & Du, Lingjie & Chong, Terence Tai-Leung, 2021. "Estimating multiple breaks in nonstationary autoregressive models," Journal of Econometrics, Elsevier, vol. 221(1), pages 277-311.
    14. Thomas Despois & Catherine Doz, 2021. "Identifying and interpreting the factors in factor models via sparsity: Different approaches," PSE Working Papers halshs-02235543, HAL.
    15. Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
    16. Holger Dette & Theresa Eckle & Mathias Vetter, 2020. "Multiscale change point detection for dependent data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1243-1274, December.
    17. Abhimanyu Gupta & Myung Hwan Seo, 2023. "Robust Inference on Infinite and Growing Dimensional Time‐Series Regression," Econometrica, Econometric Society, vol. 91(4), pages 1333-1361, July.
    18. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    19. Smith, Simon C. & Timmermann, Allan & Zhu, Yinchu, 2019. "Variable selection in panel models with breaks," Journal of Econometrics, Elsevier, vol. 212(1), pages 323-344.
    20. Chen, Sanpan & Cui, Guowei & Zhang, Jianhua, 2017. "On testing for structural break of coefficients in factor-augmented regression models," Economics Letters, Elsevier, vol. 161(C), pages 141-145.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:182:y:2023:i:c:s0167947323000166. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.