IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v7y2024i2p28-480d1393471.html
   My bibliography  Save this article

Multivariate Time Series Change-Point Detection with a Novel Pearson-like Scaled Bregman Divergence

Author

Listed:
  • Tong Si

    (Department of Mathematics and Statistics, Saint Louis University, St. Louis, MO 63103, USA)

  • Yunge Wang

    (Department of Mathematics and Statistics, Saint Louis University, St. Louis, MO 63103, USA)

  • Lingling Zhang

    (Department of Mathematics and Statistics, University at Albany SUNY, Albany, NY 12222, USA)

  • Evan Richmond

    (Department of Mathematics and Statistics, Saint Louis University, St. Louis, MO 63103, USA)

  • Tae-Hyuk Ahn

    (Department of Computer Science, Saint Louis University, St. Louis, MO 63103, USA)

  • Haijun Gong

    (Department of Mathematics and Statistics, Saint Louis University, St. Louis, MO 63103, USA)

Abstract

Change-point detection is a challenging problem that has a number of applications across various real-world domains. The primary objective of CPD is to identify specific time points where the underlying system undergoes transitions between different states, each characterized by its distinct data distribution. Precise identification of change points in time series omics data can provide insights into the dynamic and temporal characteristics inherent to complex biological systems. Many change-point detection methods have traditionally focused on the direct estimation of data distributions. However, these approaches become unrealistic in high-dimensional data analysis. Density ratio methods have emerged as promising approaches for change-point detection since estimating density ratios is easier than directly estimating individual densities. Nevertheless, the divergence measures used in these methods may suffer from numerical instability during computation. Additionally, the most popular α -relative Pearson divergence cannot measure the dissimilarity between two distributions of data but a mixture of distributions. To overcome the limitations of existing density ratio-based methods, we propose a novel approach called the Pearson-like scaled-Bregman divergence-based (PLsBD) density ratio estimation method for change-point detection. Our theoretical studies derive an analytical expression for the Pearson-like scaled Bregman divergence using a mixture measure. We integrate the PLsBD with a kernel regression model and apply a random sampling strategy to identify change points in both synthetic data and real-world high-dimensional genomics data of Drosophila. Our PLsBD method demonstrates superior performance compared to many other change-point detection methods.

Suggested Citation

  • Tong Si & Yunge Wang & Lingling Zhang & Evan Richmond & Tae-Hyuk Ahn & Haijun Gong, 2024. "Multivariate Time Series Change-Point Detection with a Novel Pearson-like Scaled Bregman Divergence," Stats, MDPI, vol. 7(2), pages 1-19, May.
  • Handle: RePEc:gam:jstats:v:7:y:2024:i:2:p:28-480:d:1393471
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/7/2/28/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/7/2/28/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hajra Siddiqa & Sajid Ali & Ismail Shah, 2021. "Most recent changepoint detection in censored panel data," Computational Statistics, Springer, vol. 36(1), pages 515-540, March.
    2. Tengyao Wang & Richard J. Samworth, 2018. "High dimensional change point estimation via sparse projection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 57-83, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steland, Ansgar, 2020. "Testing and estimating change-points in the covariance matrix of a high-dimensional time series," Journal of Multivariate Analysis, Elsevier, vol. 177(C).
    2. Li, Degui, 2024. "Estimation of Large Dynamic Covariance Matrices: A Selective Review," Econometrics and Statistics, Elsevier, vol. 29(C), pages 16-30.
    3. Chen, Likai & Wang, Weining & Wu, Wei Biao, 2019. "Inference of Break-Points in High-Dimensional Time Series," IRTG 1792 Discussion Papers 2019-013, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    4. Cho, Haeran & Korkas, Karolos K., 2022. "High-dimensional GARCH process segmentation with an application to Value-at-Risk," Econometrics and Statistics, Elsevier, vol. 23(C), pages 187-203.
    5. Jiang, Feiyu & Wang, Runmin & Shao, Xiaofeng, 2023. "Robust inference for change points in high dimension," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    6. Abhimanyu Gupta & Myung Hwan Seo, 2023. "Robust Inference on Infinite and Growing Dimensional Time‐Series Regression," Econometrica, Econometric Society, vol. 91(4), pages 1333-1361, July.
    7. Haeran Cho & Claudia Kirch, 2022. "Two-stage data segmentation permitting multiscale change points, heavy tails and dependence," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(4), pages 653-684, August.
    8. Bin Liu & Cheng Zhou & Xinsheng Zhang & Yufeng Liu, 2020. "A unified data‐adaptive framework for high dimensional change point detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(4), pages 933-963, September.
    9. Pang, Tianxiao & Du, Lingjie & Chong, Terence Tai-Leung, 2021. "Estimating multiple breaks in nonstationary autoregressive models," Journal of Econometrics, Elsevier, vol. 221(1), pages 277-311.
    10. Aaron Paul Lowther & Rebecca Killick & Idris Arthur Eckley, 2023. "Detecting changes in mixed‐sampling rate data sequences," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    11. Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
    12. Hajra Siddiqa & Sajid Ali & Ismail Shah, 2021. "Most recent changepoint detection in censored panel data," Computational Statistics, Springer, vol. 36(1), pages 515-540, March.
    13. Ping‐Shou Zhong & Jun Li & Piotr Kokoszka, 2021. "Multivariate analysis of variance and change points estimation for high‐dimensional longitudinal data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 375-405, June.
    14. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    15. Chen, Cathy Yi-hsuan & Okhrin, Yarema & Wang, Tengyao, 2022. "Monitoring network changes in social media," LSE Research Online Documents on Economics 113742, London School of Economics and Political Science, LSE Library.
    16. Chen, Cathy Yi-Hsuan & Härdle, Wolfgang Karl & Klochkov, Yegor, 2019. "SONIC: SOcial Network with Influencers and Communities," IRTG 1792 Discussion Papers 2019-025, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Fryzlewicz, Piotr, 2020. "Detecting possibly frequent change-points: Wild Binary Segmentation 2 and steepest-drop model selection," LSE Research Online Documents on Economics 103430, London School of Economics and Political Science, LSE Library.
    18. Yudong Chen & Tengyao Wang & Richard J. Samworth, 2022. "High‐dimensional, multiscale online changepoint detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 234-266, February.
    19. Chen, Yudong & Wang, Tengyao & Samworth, Richard J., 2022. "High-dimensional, multiscale online changepoint detection," LSE Research Online Documents on Economics 113665, London School of Economics and Political Science, LSE Library.
    20. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:7:y:2024:i:2:p:28-480:d:1393471. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.