IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v17y2025i1d10.1007_s12561-023-09384-8.html
   My bibliography  Save this article

Simultaneous Denoising and Heterogeneity Learning for Time Series Data

Author

Listed:
  • Xiwen Jiang

    (University of California, Irvine)

  • Weining Shen

    (University of California, Irvine)

Abstract

Noisy time series data are often collected in biomedical applications, and it remains an important task to understand the data heterogeneity. We propose an approach that combines the strength of trend filtering and distance-based clustering to simultaneously perform temporal mean denoising and subject-level clustering. We discuss an iterative algorithm that efficiently computes the cluster structure and clusterwise mean trends. Simulation studies confirm the excellent numerical performance of our method. We further consider two data application examples including an U.S. lung cancer mortality study and a suicide rate study.

Suggested Citation

  • Xiwen Jiang & Weining Shen, 2025. "Simultaneous Denoising and Heterogeneity Learning for Time Series Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 17(1), pages 62-77, April.
  • Handle: RePEc:spr:stabio:v:17:y:2025:i:1:d:10.1007_s12561-023-09384-8
    DOI: 10.1007/s12561-023-09384-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-023-09384-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-023-09384-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Weibin Mo & Zhengling Qi & Yufeng Liu, 2021. "Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(534), pages 699-707, April.
    2. Robert Tibshirani & Guenther Walther & Trevor Hastie, 2001. "Estimating the number of clusters in a data set via the gap statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(2), pages 411-423.
    3. Zhengling Qi & Dacheng Liu & Haoda Fu & Yufeng Liu, 2020. "Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 678-691, April.
    4. Hempstead, Katherine, 2006. "The geography of self-injury: Spatial patterns in attempted and completed suicide," Social Science & Medicine, Elsevier, vol. 62(12), pages 3186-3196, June.
    5. Robert Tibshirani & Michael Saunders & Saharon Rosset & Ji Zhu & Keith Knight, 2005. "Sparsity and smoothness via the fused lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(1), pages 91-108, February.
    6. Weibin Mo & Zhengling Qi & Yufeng Liu, 2021. "Learning Optimal Distributionally Robust Individualized Treatment Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(534), pages 659-674, April.
    7. Xu Gao & Weining Shen & Jing Ning & Ziding Feng & Jianhua Hu, 2022. "Addressing patient heterogeneity in disease predictive model development," Biometrics, The International Biometric Society, vol. 78(3), pages 1045-1055, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hou, Zhaohan & Wang, Lei, 2024. "Heterogeneous quantile regression for longitudinal data with subgroup structures," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
    2. Pan Zhao & Yifan Cui, 2023. "A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning," Papers 2310.09545, arXiv.org.
    3. Lee, Kyu Ha & Chakraborty, Sounak & Sun, Jianguo, 2017. "Variable selection for high-dimensional genomic data with censored outcomes using group lasso prior," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 1-13.
    4. Zhu, Jin & Wan, Runzhe & Qi, Zhengling & Luo, Shikai & Shi, Chengchun, 2024. "Robust offline reinforcement learning with heavy-tailed rewards," LSE Research Online Documents on Economics 122740, London School of Economics and Political Science, LSE Library.
    5. Gao, Yuhe & Shi, Chengchun & Song, Rui, 2023. "Deep spectral Q-learning with application to mobile health," LSE Research Online Documents on Economics 119445, London School of Economics and Political Science, LSE Library.
    6. Peter Radchenko & Gourab Mukherjee, 2017. "Convex clustering via l 1 fusion penalization," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1527-1546, November.
    7. Daido Kido, 2023. "Locally Asymptotically Minimax Statistical Treatment Rules Under Partial Identification," Papers 2311.08958, arXiv.org.
    8. Kohei Yata, 2021. "Optimal Decision Rules Under Partial Identification," Papers 2111.04926, arXiv.org, revised Mar 2025.
    9. Daido Kido, 2022. "Distributionally Robust Policy Learning with Wasserstein Distance," Papers 2205.04637, arXiv.org, revised Aug 2022.
    10. Christopher Adjaho & Timothy Christensen, 2022. "Externally Valid Policy Choice," Papers 2205.05561, arXiv.org, revised Jul 2023.
    11. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    12. Mkhadri, Abdallah & Ouhourane, Mohamed, 2013. "An extended variable inclusion and shrinkage algorithm for correlated variables," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 631-644.
    13. Thiemo Fetzer & Samuel Marden, 2017. "Take What You Can: Property Rights, Contestability and Conflict," Economic Journal, Royal Economic Society, vol. 0(601), pages 757-783, May.
    14. Bailly, Hugo & Mortier, Frédéric & Giraud, Gaël, 2024. "Empirical analysis of a debt-augmented Goodwin model for the United States," Structural Change and Economic Dynamics, Elsevier, vol. 70(C), pages 619-633.
    15. Khanh Duong, 2024. "Is meritocracy just? New evidence from Boolean analysis and Machine learning," Journal of Computational Social Science, Springer, vol. 7(2), pages 1795-1821, October.
    16. Daniel Agness & Travis Baseler & Sylvain Chassang & Pascaline Dupas & Erik Snowberg, 2022. "Valuing the Time of the Self-Employed," CESifo Working Paper Series 9567, CESifo.
    17. Angelo Milfont & Alvaro Veiga, 2025. "Structural breaks detection and variable selection in dynamic linear regression via the Iterative Fused LASSO in high dimension," Papers 2502.20816, arXiv.org, revised Apr 2025.
    18. Orietta Nicolis & Jean Paul Maidana & Fabian Contreras & Danilo Leal, 2024. "Analyzing the Impact of COVID-19 on Economic Sustainability: A Clustering Approach," Sustainability, MDPI, vol. 16(4), pages 1-30, February.
    19. Li, Houjian & Tang, Mengqian & Cao, Andi & Guo, Lili, 2024. "How to reduce firm pollution discharges: Does political leaders' gender matter?," Technological Forecasting and Social Change, Elsevier, vol. 204(C).
    20. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:17:y:2025:i:1:d:10.1007_s12561-023-09384-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.