IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0310194.html
   My bibliography  Save this article

Sequential label shift detection in classification data: An application to dengue fever

Author

Listed:
  • Ciaran Evans
  • Max G’Sell

Abstract

Classifiers have been developed to help diagnose dengue fever in patients presenting with febrile symptoms. However, classifier predictions often rely on the assumption that new observations come from the same distribution as training data. If the population prevalence of dengue changes, as would happen with a dengue outbreak, it is important to raise an alarm as soon as possible, so that appropriate public health measures can be taken and also so that the classifier can be re-calibrated. In this paper, we consider the problem of detecting such a change in distribution in sequentially-observed, unlabeled classification data. We focus on label shift changes to the distribution, where the class priors shift but the class conditional distributions remain unchanged. We reduce this problem to the problem of detecting a change in the one-dimensional classifier scores, leading to simple nonparametric sequential changepoint detection procedures. Our procedures leverage classifier training data to estimate the detection statistic, and converge to their parametric counterparts in the size of the training data. In simulated outbreaks with real dengue data, we show that our method outperforms other detection procedures in this label shift setting.

Suggested Citation

  • Ciaran Evans & Max G’Sell, 2024. "Sequential label shift detection in classification data: An application to dengue fever," PLOS ONE, Public Library of Science, vol. 19(9), pages 1-16, September.
  • Handle: RePEc:plo:pone00:0310194
    DOI: 10.1371/journal.pone.0310194
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0310194
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0310194&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0310194?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David McDonald, 1990. "A cusum procedure based on sequential ranks," Naval Research Logistics (NRL), John Wiley & Sons, vol. 37(5), pages 627-646, October.
    2. Ross, Gordon J., 2015. "Parametric and Nonparametric Sequential Change Detection in R: The cpm Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 66(i03).
    3. Oscar Hernan Madrid Padilla & Alex Athey & Alex Reinhart & James G. Scott, 2019. "Sequential Nonparametric Tests for a Change in Distribution: An Application to Detecting Radiological Anomalies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 514-528, April.
    4. Tzong-Shiann Ho & Ting-Chia Weng & Jung-Der Wang & Hsieh-Cheng Han & Hao-Chien Cheng & Chun-Chieh Yang & Chih-Hen Yu & Yen-Jung Liu & Chien Hsiang Hu & Chun-Yu Huang & Ming-Hong Chen & Chwan-Chuen Kin, 2020. "Comparing machine learning with case-control models to identify confirmed dengue cases," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 14(11), pages 1-21, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Park, Beum-Jo, 2022. "The COVID-19 pandemic, volatility, and trading behavior in the bitcoin futures market," Research in International Business and Finance, Elsevier, vol. 59(C).
    2. Austin, Edward & Romano, Gaetano & Eckley, Idris A. & Fearnhead, Paul, 2023. "Online non-parametric changepoint detection with application to monitoring operational performance of network devices," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
    3. Andreas Anastasiou & Piotr Fryzlewicz, 2022. "Detecting multiple generalized change-points by isolating single ones," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(2), pages 141-174, February.
    4. Arjun Prakash & Nick James & Max Menzies & Gilad Francis, 2020. "Structural clustering of volatility regimes for dynamic trading strategies," Papers 2004.09963, arXiv.org, revised Nov 2021.
    5. Lombard, F. & Hawkins, Douglas M. & Potgieter, Cornelis J., 2017. "Sequential rank CUSUM charts for angular data," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 268-279.
    6. Yanlin Shi, 2023. "Long memory and regime switching in the stochastic volatility modelling," Annals of Operations Research, Springer, vol. 320(2), pages 999-1020, January.
    7. Corbet, Shaen & Lucey, Brian & Peat, Maurice & Vigne, Samuel, 2018. "Bitcoin Futures—What use are they?," Economics Letters, Elsevier, vol. 172(C), pages 23-27.
    8. repec:osf:osfxxx:fzqxv_v1 is not listed on IDEAS
    9. James, Nick & Menzies, Max & Chan, Jennifer, 2021. "Changes to the extreme and erratic behaviour of cryptocurrencies during COVID-19," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 565(C).
    10. James, Nick, 2021. "Dynamics, behaviours, and anomaly persistence in cryptocurrencies and equities surrounding COVID-19," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 570(C).
    11. Baolong Ying & Qijing Yan & Zehua Chen & Jinchao Du, 2024. "A sequential feature selection approach to change point detection in mean-shift change point models," Statistical Papers, Springer, vol. 65(6), pages 3893-3915, August.
    12. Yanhong Wu, 1996. "A less sensitive linear detector for the change point based on kernel smoothing method," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 43(1), pages 43-55, December.
    13. Gian Luca Vriz & Luigi Grossi, 2024. "Green bubbles: a four-stage paradigm for detection and propagation," Papers 2410.06564, arXiv.org.
    14. Lembris Laanyuni Njotto & Wilfred Senyoni & Ottmar Cronie & Michael Alifrangis & Anna-Sofie Stensgaard, 2024. "Quantitative modelling for dengue and Aedes mosquitoes in Africa: A systematic review of current approaches and future directions for Early Warning System development," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 18(11), pages 1-22, November.
    15. Chao Liu & Qingjie Xu & Ya Fan & Hao Wu & Jian Chen & Peng Lin, 2023. "Detection of Abnormal Data in GNSS Coordinate Series Based on an Improved Cumulative Sum," Sustainability, MDPI, vol. 15(9), pages 1-16, April.
    16. Hang Xu & Philip L.H. Yu & Mayer Alvo, 2019. "Detecting change points in the stress‐strength reliability P(X," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(3), pages 837-857, May.
    17. Brice B. Hanberry, 2021. "Timing of Tree Density Increases, Influence of Climate Change, and a Land Use Proxy for Tree Density Increases in the Eastern United States," Land, MDPI, vol. 10(11), pages 1-17, October.
    18. Peter M C Harrison & Roberta Bianco & Maria Chait & Marcus T Pearce, 2020. "PPM-Decay: A computational model of auditory prediction with memory decay," PLOS Computational Biology, Public Library of Science, vol. 16(11), pages 1-41, November.
    19. Lindeløv, Jonas Kristoffer, 2020. "mcp: An R Package for Regression With Multiple Change Points," OSF Preprints fzqxv, Center for Open Science.
    20. Magda Monteiro & Marco Costa, 2023. "Change Point Detection by State Space Modeling of Long-Term Air Temperature Series in Europe," Stats, MDPI, vol. 6(1), pages 1-18, January.
    21. Nora M. Villanueva & Marta Sestelo & Miguel M. Fonseca & Javier Roca-Pardiñas, 2023. "seq2R: An R Package to Detect Change Points in DNA Sequences," Mathematics, MDPI, vol. 11(10), pages 1-20, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0310194. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.