IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-61280-5.html
   My bibliography  Save this article

Early detection of emerging SARS-CoV-2 Variants from wastewater through genome sequencing and machine learning

Author

Listed:
  • Xiaowei Zhuang

    (University of Nevada Las Vegas
    University of Nevada Las Vegas
    Cleveland Clinic Lou Ruvo Center for Brain Health)

  • Van Vo

    (University of Nevada Las Vegas)

  • Michael A. Moshi

    (University of Nevada Las Vegas
    University of Nevada Las Vegas)

  • Ketan Dhede

    (University of Nevada Las Vegas
    University of Nevada Las Vegas)

  • Nabih Ghani

    (University of Nevada Las Vegas)

  • Shahraiz Akbar

    (University of Nevada Las Vegas)

  • Ching-Lan Chang

    (University of Nevada Las Vegas
    University of Nevada Las Vegas)

  • Angelia K. Young

    (Southern Nevada Health District)

  • Erin Buttery

    (Southern Nevada Health District)

  • William Bendik

    (Southern Nevada Health District)

  • Hong Zhang

    (Southern Nevada Health District)

  • Salman Afzal

    (Southern Nevada Health District)

  • Duane Moser

    (Desert Research Institute)

  • Dietmar Cordes

    (Cleveland Clinic Lou Ruvo Center for Brain Health)

  • Cassius Lockett

    (Southern Nevada Health District)

  • Daniel Gerrity

    (P.O. Box 99954)

  • Horng-Yuan Kan

    (Southern Nevada Health District)

  • Edwin C. Oh

    (University of Nevada Las Vegas
    University of Nevada Las Vegas
    University of Nevada Las Vegas
    University of Nevada Las Vegas)

Abstract

Genome sequencing from wastewater enables accurate and cost-effective identification of SARS-CoV-2 variants. However, existing computational pipelines have limitations in detecting emerging variants not yet characterized in humans. Here, we present an unsupervised learning approach that clusters co-varying and time-evolving mutation patterns to identify SARS-CoV-2 variants. To build our model, we sequence 3659 wastewater samples collected over two years from urban and rural locations in Southern Nevada. We then develop a multivariate independent component analysis (ICA)-based pipeline to transform mutation frequencies into independent sources. These data-driven time-evolving and co-varying sources are compared to 8810 SARS-CoV-2 clinical genomes from Nevadans. Our method accurately detects the Delta variant in late 2021, Omicron variants in 2022, and emerging recombinant XBB variants in 2023. Our approach also reveals the spatial and temporal dynamics of variants in both urban and rural regions; achieves earlier detection of most variants compared to other computational tools; and uncovers unique co-varying mutation patterns not associated with any known variant. The multivariate nature of our pipeline boosts statistical power and supports accurate early detection of SARS-CoV-2 variants. This feature offers a unique opportunity to detect emerging variants and pathogens, even in the absence of clinical testing.

Suggested Citation

  • Xiaowei Zhuang & Van Vo & Michael A. Moshi & Ketan Dhede & Nabih Ghani & Shahraiz Akbar & Ching-Lan Chang & Angelia K. Young & Erin Buttery & William Bendik & Hong Zhang & Salman Afzal & Duane Moser &, 2025. "Early detection of emerging SARS-CoV-2 Variants from wastewater through genome sequencing and machine learning," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61280-5
    DOI: 10.1038/s41467-025-61280-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-61280-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-61280-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nicolae Sapoval & Yunxi Liu & Esther G. Lou & Loren Hopkins & Katherine B. Ensor & Rebecca Schneider & Lauren B. Stadler & Todd J. Treangen, 2023. "Enabling accurate and early detection of recently emerged SARS-CoV-2 variants of concern in wastewater," Nature Communications, Nature, vol. 14(1), pages 1-7, December.
    2. Jones, Malia & Bhattar, Mahima & Henning, Emma & Monnat, Shannon M., 2023. "Explaining the U.S. rural disadvantage in COVID-19 case and death rates during the Delta-Omicron surge: The role of politics, vaccinations, population health, and social determinants," Social Science & Medicine, Elsevier, vol. 335(C).
    3. Fan Wu & Su Zhao & Bin Yu & Yan-Mei Chen & Wen Wang & Zhi-Gang Song & Yi Hu & Zhao-Wu Tao & Jun-Hua Tian & Yuan-Yuan Pei & Ming-Li Yuan & Yu-Ling Zhang & Fa-Hui Dai & Yi Liu & Qi-Min Wang & Jiao-Jiao , 2020. "A new coronavirus associated with human respiratory disease in China," Nature, Nature, vol. 579(7798), pages 265-269, March.
    4. Fan Wu & Su Zhao & Bin Yu & Yan-Mei Chen & Wen Wang & Zhi-Gang Song & Yi Hu & Zhao-Wu Tao & Jun-Hua Tian & Yuan-Yuan Pei & Ming-Li Yuan & Yu-Ling Zhang & Fa-Hui Dai & Yi Liu & Qi-Min Wang & Jiao-Jiao , 2020. "Author Correction: A new coronavirus associated with human respiratory disease in China," Nature, Nature, vol. 580(7803), pages 7-7, April.
    5. Petra Mlcochova & Steven A. Kemp & Mahesh Shanker Dhar & Guido Papa & Bo Meng & Isabella A. T. M. Ferreira & Rawlings Datir & Dami A. Collier & Anna Albecka & Sujeet Singh & Rajesh Pandey & Jonathan B, 2021. "SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion," Nature, Nature, vol. 599(7883), pages 114-119, November.
    6. Smruthi Karthikeyan & Joshua I. Levy & Peter Hoff & Greg Humphrey & Amanda Birmingham & Kristen Jepsen & Sawyer Farmer & Helena M. Tubb & Tommy Valles & Caitlin E. Tribelhorn & Rebecca Tsai & Stefan A, 2022. "Wastewater sequencing reveals early cryptic SARS-CoV-2 variant transmission," Nature, Nature, vol. 609(7925), pages 101-108, September.
    7. Juan Li & Shengjie Lai & George F. Gao & Weifeng Shi, 2021. "The emergence, genomic diversity and global spread of SARS-CoV-2," Nature, Nature, vol. 600(7889), pages 408-418, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Geng Liu & Wenya Du & Xiongbo Sang & Qiyu Tong & Ye Wang & Guoqing Chen & Yi Yuan & Lili Jiang & Wei Cheng & Dan Liu & Yan Tian & Xianghui Fu, 2022. "RNA G-quadruplex in TMPRSS2 reduces SARS-CoV-2 infection," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    2. Giulia Orilisi & Marco Mascitti & Lucrezia Togni & Riccardo Monterubbianesi & Vincenzo Tosco & Flavia Vitiello & Andrea Santarelli & Angelo Putignano & Giovanna Orsini, 2021. "Oral Manifestations of COVID-19 in Hospitalized Patients: A Systematic Review," IJERPH, MDPI, vol. 18(23), pages 1-19, November.
    3. David Gomez-Zepeda & Danielle Arnold-Schild & Julian Beyrle & Arthur Declercq & Ralf Gabriels & Elena Kumm & Annica Preikschat & Mateusz Krzysztof Łącki & Aurélie Hirschler & Jeewan Babu Rijal & Chris, 2024. "Thunder-DDA-PASEF enables high-coverage immunopeptidomics and is boosted by MS2Rescore with MS2PIP timsTOF fragmentation prediction model," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    4. José M. Núñez-Sánchez & Jesús Molina-Gómez & Pere Mercadé-Melé & Santiago Almadana-Abón, 2024. "Boosting Competitiveness Through the Alignment of Corporate Social Responsibility, Strategic Management and Compensation Systems in Technology Companies: A Case Study," Sustainability, MDPI, vol. 16(21), pages 1-15, October.
    5. Alessandro Germani & Livia Buratta & Elisa Delvecchio & Claudia Mazzeschi, 2020. "Emerging Adults and COVID-19: The Role of Individualism-Collectivism on Perceived Risks and Psychological Maladjustment," IJERPH, MDPI, vol. 17(10), pages 1-15, May.
    6. Gabriela Dias Noske & Yun Song & Rafaela Sachetto Fernandes & Rod Chalk & Haitem Elmassoudi & Lizbé Koekemoer & C. David Owen & Tarick J. El-Baba & Carol V. Robinson & Glaucius Oliva & Andre Schutzer , 2023. "An in-solution snapshot of SARS-COV-2 main protease maturation process and inhibition," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    7. Karthikeyan Dhamotharan & Sophie M. Korn & Anna Wacker & Matthias A. Becker & Sebastian Günther & Harald Schwalbe & Andreas Schlundt, 2024. "A core network in the SARS-CoV-2 nucleocapsid NTD mediates structural integrity and selective RNA-binding," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    8. Eugene Song & Jae-Eun Lee & Seola Kwon, 2021. "Effect of Public Empathy with Infection-Control Guidelines on Infection-Prevention Attitudes and Behaviors: Based on the Case of COVID-19," IJERPH, MDPI, vol. 18(24), pages 1-18, December.
    9. Kow-Tong Chen, 2022. "Emerging Infectious Diseases and One Health: Implication for Public Health," IJERPH, MDPI, vol. 19(15), pages 1-4, July.
    10. Shujuan Li & Lingli Zhu & Lidan Zhang & Guoyan Zhang & Hongyan Ren & Liang Lu, 2023. "Urbanization-Related Environmental Factors and Hemorrhagic Fever with Renal Syndrome: A Review Based on Studies Taken in China," IJERPH, MDPI, vol. 20(4), pages 1-20, February.
    11. Umit Cirakli & Ibrahim Dogan & Mehmet Gozlu, 2022. "The Relationship Between COVID-19 Cases and COVID-19 Testing: a Panel Data Analysis on OECD Countries," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 13(3), pages 1737-1750, September.
    12. Neeltje van Doremalen & Jonathan E. Schulz & Danielle R. Adney & Taylor A. Saturday & Robert J. Fischer & Claude Kwe Yinda & Nazia Thakur & Joseph Newman & Marta Ulaszewska & Sandra Belij-Rammerstorfe, 2022. "ChAdOx1 nCoV-19 (AZD1222) or nCoV-19-Beta (AZD2816) protect Syrian hamsters against Beta Delta and Omicron variants," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    13. Jaeyong Lee & Calem Kenward & Liam J. Worrall & Marija Vuckovic & Francesco Gentile & Anh-Tien Ton & Myles Ng & Artem Cherkasov & Natalie C. J. Strynadka & Mark Paetzel, 2022. "X-ray crystallographic characterization of the SARS-CoV-2 main protease polyprotein cleavage sites essential for viral processing and maturation," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    14. Seán R. O’Connor & Charlene Treanor & Elizabeth Ward & Robin A. Wickens & Abby O’Connell & Lucy A. Culliford & Chris A. Rogers & Eleanor A. Gidman & Tunde Peto & Paul C. Knox & Benjamin J. L. Burton &, 2022. "The COVID-19 Pandemic and Ophthalmic Care: A Qualitative Study of Patients with Neovascular Age-Related Macular Degeneration (nAMD)," IJERPH, MDPI, vol. 19(15), pages 1-10, August.
    15. Maria de Lourdes Aguiar-Oliveira & Aline Campos & Aline R. Matos & Caroline Rigotto & Adriana Sotero-Martins & Paulo F. P. Teixeira & Marilda M. Siqueira, 2020. "Wastewater-Based Epidemiology (WBE) and Viral Detection in Polluted Surface Water: A Valuable Tool for COVID-19 Surveillance—A Brief Review," IJERPH, MDPI, vol. 17(24), pages 1-19, December.
    16. August F. Jernbom & Lovisa Skoglund & Elisa Pin & Ronald Sjöberg & Hanna Tegel & Sophia Hober & Elham Rostami & Annica Rasmusson & Janet L. Cunningham & Sebastian Havervall & Charlotte Thålin & Anna M, 2024. "Prevalent and persistent new-onset autoantibodies in mild to severe COVID-19," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    17. Wasim Ahmed & Josep Vidal-Alaball & Francesc Lopez Segui & Pedro A. Moreno-Sánchez, 2020. "A Social Network Analysis of Tweets Related to Masks during the COVID-19 Pandemic," IJERPH, MDPI, vol. 17(21), pages 1-9, November.
    18. Ben Zhang & Chenxu Ming, 2023. "Digital Transformation and Open Innovation Planning of Response to COVID-19 Outbreak: A Systematic Literature Review and Future Research Agenda," IJERPH, MDPI, vol. 20(3), pages 1-26, February.
    19. Yongin Choi & James Slghee Kim & Heejin Choi & Hyojung Lee & Chang Hyeong Lee, 2020. "Assessment of Social Distancing for Controlling COVID-19 in Korea: An Age-Structured Modeling Approach," IJERPH, MDPI, vol. 17(20), pages 1-16, October.
    20. Abdel-Salam G. Abdel-Salam & Edward L. Boone & Ryad Ghanam, 2024. "Multivariate Techniques for Monitoring Susceptible, Exposed, Infected, Recovered, Death, and Vaccination Model Parameters for the COVID-19 Pandemic for Qatar," IJERPH, MDPI, vol. 21(12), pages 1-20, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61280-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.