Score: 0

Predicting COVID-19 Prevalence Using Wastewater RNA Surveillance: A Semi-Supervised Learning Approach with Temporal Feature Trust

Published: November 27, 2025 | arXiv ID: 2512.00100v1

By: Yifei Chen, Eric Liang

Potential Business Impact:

Tracks COVID-19 spread using sewer water.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

As COVID-19 transitions into an endemic disease that remains constantly present in the population at a stable level, monitoring its prevalence without invasive measures becomes increasingly important. In this paper, we present a deep neural network estimator for the COVID-19 daily case count based on wastewater surveillance data and other confounding factors. This work builds upon the study by Jiang, Kolozsvary, and Li (2024), which connects the COVID-19 case counts with testing data collected early in the pandemic. Using the COVID-19 testing data and the wastewater surveillance data during the period when both data were highly reliable, one can train an artificial neural network that learns the nonlinear relation between the COVID-19 daily case count and the wastewater viral RNA concentration. From a machine learning perspective, the main challenge lies in addressing temporal feature reliability, as the training data has different reliability over different time periods.

Country of Origin
🇺🇸 United States

Page Count
24 pages

Category
Quantitative Biology:
Quantitative Methods