HydroGEM: A Self Supervised Zero Shot Hybrid TCN Transformer Foundation Model for Continental Scale Streamflow Quality Control
Ijaz Ul Haq, Byung Suk Lee, Julia N. Perdrial, David Baude

TL;DR
HydroGEM is a self-supervised hybrid TCN-Transformer model that effectively detects and reconstructs streamflow anomalies at a continental scale, supporting hydrological quality control with high accuracy and cross-national generalization.
Contribution
This paper introduces HydroGEM, a novel foundation model combining TCN and Transformer architectures trained on large-scale hydrological data for improved streamflow anomaly detection.
Findings
Achieves F1=0.792 for anomaly detection on USGS data.
Reduces reconstruction error by 68.7%.
Demonstrates cross-national generalization with Tolerant F1=0.70.
Abstract
Advances in sensor networks have enabled real-time stream discharge monitoring, yet persistent sensor malfunctions limit data utility. Manual quality control by expert hydrologists cannot scale with networks generating millions of measurements annually. We introduce HydroGEM, a foundation model for continental-scale streamflow quality control designed to support human expertise. HydroGEM uses self-supervised pretraining on 6.03 million clean sequences from 3,724 USGS stations to learn general hydrological representations, followed by fine-tuning with synthetic anomalies for detection and reconstruction. A hybrid TCN-Transformer architecture (14.2M parameters) captures both local and long-range temporal dependencies, while hierarchical normalization handles six orders of magnitude in discharge. On held-out observations from 799 stations with 18 synthetic anomaly types grounded in USGS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnvironmental Monitoring and Data Management · Flood Risk Assessment and Management · Hydrological Forecasting Using AI
