Spatiotemporal Classification with limited labels using Constrained Clustering for large datasets
Praveen Ravirathinam, Rahul Ghosh, Ke Wang, Keyang Xuan, Ankush, Khandelwal, Hilary Dugan, Paul Hanson, Vipin Kumar

TL;DR
This paper introduces a spatiotemporal constrained clustering method that leverages limited labels to learn separable representations from large datasets, demonstrated on a global lakes dataset for improved classification and sample labeling.
Contribution
The paper proposes a novel spatiotemporal clustering approach with constrained loss that effectively utilizes few labels to enhance representation learning on large datasets.
Findings
Spatiotemporal representation outperforms spatial or temporal alone.
Constrained loss improves representation with limited labels.
Method aids in selecting new labeled samples for better classification.
Abstract
Creating separable representations via representation learning and clustering is critical in analyzing large unstructured datasets with only a few labels. Separable representations can lead to supervised models with better classification capabilities and additionally aid in generating new labeled samples. Most unsupervised and semisupervised methods to analyze large datasets do not leverage the existing small amounts of labels to get better representations. In this paper, we propose a spatiotemporal clustering paradigm that uses spatial and temporal features combined with a constrained loss to produce separable representations. We show the working of this method on the newly published dataset ReaLSAT, a dataset of surface water dynamics for over 680,000 lakes across the world, making it an essential dataset in terms of ecology and sustainability. Using this large unlabelled dataset, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHydrological Forecasting Using AI · Hydrology and Watershed Management Studies · Machine Learning and Data Classification
