OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting
Sanah Suri, Kieran Ringel, Maike Sonnewald

TL;DR
OceanCBM is a novel concept bottleneck model that enhances interpretability in ocean forecasting by integrating physical concepts, improving mechanistic understanding without compromising predictive accuracy.
Contribution
It introduces a mixed supervision approach with prescribed and free concepts, enabling interpretable ocean dynamics modeling with consistent physical representations.
Findings
Mixed supervision yields stable mechanistic representations.
OceanCBM maintains predictive skill while providing interpretability.
Baseline models learn variable latent structures despite similar accuracy.
Abstract
Extreme ocean phenomena are challenging not only to predict but to diagnose, as accurate forecasts alone do not reveal the underlying physical drivers. While recent machine learning approaches achieve strong predictive skill, they remain largely opaque and provide limited guarantees of fidelity to ground-truth physics. We introduce OceanCBM, the first concept bottleneck model (CBM) for spatiotemporal prediction and mechanistic interrogation of ocean dynamics. OceanCBM uses mixed supervision to predict mixed layer heat content, a key precursor of marine heatwaves, while routing information through an intermediate layer of prescribed concepts derived from geophysical fluid dynamics and a 'free' concept. This design imposes soft physical structure without over-constraining the model, and the free concept both regularizes concept predictions and captures residual physical processes. Across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
