Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions

Marwane Hariat; Antoine Manzanera; David Filliat

arXiv:2605.07945·cs.CV·May 11, 2026

Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions

Marwane Hariat, Antoine Manzanera, David Filliat

PDF

TL;DR

CoopNet enhances self-supervised depth, odometry, and optical flow training by adaptively balancing gradients, leading to improved or comparable results on KITTI and CityScapes datasets.

Contribution

It introduces a novel gradient rebalancing method for co-trained networks, using a hybrid loss based on pixel disagreement to improve motion-aware predictions.

Findings

01

Improves depth, odometry, and optical flow accuracy on KITTI and CityScapes.

02

Effectively discards moving object pixels based on reconstruction disagreement.

03

Achieves state-of-the-art or comparable results with existing methods.

Abstract

We present CoopNet, an approach that improves the cooperation of co-trained networks by dynamically adapting the apportionment of gradient, to ensure equitable learning progress. It is applied to motion-aware self-supervised prediction of depth maps, by introducing a new hybrid loss, based on a distribution model of photo-metric reconstruction errors made by, on the one hand the depth + odometry paired networks, and on the other hand the optical flow network. This model essentially assumes that the pixels from moving objects (that must be discarded for training depth and odometry), correspond to those where the two reconstructions strongly disagree. We justify this model by theoretical considerations and experimental evidences. A comparative evaluation on KITTI and CityScapes datasets shows that CoopNet improves or is comparable to the state-of-the-art in depth, odometry and optical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.