ViSAGE @ NTIRE 2026 Challenge on Video Saliency Prediction

Kun Wang; Yupeng Hu; Zhiran Li; Hao Liu; Qianlong Xiang; Liqiang Nie

arXiv:2604.08613·cs.CV·April 13, 2026

ViSAGE @ NTIRE 2026 Challenge on Video Saliency Prediction

Kun Wang, Yupeng Hu, Zhiran Li, Hao Liu, Qianlong Xiang, Liqiang Nie

PDF

1 Repo

TL;DR

The paper introduces ViSAGE, a multi-expert ensemble framework for video saliency prediction, which achieved top rankings in the NTIRE 2026 Challenge and demonstrates strong generalization.

Contribution

It proposes a novel adaptive gated ensemble approach that leverages diverse inductive biases for improved video saliency prediction.

Findings

01

ViSAGE ranked first on two evaluation metrics.

02

It outperformed most competitors on remaining metrics.

03

The method demonstrated strong generalization ability.

Abstract

In this report, we present our champion solution for the NTIRE 2026 Challenge on Video Saliency Prediction held in conjunction with CVPR 2026. To exploit complementary inductive biases for video saliency, we propose Video Saliency with Adaptive Gated Experts (ViSAGE), a multi-expert ensemble framework. Each specialized decoder performs adaptive gating and modulation to refine spatio-temporal features. The complementary predictions from different experts are then fused at inference. ViSAGE thereby aggregates diverse inductive biases to capture complex spatio-temporal saliency cues in videos. On the Private Test set, ViSAGE ranked first on two out of four evaluation metrics, and outperformed most competing solutions on the other two metrics, demonstrating its effectiveness and generalization ability. Our code has been released at https://github.com/iLearn-Lab/CVPRW26-ViSAGE.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iLearn-Lab/CVPRW26-ViSAGE
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.