GALAR-TemporalNet v2: Anatomy-Guided Dual-Branch Temporal Classification with Bidirectional Mamba and Dual-Graph GCN for Video Capsule Endoscopy -- after competition results

Jiye Won (1); Seangmin Lee (1); Soon Ki Jung (1) ((1) Kyungpook National University)

arXiv:2605.22209·cs.CV·May 22, 2026

GALAR-TemporalNet v2: Anatomy-Guided Dual-Branch Temporal Classification with Bidirectional Mamba and Dual-Graph GCN for Video Capsule Endoscopy -- after competition results

Jiye Won (1), Seangmin Lee (1), Soon Ki Jung (1) ((1) Kyungpook National University)

PDF

TL;DR

GALAR-TemporalNet v2 is a hierarchical model designed for multi-label temporal classification in Video Capsule Endoscopy, effectively addressing class imbalance, long-range dependencies, and pathology-anatomy entanglement.

Contribution

It introduces a novel architecture combining local self-attention, global graph reasoning, and boundary encoding, with a new anatomy-pathology decoupling pathway, improving classification accuracy.

Findings

01

Achieved [email protected] of 0.3409 and [email protected] of 0.3333 after redesign.

02

Effectively handles class imbalance and long-range dependencies.

03

Outperforms previous competition results.

Abstract

Video Capsule Endoscopy (VCE) poses a challenging multi-label temporal classification problem, requiring simultaneous localization of 8 anatomical regions and detection of 9 pathological findings across tens of thousands of frames. We present GALAR-TemporalNet v2, a hierarchical temporal model that addresses three core challenges: extreme class imbalance, long-range temporal dependencies, and pathology--anatomy entanglement. Our architecture combines windowed self-attention for local modeling, a Dual-Graph GCN for global frame relationships, and Bidirectional Mamba for selective boundary context encoding. A novel anatomy prototype residual pathway decouples pathological deviation signals from normal organ appearance, and a frame-level GCN skip connection stabilizes training of visually confusable rare classes. The competition version, GALAR-TemporalNet, achieved an overall [email protected] of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.