CurConMix+: A Unified Spatio-Temporal Framework for Hierarchical Surgical Workflow Understanding
Yongjun Jeon, Jongmin Shin, Kanggil Park, Seonmin Park, Soyoung Lim, Jung Yong Kim, Jinsoo Rhu, Jongman Kim, Gyu-Seong Choi, Namkee Oh, Kyu-Hwan Jung

TL;DR
CurConMix+ is a comprehensive spatio-temporal framework that enhances surgical workflow understanding by addressing class imbalance and semantic interdependence through contrastive learning and multi-scale temporal fusion, supported by a new hierarchical benchmark.
Contribution
It introduces CurConMix+, a novel hierarchical framework combining contrastive learning and temporal transformers, and LLS48, a new benchmark for complex laparoscopic procedures.
Findings
Outperforms state-of-the-art in triplet recognition
Demonstrates strong cross-level generalization
Provides a new hierarchical surgical dataset
Abstract
Surgical action triplet recognition aims to understand fine-grained surgical behaviors by modeling the interactions among instruments, actions, and anatomical targets. Despite its clinical importance for workflow analysis and skill assessment, progress has been hindered by severe class imbalance, subtle visual variations, and the semantic interdependence among triplet components. Existing approaches often address only a subset of these challenges rather than tackling them jointly, which limits their ability to form a holistic understanding. This study builds upon CurConMix, a spatial representation framework. At its core, a curriculum-guided contrastive learning strategy learns discriminative and progressively correlated features, further enhanced by structured hard-pair sampling and feature-level mixup. Its temporal extension, CurConMix+, integrates a Multi-Resolution Temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Domain Adaptation and Few-Shot Learning · Medical Imaging and Analysis
