DenseTRF: Texture-Aware Unsupervised Representation Adaptation for Surgical Scene Dense Prediction
Guiqiu Liao, Matja\v{z} Jogan, and Daniel A. Hashimoto

TL;DR
DenseTRF is a self-supervised framework that enhances surgical scene dense prediction models by learning texture-aware representations, significantly improving their robustness to domain shifts without requiring labeled data.
Contribution
It introduces a novel texture-centric attention mechanism using slot attention for unsupervised adaptation in surgical dense prediction tasks.
Findings
Improved cross-distribution generalization over state-of-the-art models.
Effective adaptation without supervision in diverse surgical procedures.
Enhanced robustness to domain shifts in surgical scene analysis.
Abstract
Dense prediction tasks in surgical computer vision, such as segmentation and surgical zone prediction, can provide valuable guidance for laparoscopic and robotic surgery. However, these models often suffer from distribution shifts, as training datasets rarely cover the variability encountered during deployment, leading to poor generalization. We propose DenseTRF, a self-supervised representation adaptation framework based on texture-centric attention. Our method leverages slot attention to learn texture-aware representations that capture invariant visual structures. By adapting these representations to the target distribution without supervision, DenseTRF significantly improves robustness to domain shifts. The framework is implemented through conditioning dense prediction on slot attention and model merging strategies. Experiments across multiple surgical procedures demonstrate improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
