SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts
Robin Sch\"on, Julian Lorenz, Daniel Kienzle, Rainer Lienhart

TL;DR
This paper introduces SkipClick, an interactive segmentation architecture optimized for winter sports objects, achieving state-of-the-art results by combining quick response mechanisms with low-level features.
Contribution
The paper proposes a novel architecture for interactive segmentation tailored to winter sports, improving response speed and segmentation accuracy over existing models.
Findings
Outperforms SAM and HQ-SAM on WSESeg dataset.
Achieves state-of-the-art results on HQSeg-44k dataset.
Effective on a new skiing human masks dataset.
Abstract
In this paper, we present a novel architecture for interactive segmentation in winter sports contexts. The field of interactive segmentation deals with the prediction of high-quality segmentation masks by informing the network about the objects position with the help of user guidance. In our case the guidance consists of click prompts. For this task, we first present a baseline architecture which is specifically geared towards quickly responding after each click. Afterwards, we motivate and describe a number of architectural modifications which improve the performance when tasked with segmenting winter sports equipment on the WSESeg dataset. With regards to the average NoC@85 metric on the WSESeg classes, we outperform SAM and HQ-SAM by 2.336 and 7.946 clicks, respectively. When applied to the HQSeg-44k dataset, our system delivers state-of-the-art results with a NoC@90 of 6.00 and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization
MethodsSegment Anything Model
