# Cluster2Former: Semisupervised Clustering Transformers for Video Instance Segmentation

**Authors:** Áron Fóthi, Adrián Szlatincsán, Ellák Somfai

PMC · DOI: 10.3390/s24030997 · Sensors (Basel, Switzerland) · 2024-02-03

## TL;DR

Cluster2Former is a video instance segmentation method that uses minimal annotations to achieve strong performance, reducing the need for detailed labeling.

## Contribution

The novel use of scribble-based annotations with a similarity-based constraint loss for efficient video instance segmentation.

## Key findings

- Cluster2Former achieves competitive performance using only 0.5% of annotated pixels.
- The method is computationally efficient and cost-effective for limited annotation scenarios.

## Abstract

A novel approach for video instance segmentation is presented using semisupervised learning. Our Cluster2Former model leverages scribble-based annotations for training, significantly reducing the need for comprehensive pixel-level masks. We augment a video instance segmenter, for example, the Mask2Former architecture, with similarity-based constraint loss to handle partial annotations efficiently. We demonstrate that despite using lightweight annotations (using only 0.5% of the annotated pixels), Cluster2Former achieves competitive performance on standard benchmarks. The approach offers a cost-effective and computationally efficient solution for video instance segmentation, especially in scenarios with limited annotation resources.

## Full-text entities

- **Diseases:** VIS (MESH:C537538), injury to people or property (MESH:C000719191)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10857389/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10857389/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC10857389/full.md

---
Source: https://tomesphere.com/paper/PMC10857389