Loading paper
TACO: Training-free Sound Prompted Segmentation via Semantically Constrained Audio-visual CO-factorization | Tomesphere