ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation