Loading paper
ClipCrop: Conditioned Cropping Driven by Vision-Language Model | Tomesphere