Loading paper
Turning a CLIP Model into a Scene Text Detector | Tomesphere