Turning a CLIP Model into a Scene Text Spotter