Loading paper
SAViL-Det: Semantic-Aware Vision-Language Model for Multi-Script Text Detection | Tomesphere