M2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension