Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic