Loading paper
Lumos : Empowering Multimodal LLMs with Scene Text Recognition | Tomesphere