Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review