Loading paper
DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding | Tomesphere