Loading paper
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming | Tomesphere