Loading paper
VDInstruct: Zero-Shot Key Information Extraction via Content-Aware Vision Tokenization | Tomesphere