Loading paper
Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions | Tomesphere