Loading paper
Delta-LLaVA: Base-then-Specialize Alignment for Token-Efficient Vision-Language Models | Tomesphere