Loading paper
Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor | Tomesphere