Loading paper
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning | Tomesphere