FineBench: Benchmarking and Enhancing Vision-Language Models for Fine-grained Human Activity Understanding