One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition