Loading paper
MLLMs Know When Before Speaking: Revealing and Recovering Temporal Grounding via Attention Cues | Tomesphere