Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMs