Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation
Efstathia Soufleri, Deepak Ravikumar, Kaushik Roy

TL;DR
This paper introduces Progressive Knowledge Distillation (PKD) and WISE, novel methods for improving compressed video action recognition by transferring knowledge across modalities and combining classifier outputs, leading to significant accuracy gains.
Contribution
The paper proposes PKD and WISE, innovative techniques for knowledge transfer and ensemble inference in compressed video action recognition, enhancing accuracy and generalization.
Findings
PKD improves IC accuracy by up to 11.42% on HMDB-51.
WISE boosts accuracy by up to 9.30% on HMDB-51.
Training ICs with PKD outperforms standard cross-entropy training.
Abstract
Compressed video action recognition classifies video samples by leveraging the different modalities in compressed videos, namely motion vectors, residuals, and intra-frames. For this purpose, three neural networks are deployed, each dedicated to processing one modality. Our observations indicate that the network processing intra-frames tend to converge to a flatter minimum than the network processing residuals, which in turn converges to a flatter minimum than the motion vector network. This hierarchy in convergence motivates our strategy for knowledge transfer among modalities to achieve flatter minima, which are generally associated with better generalization. With this insight, we propose Progressive Knowledge Distillation (PKD), a technique that incrementally transfers knowledge across the modalities. This method involves attaching early exits (Internal Classifiers - ICs) to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
MethodsKnowledge Distillation
