CPFD: Confidence-aware Privileged Feature Distillation for Short Video   Classification

Jinghao Shi; Xiang Shen; Kaili Zhao; Xuedong Wang; Vera Wen; Zixuan; Wang; Yifan Wu; Zhixin Zhang

arXiv:2410.03038·cs.LG·October 8, 2024

CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification

Jinghao Shi, Xiang Shen, Kaili Zhao, Xuedong Wang, Vera Wen, Zixuan, Wang, Yifan Wu, Zhixin Zhang

PDF

TL;DR

This paper introduces CPFD, a novel confidence-aware feature distillation method that enhances short video classification by adaptively transferring privileged dense features, improving performance and stability over existing methods.

Contribution

The paper proposes CPFD, a confidence-based distillation approach that adaptively weights privileged features during training, outperforming traditional uniform-weight methods and reducing performance gaps.

Findings

01

CPFD improves video classification F1 score by 6.76% over end-to-end models.

02

CPFD reduces the performance gap between teacher and student models by 84.6%.

03

The method is effective in diverse tasks and has been deployed in production systems.

Abstract

Dense features, customized for different business scenarios, are essential in short video classification. However, their complexity, specific adaptation requirements, and high computational costs make them resource-intensive and less accessible during online inference. Consequently, these dense features are categorized as `Privileged Dense Features'.Meanwhile, end-to-end multi-modal models have shown promising results in numerous computer vision tasks. In industrial applications, prioritizing end-to-end multi-modal features, can enhance efficiency but often leads to the loss of valuable information from historical privileged dense features. To integrate both features while maintaining efficiency and manageable resource costs, we present Confidence-aware Privileged Feature Distillation (CPFD), which empowers features of an end-to-end multi-modal model by adaptively distilling privileged…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.