Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification
Sohini Roychowdhury

TL;DR
This paper introduces two semi-supervised deep learning frameworks that automate scene classification and key-frame extraction in videos, significantly reducing manual effort and enabling scalable training of perception systems.
Contribution
It presents novel rule-based and deep learning-based methods for automated scene and key-frame identification in videos, improving efficiency and scalability.
Findings
Achieved 64-93% accuracy in scene categorization.
Filtered less than 10% of frames as key-frames.
Framework scalable to large video datasets.
Abstract
Automating video-based data and machine learning pipelines poses several challenges including metadata generation for efficient storage and retrieval and isolation of key-frames for scene understanding tasks. In this work, we present two semi-supervised approaches that automate this process of manual frame sifting in video streams by automatically classifying scenes for content and filtering frames for fine-tuning scene understanding tasks. The first rule-based method starts from a pre-trained object detector and it assigns scene type, uncertainty and lighting categories to each frame based on probability distributions of foreground objects. Next, frames with the highest uncertainty and structural dissimilarity are isolated as key-frames. The second method relies on the simCLR model for frame encoding followed by label-spreading from 20% of frame samples to label the remaining frames…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Image Processing Techniques and Applications
MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Global Average Pooling · Convolution · Batch Normalization · Max Pooling · 1x1 Convolution · Random Gaussian Blur · Bottleneck Residual Block
