Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space
Junho Lee, Jeongwoo Shin, Seung Woo Ko, Seongsu Ha, Joonseok Lee

TL;DR
This paper proposes a semi-optimal frame sampling policy for video classification that reduces the search space from combinatorial to linear, enabling efficient and stable selection of key frames across diverse datasets.
Contribution
It introduces a novel semi-optimal policy that simplifies frame sampling by independently estimating frame importance, significantly reducing computational complexity.
Findings
Reduces search space from O(T^N) to O(T)
Achieves stable high performance across various datasets and models
Efficiently approximates the optimal sampling policy
Abstract
Given a video with frames, frame sampling is a task to select frames, so as to maximize the performance of a fixed video classifier. Not just brute-force search, but most existing methods suffer from its vast search space of , especially when gets large. To address this challenge, we introduce a novel perspective of reducing the search space from to . Instead of exploring the entire space, our proposed semi-optimal policy selects the top frames based on the independently estimated value of each frame using per-frame confidence, significantly reducing the computational complexity. We verify that our semi-optimal policy can efficiently approximate the optimal policy, particularly under practical settings. Additionally, through extensive experiments on various datasets and model architectures, we demonstrate that learning our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Water Systems and Optimization · Anomaly Detection Techniques and Applications
