Strategies for Searching Video Content with Text Queries or Video Examples
Shoou-I Yu, Yi Yang, Zhongwen Xu, Shicheng Xu, Deyu Meng, Zexi Mao,, Zhigang Ma, Ming Lin, Xuanchong Li, Huan Li, Zhenzhong Lan, Lu Jiang,, Alexander G. Hauptmann, Chuang Gan, Xingzhong Du, Xiaojun Chang

TL;DR
This paper introduces novel strategies for content-based video retrieval that improve accuracy and speed when searching videos using text queries or video examples, addressing metadata scarcity issues.
Contribution
The paper presents new methods in feature design, fusion, and semantic detection that enhance CBVR performance for different query types, validated through TRECVID 2014 evaluations.
Findings
Outperformed other submissions in TRECVID 2014 in both text and video query searches.
Enhanced retrieval accuracy and speed with proposed strategies.
Demonstrated effectiveness of CBVR improvements in real-world evaluation.
Abstract
The large number of user-generated videos uploaded on to the Internet everyday has led to many commercial video search engines, which mainly rely on text metadata for search. However, metadata is often lacking for user-generated videos, thus these videos are unsearchable by current search engines. Therefore, content-based video retrieval (CBVR) tackles this metadata-scarcity problem by directly analyzing the visual and audio streams of each video. CBVR encompasses multiple research topics, including low-level feature design, feature fusion, semantic detector training and video search/reranking. We present novel strategies in these topics to enhance CBVR in both accuracy and speed under different query inputs, including pure textual queries and query by video examples. Our proposed strategies have been incorporated into our submission for the TRECVID 2014 Multimedia Event Detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization
