Key Frame Extraction with Attention Based Deep Neural Networks

Samed Arslan; Senem Tanberk

arXiv:2306.13176·cs.CV·June 26, 2023·1 cites

Key Frame Extraction with Attention Based Deep Neural Networks

Samed Arslan, Senem Tanberk

PDF

Open Access

TL;DR

This paper presents a deep learning approach using an attention-based autoencoder for automatic keyframe extraction from videos, improving summarization efficiency and accuracy.

Contribution

It introduces a novel deep autoencoder with attention for keyframe detection, combining feature extraction, clustering, and selection for better summarization.

Findings

01

Achieved 77% accuracy on TVSUM dataset.

02

Outperformed many existing keyframe extraction methods.

03

Demonstrated effectiveness for video summarization and retrieval.

Abstract

Automatic keyframe detection from videos is an exercise in selecting scenes that can best summarize the content for long videos. Providing a summary of the video is an important task to facilitate quick browsing and content summarization. The resulting photos are used for automated works (e.g. summarizing security footage, detecting different scenes used in music clips) in different industries. In addition, processing high-volume videos in advanced machine learning methods also creates resource costs. Keyframes obtained; It can be used as an input feature to the methods and models to be used. In this study; We propose a deep learning-based approach for keyframe detection using a deep auto-encoder model with an attention layer. The proposed method first extracts the features from the video frames using the encoder part of the autoencoder and applies segmentation using the k-means…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Music and Audio Processing · Shakespeare, Adaptation, and Literary Criticism

Methodsk-Means Clustering