# VideoMem: Constructing, Analyzing, Predicting Short-term and Long-term   Video Memorability

**Authors:** Romain Cohendet, Claire-H\'el\`ene Demarty, Ngoc Q. K. Duong, Martin, Engilberge

arXiv: 1812.01973 · 2024-02-28

## TL;DR

This paper introduces VideoMem, a large dataset with short-term and long-term video memorability annotations, and develops deep learning models to predict video memorability, providing new insights into what makes content memorable.

## Contribution

The paper presents the VideoMem dataset with dual time-scale memorability annotations and proposes neural network models, including attention mechanisms, for predicting video memorability.

## Key findings

- Best model achieves 0.494 Spearman's rank correlation for short-term memorability
- Dataset includes 10,000 videos with dual annotations for short-term and long-term memorability
- Attention-based model offers interpretability of content features affecting memorability

## Abstract

Humans share a strong tendency to memorize/forget some of the visual information they encounter. This paper focuses on providing computational models for the prediction of the intrinsic memorability of visual content. To address this new challenge, we introduce a large scale dataset (VideoMem) composed of 10,000 videos annotated with memorability scores. In contrast to previous work on image memorability -- where memorability was measured a few minutes after memorization -- memory performance is measured twice: a few minutes after memorization and again 24-72 hours later. Hence, the dataset comes with short-term and long-term memorability annotations. After an in-depth analysis of the dataset, we investigate several deep neural network based models for the prediction of video memorability. Our best model using a ranking loss achieves a Spearman's rank correlation of 0.494 for short-term memorability prediction, while our proposed model with attention mechanism provides insights of what makes a content memorable. The VideoMem dataset with pre-extracted features is publicly available.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.01973/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/1812.01973/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1812.01973/full.md

---
Source: https://tomesphere.com/paper/1812.01973