# Defining Image Memorability using the Visual Memory Schema

**Authors:** Erdem Akagunduz, Adrian G. Bors, Karla K. Evans

arXiv: 1903.02056 · 2019-03-07

## TL;DR

This paper introduces the Visual Memory Schema (VMS), a new concept capturing shared image components that influence memorability, and employs deep learning to predict memorable regions based on human annotations and eye fixation data.

## Contribution

It proposes the VMS concept, operationalizes it through human annotations, and adapts deep learning models for predicting memorable image regions, advancing understanding of image memorability.

## Key findings

- VMS shows high consistency across observers for recognized images.
- Deep learning models can predict memorable regions with transfer learning.
- Associations between VMS, eye fixations, and saliency are analyzed.

## Abstract

Memorability of an image is a characteristic determined by the human observers' ability to remember images they have seen. Yet recent work on image memorability defines it as an intrinsic property that can be obtained independent of the observer. {The current study aims to enhance our understanding and prediction of image memorability, improving upon existing approaches by incorporating the properties of cumulative human annotations.} We propose a new concept called the Visual Memory Schema (VMS) referring to an organisation of image components human observers share when encoding and recognising images. The concept of VMS is operationalised by asking human observers to define memorable regions of images they were asked to remember during an episodic memory test. We then statistically assess the consistency of VMSs across observers for either correctly or incorrectly recognised images. The associations of the VMSs with eye fixations and saliency are analysed separately as well. Lastly, we adapt various deep learning architectures for the reconstruction and prediction of memorable regions in images and analyse the results when using transfer learning at the outputs of different convolutional network layers.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.02056/full.md

## Figures

46 figures with captions in the complete paper: https://tomesphere.com/paper/1903.02056/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1903.02056/full.md

---
Source: https://tomesphere.com/paper/1903.02056