Neural Photofit: Gaze-based Mental Image Reconstruction

Florian Strohm; Ekta Sood; Sven Mayer; Philipp M\"uller; Mihai B\^ace,; Andreas Bulling

arXiv:2108.07524·cs.CV·August 18, 2021

Neural Photofit: Gaze-based Mental Image Reconstruction

Florian Strohm, Ekta Sood, Sven Mayer, Philipp M\"uller, Mihai B\^ace,, Andreas Bulling

PDF

TL;DR

This paper introduces a neural network-based approach that uses gaze data to reconstruct facial images from mental images, combining encoding, scoring, and decoding networks trained on gaze data.

Contribution

It presents a novel neural framework that decodes mental images into photofits using gaze data, outperforming baseline methods and validated by human studies.

Findings

01

Significantly outperforms baseline predictors

02

Produces visually plausible and accurate photofits

03

Validated by human subject study

Abstract

We propose a novel method that leverages human fixations to visually decode the image a person has in mind into a photofit (facial composite). Our method combines three neural networks: An encoder, a scoring network, and a decoder. The encoder extracts image features and predicts a neural activation map for each face looked at by a human observer. A neural scoring network compares the human and neural attention and predicts a relevance score for each extracted image feature. Finally, image features are aggregated into a single feature vector as a linear combination of all features weighted by relevance which a decoder decodes into the final photofit. We train the neural scoring network on a novel dataset containing gaze data of 19 participants looking at collages of synthetic faces. We show that our method significantly outperforms a mean baseline predictor and report on a human study…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.