CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning
Aidan Boyd, Patrick Tinsley, Kevin Bowyer, Adam Czajka

TL;DR
CYBORG is a novel training strategy that integrates human perceptual saliency maps into the loss function of deep learning models, significantly improving generalization in synthetic face detection tasks.
Contribution
This paper introduces CYBORG, a new method that incorporates human-annotated saliency into training loss to enhance deep learning model generalization.
Findings
CYBORG improves accuracy on unseen synthetic face images.
Scaling data or using non-human saliency does not outperform CYBORG.
Adding explicit saliency annotations increases human classification accuracy.
Abstract
Can deep learning models achieve greater generalization if their training is guided by reference to human perceptual abilities? And how can we implement this in a practical manner? This paper proposes a training strategy to ConveY Brain Oversight to Raise Generalization (CYBORG). This new approach incorporates human-annotated saliency maps into a loss function that guides the model's learning to focus on image regions that humans deem salient for the task. The Class Activation Mapping (CAM) mechanism is used to probe the model's current saliency in each training batch, juxtapose this model saliency with human saliency, and penalize large differences. Results on the task of synthetic face detection, selected to illustrate the effectiveness of the approach, show that CYBORG leads to significant improvement in accuracy on unseen samples consisting of face images generated from six…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection
