Human detection of machine manipulated media

Matthew Groh; Ziv Epstein; Nick Obradovich; Manuel Cebrian; Iyad Rahwan

arXiv:1907.05276·cs.CV·February 28, 2022


TL;DR

This study investigates how exposure and feedback influence humans' ability to detect AI-generated manipulated media, showing significant learning effects over a short series of examples.

Contribution

It provides empirical evidence that iterative feedback enhances human detection of machine-manipulated media, highlighting a potential method to improve media literacy.

Findings

01

Participants improved detection accuracy by over 10 percentage points after ten images.

02

Exposure to manipulated media with feedback increases detection skills.

03

Human ability to identify fake content can be trained and improved.

Abstract

Recent advances in neural networks for content generation enable artificial intelligence (AI) models to generate high-quality media manipulations. Here we report on a randomized experiment designed to study the effect of exposure to media manipulations on over 15,000 individuals' ability to discern machine-manipulated media. We engineer a neural network to plausibly and automatically remove objects from images, and we deploy this neural network online with a randomized experiment where participants can guess which image out of a pair of images has been manipulated. The system provides participants feedback on the accuracy of each guess. In the experiment, we randomize the order in which images are presented, allowing causal identification of the learning curve surrounding participants' ability to detect fake content. We find sizable and robust evidence that individuals learn to detect…

Tables

Table 1: Ordinary least squares regression with participant and image fixed effects evaluating the effect of image position on users’ accuracy in identifying manipulated images. Robust standard errors clustered at the image level are in parentheses. *, **, and *** indicate statistical significance at the 90, 95, and 99 percent confidence levels, respectively. All columns include participant and image fixed effects. Column (1) includes all images; column (2) drops all users who submitted fewer than 10 guesses and removes all control images where nothing was removed; column (3) drops all observations where a user has already seen a particular image; column (4) keeps only the images qualitatively judged as very high quality.

	(1)	(2)	(3)	(4)
Log(Image Position)	0.0261***	0.0259***	0.0259***	0.0255***
	(0.0012)	(0.0012)	(0.0013)	(0.0029)
$N$	242216	192665	172434	55692
Mean Accuracy on $1^{\text{st}}$ Image	0.73	0.78	0.78	0.74
Mean Accuracy on $10^{\text{th}}$ Image	0.88	0.88	0.88	0.83
$R^{2}$	0.29	0.19	0.20	0.26

Table 2: Ordinary least squares regression with participant and image fixed effects evaluating the effect of image position on users’ accuracy in identifying manipulated images. Robust standard errors clustered at the image level are in parentheses. *, **, and *** indicate statistical significance at the 90, 95, and 99 percent confidence levels, respectively. All columns include participant and image fixed effects. Column (1) includes all images; column (2) drops all users who submitted fewer than 10 guesses and removes all control images where nothing was removed; column (3) drops all observations where a user has already seen a particular image; column (4) keeps only the images qualitatively judged as very high quality.

	(1)	(2)	(3)	(4)
2nd	0.0507***	0.0569***	0.0571***	0.0378***
	(0.0042)	(0.0059)	(0.0060)	(0.0131)
3rd	0.0672***	0.0744***	0.0746***	0.0454***
	(0.0048)	(0.0060)	(0.0059)	(0.0123)
4th	0.0775***	0.0888***	0.0885***	0.0686***
	(0.0050)	(0.0058)	(0.0058)	(0.0121)
5th	0.0859***	0.0978***	0.0967***	0.0749***
	(0.0052)	(0.0062)	(0.0064)	(0.0129)
6th	0.0817***	0.0962***	0.0963***	0.0613***
	(0.0057)	(0.0064)	(0.0064)	(0.0130)
7th	0.0900***	0.1032***	0.1039***	0.0741***
	(0.0056)	(0.0064)	(0.0065)	(0.0134)
8th	0.1019***	0.1120***	0.1106***	0.0904***
	(0.0055)	(0.0065)	(0.0065)	(0.0137)
9th	0.1028***	0.1136***	0.1134***	0.0959***
	(0.0055)	(0.0063)	(0.0063)	(0.0142)
10th	0.1030***	0.1135***	0.1123***	0.1014***
	(0.0056)	(0.0062)	(0.0064)	(0.0135)
More than 10	0.1106***	0.1215***	0.1197***	0.0985***
	(0.0051)	(0.0059)	(0.0059)	(0.0122)
$N$	242216	192665	172434	55692
Mean Accuracy on $1^{\text{st}}$ Image	0.73	0.78	0.78	0.74
Mean Accuracy on $10^{\text{th}}$ Image	0.88	0.88	0.88	0.83
$R^{2}$	0.29	0.20	0.20	0.26

Table 3: Ordinary least squares regression with image fixed effects evaluating the effect of image position on users’ accuracy in identifying manipulated images. Robust standard errors clustered at the image level are in parentheses. *, **, and *** indicate statistical significance at the 90, 95, and 99 percent confidence levels, respectively. All columns drop users who submitted fewer than 10 guesses, drop all control images where nothing was removed, drop all guesses beyond each participant’s 10th guess, and include image fixed effects.

	(1)	(2)	(3)	(4)	(5)	(6)	(7)	(8)	(9)	(10)
Log(Image Position)	0.0477***	0.0386***	0.0565***	0.0401***	0.0460***	0.0505***	0.0367***	0.0492***	0.0410***	0.0406***
	(0.0029)	(0.0034)	(0.0044)	(0.0061)	(0.0043)	(0.0049)	(0.0050)	(0.0054)	(0.0028)	(0.0034)
High Subjective Quality Interaction	-0.0076
	(0.0066)
High Subjective Quality	-0.0879***
	(0.0098)
Low Accuracy Interaction		-0.0044
		(0.0081)
Low Accuracy		-0.2045***
		(0.0120)
Small Mask Interaction			-0.0190**
			(0.0074)
Small Mask			-0.0551***
			(0.0108)
Low Entropy Interaction				0.0138*
				(0.0082)
Low Entropy				0.0264**
				(0.0120)
1 Object Disappeared Interaction					-0.0016
					(0.0056)
1 Object Disappeared					0.0086
					(0.0082)
First Correct Interaction						-0.0217***
						(0.0035)
First Correct						-0.0017
						(0.0037)
Has Person Interaction							0.0130**
							(0.0060)
Has Person							0.0028
							(0.0088)
Fast Completion Interaction								-0.0108*
								(0.0066)
Fast Completion								0.0430***
								(0.0121)
Mobile Interaction									0.0264***
									(0.0067)
Mobile									-0.0753***
									(0.0120)
Right Placement Interaction										0.0088**
										(0.0043)
Right Placement										-0.0108
										(0.0074)
Constant	0.8377***	0.8915***	0.8249***	0.7778***	0.8021***	0.8478***	0.8051***	0.7836***	0.8184***	0.8122***
	(0.0043)	(0.0050)	(0.0065)	(0.0090)	(0.0064)	(0.0061)	(0.0073)	(0.0091)	(0.0043)	(0.0054)
$N$	51611	25637	25655	25868	51611	38454	51611	24963	51611	51611
$R^{2}$	0.04	0.11	0.03	0.02	0.01	0.01	0.01	0.02	0.02	0.01
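
One way to read the interaction rows in Table 3 is as a shift in the learning-curve slope for a subgroup. The sketch below assumes each "Interaction" row is the product of the group indicator and Log(Image Position), which the table layout suggests but does not state. The function name `position_slope` is hypothetical; the coefficients are copied from column (9).

```python
def position_slope(beta_log_position, beta_interaction, in_group):
    """Slope of accuracy with respect to log(image position).

    Assumes a model of the form
        y = c + b * log(T) + g * Z + d * Z * log(T),
    so the slope is b for Z = 0 and b + d for Z = 1.
    """
    return beta_log_position + (beta_interaction if in_group else 0.0)


# Coefficients from Table 3, column (9): Log(Image Position) and Mobile Interaction.
BETA_LOG_POSITION = 0.0410
BETA_MOBILE_INTERACTION = 0.0264

mobile_slope = position_slope(BETA_LOG_POSITION, BETA_MOBILE_INTERACTION, True)
non_mobile_slope = position_slope(BETA_LOG_POSITION, BETA_MOBILE_INTERACTION, False)

print(f"mobile slope:     {mobile_slope:.4f}")
print(f"non-mobile slope: {non_mobile_slope:.4f}")
```

Under this reading, mobile participants start lower (the Mobile main effect is negative) but have a steeper slope in log position. The sketch ignores the standard errors, so it says nothing about whether a given difference in slopes is statistically meaningful.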

Table 4: Top 10 Target Object Removal Selections for Uploaded Images and Targeted Instagram Crawls on Deep Angel. Each selection of an Instagram username initiated a targeted crawl of Instagram for the three most recently uploaded images of the selected user.

Object	Count	Order
Image Uploads
Person	13450	1
Car	1229	6
Dog	1086	2
Cat	1082	3
Elephant	185	4
Bicycle	158	7
Bird	139	22
Tie	120	31
Airplane	106	13
Stop Sign	99	8

Equations

$$y_{i,j} = \alpha X_{i,j} + \beta \log(T_{i_n}) + \mu_i + \nu_j + \epsilon_{i,j}$$
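
The log-position specification above can be sketched on synthetic data as a dummy-variable OLS fit. This is a minimal illustration, not the authors' code. It assumes a natural logarithm, omits the covariates $X_{i,j}$, and does not compute the clustered standard errors reported in the tables. All function names, effect sizes, and sample sizes are hypothetical.

```python
import numpy as np


def simulate_guesses(n_participants=200, n_images=30, n_seen=10,
                     beta=0.03, binary=True, seed=0):
    """Synthetic data: each participant sees n_seen images in random order."""
    rng = np.random.default_rng(seed)
    mu = rng.normal(0.0, 0.05, n_participants)  # participant effects (mu_i)
    nu = rng.normal(0.0, 0.05, n_images)        # image effects (nu_j)
    part, img, pos, y = [], [], [], []
    for i in range(n_participants):
        order = rng.permutation(n_images)[:n_seen]  # randomized image order
        for t, j in enumerate(order, start=1):
            p = 0.75 + beta * np.log(t) + mu[i] + nu[j]
            # binary=True draws a correct/incorrect guess; False keeps p itself
            outcome = float(rng.random() < p) if binary else p
            part.append(i)
            img.append(j)
            pos.append(t)
            y.append(outcome)
    return np.array(part), np.array(img), np.array(pos), np.array(y)


def fit_log_position(part, img, pos, y):
    """OLS of y on log(position) with participant and image dummies."""
    n = len(y)
    P = np.zeros((n, part.max() + 1))
    P[np.arange(n), part] = 1.0
    J = np.zeros((n, img.max() + 1))
    J[np.arange(n), img] = 1.0
    # Drop one dummy per factor so the intercept is identified.
    X = np.column_stack([np.log(pos), np.ones(n), P[:, 1:], J[:, 1:]])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[0]  # estimate of beta


part, img, pos, y = simulate_guesses()
print("estimated beta:", fit_log_position(part, img, pos, y))
```

With binary outcomes and only 2,000 simulated guesses the estimate is noisy; calling `simulate_guesses(binary=False)` removes the sampling noise and recovers the simulated `beta` up to floating-point error, which is a quick check that the design matrix is set up correctly.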

$$y_{i,j} = \alpha X_{i,j} + \beta_1 T_{i_1} + \beta_2 T_{i_2} + \beta_3 T_{i_3} + \dots + \beta_9 T_{i_9} + \beta_{10} T_{i_{10}} + \mu_i + \nu_j + \epsilon_{i,j}$$
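
The position indicators in this specification can be built as a dummy matrix. The sketch below follows the row layout of Table 2 (2nd through 10th, plus "More than 10") and assumes the first position is the omitted reference category, which the table suggests but the equation does not state. The function name `position_dummies` is hypothetical.

```python
import numpy as np


def position_dummies(positions):
    """Indicator columns for positions 2..10 and a final "more than 10" column.

    Position 1 is the assumed reference category, so it maps to an all-zero row.
    """
    positions = np.asarray(positions)
    capped = np.minimum(positions, 11)  # 11 stands in for "more than 10"
    categories = np.arange(2, 12)       # 2, 3, ..., 10, 11
    return (capped[:, None] == categories[None, :]).astype(float)


D = position_dummies([1, 2, 10, 11, 25])
print(D.shape)  # one row per guess, one column per non-reference category
```

Each coefficient on these columns is then read as the change in accuracy relative to the first image seen, conditional on the fixed effects.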

$$\min_{G} \max_{D_1, D_2, D_3} \sum_{k=1,2,3} L_{GAN}(G, D_k) + \lambda_{VGG} L_{VGG}(G(x), y) + \lambda_{fm} \sum_{k=1,2,3} L_{fm}(G, D_k)$$
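
From the generator's side, the objective above is a weighted sum of three kinds of terms: adversarial losses against three discriminators, a VGG-based loss, and feature-matching losses. The sketch below only combines already-computed scalar losses. The function name and the default weights are hypothetical placeholders, since the values of $\lambda_{VGG}$ and $\lambda_{fm}$ are not given here.

```python
def generator_objective(gan_losses, fm_losses, vgg_loss,
                        lambda_vgg=10.0, lambda_fm=10.0):
    """Combine per-discriminator losses into one scalar for the generator.

    gan_losses: L_GAN(G, D_k) for each discriminator k
    fm_losses:  L_fm(G, D_k) for each discriminator k
    vgg_loss:   L_VGG(G(x), y)
    lambda_vgg, lambda_fm: weights (placeholder defaults, not from the paper)
    """
    if len(gan_losses) != len(fm_losses):
        raise ValueError("expected one GAN loss and one feature-matching loss per discriminator")
    return sum(gan_losses) + lambda_vgg * vgg_loss + lambda_fm * sum(fm_losses)


# Illustrative numbers only, for three discriminators.
total = generator_objective([0.5, 0.4, 0.3], [0.02, 0.03, 0.05], vgg_loss=0.1)
print(f"generator objective: {total:.3f}")
```

In a training loop the discriminators would be updated to increase the adversarial terms while the generator is updated to decrease this total; that alternation is not shown here.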


Full text

Human detection of machine manipulated media

Matthew Groh