Stay On-Topic: Generating Context-specific Fake Restaurant Reviews

Mika Juuti; Bo Sun; Tatsuya Mori; and N. Asokan

arXiv:1805.02400·cs.CR·June 29, 2018·1 cites

Stay On-Topic: Generating Context-specific Fake Restaurant Reviews

Mika Juuti, Bo Sun, Tatsuya Mori, and N. Asokan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural machine translation approach to generate context-specific fake restaurant reviews that are highly undetectable, surpassing previous methods in evasion success, while also developing effective detection tools.

Contribution

It presents a novel NMT-based method for generating more contextually accurate fake reviews and demonstrates its superior evasion capabilities over existing techniques.

Findings

01

Generated reviews have near-optimal undetectability (F-score 47%)

02

The method evades detection more effectively than state-of-the-art (average evasion 3.2/4 vs 1.5/4)

03

Detection tools achieve 97% accuracy in classifying fake reviews

Abstract

Automatically generated fake restaurant reviews are a threat to online review systems. Recent research has shown that users have difficulties in detecting machine-generated fake reviews hiding among real restaurant reviews. The method used in this work (char-LSTM ) has one drawback: it has difficulties staying in context, i.e. when it generates a review for specific target entity, the resulting review may contain phrases that are unrelated to the target, thus increasing its detectability. In this work, we present and evaluate a more sophisticated technique based on neural machine translation (NMT) with which we can generate reviews that stay on-topic. We test multiple variants of our technique using native English speakers on Amazon Mechanical Turk. We demonstrate that reviews generated by the best variant have almost optimal undetectability (class-averaged F-score 47%). We conduct a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hokkaido/fake-reviews
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Sentiment Analysis and Opinion Mining · Hate Speech and Cyberbullying Detection