Different Victims, Same Layout: Email Visual Similarity Detection for   Enhanced Email Protection

Sachin Shukla; Omid Mirzaei

arXiv:2408.16945·cs.CR·September 5, 2024

Different Victims, Same Layout: Email Visual Similarity Detection for Enhanced Email Protection

Sachin Shukla, Omid Mirzaei

PDF

TL;DR

This paper introduces Pisco, a visual similarity detection method for emails that enhances spam detection by identifying layout reuse, effectively catching evasive emails that traditional keyword-based systems miss.

Contribution

The paper presents a novel visual similarity approach, Pisco, that improves email spam detection by identifying reused email layouts, addressing limitations of existing keyword and rule-based systems.

Findings

01

Email kits are extensively reused across different spam campaigns.

02

Visually similar emails are sent over various time intervals, indicating layout reuse.

03

Pisco effectively detects emails bypassing traditional keyword-based detection.

Abstract

In the pursuit of an effective spam detection system, the focus has often been on identifying known spam patterns either through rule-based detection systems or machine learning (ML) solutions that rely on keywords. However, both systems are susceptible to evasion techniques and zero-day attacks that can be achieved at low cost. Therefore, an email that bypassed the defense system once can do it again in the following days, even though rules are updated or the ML models are retrained. The recurrence of failures to detect emails that exhibit layout similarities to previously undetected spam is concerning for customers and can erode their trust in a company. Our observations show that threat actors reuse email kits extensively and can bypass detection with little effort, for example, by making changes to the content of emails. In this work, we propose an email visual similarity detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus