A Split-Window Transformer for Multi-Model Sequence Spammer Detection   using Multi-Model Variational Autoencoder

Zhou Yang; Yucai Pang; Hongbo Yin; Yunpeng Xiao

arXiv:2502.16483·cs.LG·February 25, 2025

A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder

Zhou Yang, Yucai Pang, Hongbo Yin, Yunpeng Xiao

PDF

Open Access

TL;DR

This paper proposes MS$^2$Dformer, a novel Transformer architecture with a split-window multi-head attention mechanism and a multi-modal variational autoencoder, effectively detecting sequence spammers in multi-modal data.

Contribution

It introduces a hierarchical split-window attention mechanism and a user behavior tokenization algorithm for multi-modal sequence spammer detection.

Findings

01

MS$^2$Dformer outperforms previous state-of-the-art methods.

02

The hierarchical split-window attention effectively handles ultra-long sequences.

03

The model demonstrates strong generalization as a backbone for spam detection.

Abstract

This paper introduces a new Transformer, called MS $^{2}$ Dformer, that can be used as a generalized backbone for multi-modal sequence spammer detection. Spammer detection is a complex multi-modal task, thus the challenges of applying Transformer are two-fold. Firstly, complex multi-modal noisy information about users can interfere with feature mining. Secondly, the long sequence of users' historical behaviors also puts a huge GPU memory pressure on the attention computation. To solve these problems, we first design a user behavior Tokenization algorithm based on the multi-modal variational autoencoder (MVAE). Subsequently, a hierarchical split-window multi-head attention (SW/W-MHA) mechanism is proposed. The split-window strategy transforms the ultra-long sequences hierarchically into a combination of intra-window short-term and inter-window overall attention. Pre-trained on the public…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Spam and Phishing Detection

MethodsAttention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Byte Pair Encoding · Dense Connections · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer