Beyond a Single Perspective: Text Anomaly Detection with Multi-View Language Representations
Yixin Liu, Kehan Yan, Shiyuan Li, Qingfeng Chen, Shirui Pan

TL;DR
This paper introduces MCA^2, a multi-view framework for text anomaly detection that leverages multiple pretrained language models and adaptive modules to improve detection across diverse datasets and anomaly types.
Contribution
It proposes a novel multi-view TAD framework that integrates multiple embeddings, uses contrastive collaboration, and adaptively weights views for enhanced performance.
Findings
Outperforms strong baselines on 10 benchmark datasets.
Effectively captures normal textual patterns from multiple views.
Demonstrates adaptability across diverse datasets and anomaly types.
Abstract
Text anomaly detection (TAD) plays a critical role in various language-driven real-world applications, including harmful content moderation, phishing detection, and spam review filtering. While two-step "embedding-detector" TAD methods have shown state-of-the-art performance, their effectiveness is often limited by the use of a single embedding model and the lack of adaptability across diverse datasets and anomaly types. To address these limitations, we propose to exploit the embeddings from multiple pretrained language models and integrate them into , a multi-view TAD framework. adopts a multi-view reconstruction model to effectively extract normal textual patterns from multiple embedding perspectives. To exploit inter-view complementarity, a contrastive collaboration module is designed to leverage and strengthen the interactions across different views. Moreover, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Spam and Phishing Detection
