Heterogeneous Subgraph Transformer for Fake News Detection
Yuchen Zhang, Xiaoxiao Ma, Jia Wu, Jian Yang, Hao Fan

TL;DR
This paper introduces HeteroSGT, a novel transformer-based method that leverages heterogeneous graph structures and textual semantics to improve fake news detection accuracy on social media.
Contribution
It proposes a heterogeneous subgraph transformer that effectively captures complex relations in news data, enhancing fake news detection beyond existing methods.
Findings
HeteroSGT outperforms five baseline models on five datasets.
The use of subgraph extraction improves detection accuracy.
Ablation studies confirm the effectiveness of the proposed components.
Abstract
Fake news is pervasive on social media, inflicting substantial harm on public discourse and societal well-being. We investigate the explicit structural information and textual features of news pieces by constructing a heterogeneous graph concerning the relations among news topics, entities, and content. Through our study, we reveal that fake news can be effectively detected in terms of the atypical heterogeneous subgraphs centered on them, which encapsulate the essential semantics and intricate relations between news elements. However, suffering from the heterogeneity, exploring such heterogeneous subgraphs remains an open problem. To bridge the gap, this work proposes a heterogeneous subgraph transformer (HeteroSGT) to exploit subgraphs in our constructed heterogeneous graph. In HeteroSGT, we first employ a pre-trained language model to derive both word-level and sentence-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Advanced Malware Detection Techniques
MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Dropout · Dense Connections · Label Smoothing · Residual Connection · Softmax · Adam
