ROBAD: Robust Adversary-aware Local-Global Attended Bad Actor Detection Sequential Model
Bing He, Mustaque Ahamad, Srijan Kumar

TL;DR
ROBAD is a transformer-based model designed to detect bad actors on online platforms, emphasizing robustness against adversarial attacks by capturing local and global sequence information and using contrastive learning.
Contribution
The paper introduces ROBAD, a novel transformer-based model that enhances robustness in bad actor detection through local-global attention and adversarial sequence training.
Findings
ROBAD outperforms existing models under adversarial attacks.
The model effectively captures local and global sequence features.
Contrastive learning improves detection robustness.
Abstract
Detecting bad actors is critical to ensure the safety and integrity of internet platforms. Several deep learning-based models have been developed to identify such users. These models should not only accurately detect bad actors, but also be robust against adversarial attacks that aim to evade detection. However, past deep learning-based detection models do not meet the robustness requirement because they are sensitive to even minor changes in the input sequence. To address this issue, we focus on (1) improving the model understanding capability and (2) enhancing the model knowledge such that the model can recognize potential input modifications when making predictions. To achieve these goals, we create a novel transformer-based classification model, called ROBAD (RObust adversary-aware local-global attended Bad Actor Detection model), which uses the sequence of user posts to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
