Safe Guard: an LLM-agent for Real-time Voice-based Hate Speech Detection   in Social Virtual Reality

Yiwen Xu; Qinyang Hou; Hongyu Wan; Mirjana Prpa

arXiv:2409.15623·eess.AS·September 25, 2024

Safe Guard: an LLM-agent for Real-time Voice-based Hate Speech Detection in Social Virtual Reality

Yiwen Xu, Qinyang Hou, Hongyu Wan, Mirjana Prpa

PDF

Open Access

TL;DR

Safe Guard is a real-time voice-based hate speech detection system in social VR using LLMs and audio features, demonstrating improved accuracy and reduced false positives to promote safer virtual environments.

Contribution

The paper introduces a novel LLM-agent system for real-time hate speech detection in social VR, combining GPT and audio features, with evaluation showing enhanced performance over existing methods.

Findings

01

Effective real-time hate speech detection in social VR

02

Reduced false positive rate compared to existing approaches

03

Potential for safer virtual social environments

Abstract

In this paper, we present Safe Guard, an LLM-agent for the detection of hate speech in voice-based interactions in social VR (VRChat). Our system leverages Open AI GPT and audio feature extraction for real-time voice interactions. We contribute a system design and evaluation of the system that demonstrates the capability of our approach in detecting hate speech, and reducing false positives compared to currently available approaches. Our results indicate the potential of LLM-based agents in creating safer virtual environments and set the groundwork for further advancements in LLM-driven moderation approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Face recognition and analysis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Linear Layer · Cosine Annealing · Multi-Head Attention · Weight Decay · Linear Warmup With Cosine Annealing · Adam · Residual Connection