PHISH in MESH: Korean Adversarial Phonetic Substitution and Phonetic-Semantic Feature Integration Defense

Byungjun Kim; Minju Kim; Hyeonchu Park; Bugeun Kim

arXiv:2505.21380·cs.CL·May 28, 2025

PHISH in MESH: Korean Adversarial Phonetic Substitution and Phonetic-Semantic Feature Integration Defense

Byungjun Kim, Minju Kim, Hyeonchu Park, Bugeun Kim

PDF

Open Access

TL;DR

This paper introduces novel phonetic-aware methods to improve hate speech detection in Korean, addressing adversarial phonetic substitutions by leveraging language-specific features and architectural integration.

Contribution

It presents PHISH and MESH, the first phonetic-informed and architecture-based defenses tailored for Korean, enhancing robustness against phonetic adversarial attacks.

Findings

01

Improved detection accuracy on perturbed datasets

02

Effective integration of phonetic features enhances robustness

03

Reflects realistic adversarial attack strategies

Abstract

As malicious users increasingly employ phonetic substitution to evade hate speech detection, researchers have investigated such strategies. However, two key challenges remain. First, existing studies have overlooked the Korean language, despite its vulnerability to phonetic perturbations due to its phonographic nature. Second, prior work has primarily focused on constructing datasets rather than developing architectural defenses. To address these challenges, we propose (1) PHonetic-Informed Substitution for Hangul (PHISH) that exploits the phonological characteristics of the Korean writing system, and (2) Mixed Encoding of Semantic-pHonetic features (MESH) that enhances the detector's robustness by incorporating phonetic information at the architectural level. Our experimental results demonstrate the effectiveness of our proposed methods on both perturbed and unperturbed datasets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling