Addressing Segmentation Ambiguity in Neural Linguistic Steganography

Jumon Nozaki; Yugo Murawaki

arXiv:2211.06662·cs.CL·November 15, 2022·6 cites

Addressing Segmentation Ambiguity in Neural Linguistic Steganography

Jumon Nozaki, Yugo Murawaki

PDF

Open Access 1 Repo

TL;DR

This paper investigates how segmentation ambiguity affects neural linguistic steganography, causing decoding failures, and proposes simple solutions applicable across languages, including those without explicit word boundaries.

Contribution

It highlights the impact of segmentation ambiguity on decoding success and introduces effective tricks to mitigate this issue in neural linguistic steganography.

Findings

01

Segmentation ambiguity causes decoding failures in neural linguistic steganography.

02

Proposed tricks effectively reduce decoding errors across languages.

03

Solutions are applicable even to languages without explicit word boundaries.

Abstract

Previous studies on neural linguistic steganography, except Ueoka et al. (2021), overlook the fact that the sender must detokenize cover texts to avoid arousing the eavesdropper's suspicion. In this paper, we demonstrate that segmentation ambiguity indeed causes occasional decoding failures at the receiver's side. With the near-ubiquity of subwords, this problem now affects any language. We propose simple tricks to overcome this problem, which are even applicable to languages without explicit word boundaries.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jumon/himitsu
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Advanced Steganography and Watermarking Techniques · Natural Language Processing Techniques