Semantic Steganography: A Framework for Robust and High-Capacity   Information Hiding using Large Language Models

Minhao Bai; Jinshuai Yang; Kaiyi Pang; Yongfeng Huang; Yue Gao

arXiv:2412.11043·cs.CR·December 17, 2024

Semantic Steganography: A Framework for Robust and High-Capacity Information Hiding using Large Language Models

Minhao Bai, Jinshuai Yang, Kaiyi Pang, Yongfeng Huang, Yue Gao

PDF

Open Access

TL;DR

This paper introduces a semantic steganography framework leveraging large language models to embed secret messages into generated texts, achieving high capacity, robustness, and indistinguishability from cover texts.

Contribution

It presents a novel semantic space construction using ontology-entity trees for robust, high-capacity information hiding with LLMs, outperforming existing methods.

Findings

01

Higher embedding capacity than state-of-the-art methods

02

Stegos are indistinguishable from cover texts

03

Enhanced robustness against text rendering and word blocking

Abstract

In the era of Large Language Models (LLMs), generative linguistic steganography has become a prevalent technique for hiding information within model-generated texts. However, traditional steganography methods struggle to effectively align steganographic texts with original model-generated texts due to the lower entropy of the predicted probability distribution of LLMs. This results in a decrease in embedding capacity and poses challenges for decoding stegos in real-world communication channels. To address these challenges, we propose a semantic steganography framework based on LLMs, which construct a semantic space and map secret messages onto this space using ontology-entity trees. This framework offers robustness and reliability for transmission in complex channels, as well as resistance to text rendering and word blocking. Additionally, the stegos generated by our framework are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Chaos-based Image/Signal Encryption