Steganography Beyond Space-Time with Chain of Multimodal AI
Ching-Chun Chang, Isao Echizen

TL;DR
This paper introduces a novel steganography method that embeds messages within audiovisual media by leveraging a chain of multimodal AI, ensuring covert communication across spatial and temporal domains while maintaining fidelity and robustness.
Contribution
It proposes a new paradigm for audiovisual steganography using multimodal AI to embed messages beyond space and time, with a detailed encoding and decoding process.
Findings
High message transmission accuracy in zero-bit and multi-bit settings
Maintains biometric and semantic fidelity of audiovisual content
Demonstrates robustness against resampling, face-swapping, and voice-cloning
Abstract
Steganography is the art and science of covert writing, with a broad range of applications interwoven within the realm of cybersecurity. As artificial intelligence continues to evolve, its ability to synthesise realistic content emerges as a threat in the hands of cybercriminals who seek to manipulate and misrepresent the truth. Such synthetic content introduces a non-trivial risk of overwriting the subtle changes made for the purpose of steganography. When the signals in both the spatial and temporal domains are vulnerable to unforeseen overwriting, it calls for reflection on what, if any, remains invariant. This study proposes a paradigm in steganography for audiovisual media, where messages are concealed beyond both spatial and temporal domains. A chain of multimodal artificial intelligence is developed to deconstruct audiovisual content into a cover text, embed a message within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
