Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities
Yili Jin, Xue Liu, Jiangchuan Liu

TL;DR
This paper reviews recent advances in generative AI for multimedia communication, introduces a novel semantic information-theoretic framework, and discusses future research directions to enhance semantic fidelity in multimedia systems.
Contribution
It proposes an innovative semantic information-theoretic framework tailored for multimedia, bridging generative AI and information theory for improved semantic communication.
Findings
Introduction of semantic entropy and mutual information concepts
Redefinition of multimedia communication focusing on semantics
Identification of future research opportunities in semantic AI
Abstract
Recent breakthroughs in generative artificial intelligence (AI) are transforming multimedia communication. This paper systematically reviews key recent advancements across generative AI for multimedia communication, emphasizing transformative models like diffusion and transformers. However, conventional information-theoretic frameworks fail to address semantic fidelity, critical to human perception. We propose an innovative semantic information-theoretic framework, introducing semantic entropy, mutual information, channel capacity, and rate-distortion concepts specifically adapted to multimedia applications. This framework redefines multimedia communication from purely syntactic data transmission to semantic information conveyance. We further highlight future opportunities and critical research directions. We chart a path toward robust, efficient, and semantically meaningful multimedia…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
