Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities

Yili Jin; Xue Liu; Jiangchuan Liu

arXiv:2508.17163·cs.MM·August 26, 2025

TL;DR

This paper reviews recent advances in generative AI for multimedia communication, introduces a novel semantic information-theoretic framework, and discusses future research directions to enhance semantic fidelity in multimedia systems.

Contribution

The paper proposes a semantic information-theoretic framework tailored to multimedia, connecting generative AI with information theory so that communication can be analyzed in terms of conveyed meaning rather than transmitted bits alone.

Findings

01. Introduction of semantic entropy and mutual information concepts

02. Redefinition of multimedia communication focusing on semantics

03. Identification of future research opportunities in semantic AI

Abstract

Recent breakthroughs in generative artificial intelligence (AI) are transforming multimedia communication. This paper systematically reviews key recent advancements across generative AI for multimedia communication, emphasizing transformative models like diffusion and transformers. However, conventional information-theoretic frameworks fail to address semantic fidelity, critical to human perception. We propose an innovative semantic information-theoretic framework, introducing semantic entropy, mutual information, channel capacity, and rate-distortion concepts specifically adapted to multimedia applications. This framework redefines multimedia communication from purely syntactic data transmission to semantic information conveyance. We further highlight future opportunities and critical research directions. We chart a path toward robust, efficient, and semantically meaningful multimedia…
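
To make the contrast between syntactic and semantic information concrete, here is a minimal Python sketch. It is not the paper's definition of semantic entropy, which this summary does not give; it assumes, purely for illustration, that each transmitted symbol maps deterministically to a meaning class and that semantic entropy is the Shannon entropy of the resulting meaning distribution. The toy source, the `meaning_of` map, and the function names are all hypothetical.

```python
import math
from collections import defaultdict


def entropy(dist):
    """Shannon entropy in bits of a {outcome: probability} mapping."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)


def semantic_distribution(symbol_dist, meaning_of):
    """Push a distribution over symbols through a symbol -> meaning map.

    Assumption for this sketch: meaning is a deterministic function of the
    symbol. The paper's framework may define semantics differently.
    """
    out = defaultdict(float)
    for symbol, p in symbol_dist.items():
        out[meaning_of[symbol]] += p
    return dict(out)


# Hypothetical toy source: four equally likely captions, two distinct meanings.
symbol_dist = {"a cat": 0.25, "a kitty": 0.25, "a dog": 0.25, "a puppy": 0.25}
meaning_of = {"a cat": "CAT", "a kitty": "CAT", "a dog": "DOG", "a puppy": "DOG"}

syntactic_bits = entropy(symbol_dist)
semantic_bits = entropy(semantic_distribution(symbol_dist, meaning_of))

print(f"syntactic entropy: {syntactic_bits:.2f} bits")
print(f"semantic entropy:  {semantic_bits:.2f} bits")
```

Under this assumption the semantic entropy can never exceed the syntactic entropy, because a deterministic function of a random variable cannot carry more information than the variable itself. In the toy source, distinguishing "a cat" from "a kitty" costs bits that add nothing to the meaning.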
