Information Theoretic Analysis of the Fundamental Limits of Content Identification
Sait Tunc, Yucel Altug, Serdar Kozat, Kivanc Mihcak

TL;DR
This paper models content identification as an information-theoretic communication problem with asymmetric codebooks and security constraints, deriving fundamental limits on system capacity to balance identification accuracy and privacy.
Contribution
It introduces a novel information-theoretic framework for content identification with asymmetric codebooks and security constraints, establishing the fundamental capacity limits.
Findings
Capacity equals the operation capacity under i.i.d. encoder codewords
Closed-form capacity expressions for binary symmetric channels
Provides theoretical bounds for privacy-preserving content identification
Abstract
We investigate the content identification problem from an information theoretic perspective and derive its fundamental limits. Here, a rights-holder company desires to keep track of illegal uses of its commercial content, by utilizing resources of a security company, while securing the privacy of its content. Due to privacy issues, the rights-holder company only reveals certain hash values of the original content to the security company. We view the commercial content of the rights-holder company as the codebook of an encoder and the hash values of the content (made available to the security company) as the codebook of a decoder, i.e., the corresponding codebooks of the encoder and the decoder are not the same. Hence, the content identification is modelled as a communication problem using asymmetric codebooks by an encoder and a decoder. We further address "the privacy issue" in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Communication Security Techniques · Digital Media Forensic Detection · Advanced Steganography and Watermarking Techniques
