Content Fuzzing for Escaping Information Cocoons on Digital Social Media

Yifeng He; Ziye Tang; Hao Chen

arXiv:2604.05461·cs.CL·April 8, 2026

Content Fuzzing for Escaping Information Cocoons on Digital Social Media

Yifeng He, Ziye Tang, Hao Chen

PDF

TL;DR

ContentFuzz is a framework that rewrites social media posts to promote exposure to diverse viewpoints by altering stance labels while preserving original meaning, thus helping escape information cocoons.

Contribution

It introduces a confidence-guided LLM-based fuzzing method to generate meaning-preserving post rewrites that alter stance detection outcomes.

Findings

01

Effectively changes stance labels across multiple models and datasets.

02

Maintains semantic integrity of original posts.

03

Works in multiple languages.

Abstract

Information cocoons on social media limit users' exposure to posts with diverse viewpoints. Modern platforms use stance detection as an important signal in recommendation and ranking pipelines, which can route posts primarily to like-minded audiences and reduce cross-cutting exposure. This restricts the reach of dissenting opinions and hinders constructive discourse. We take the creator's perspective and investigate how content can be revised to reach beyond existing affinity clusters. We present ContentFuzz, a confidence-guided fuzzing framework that rewrites posts while preserving their human-interpreted intent and induces different machine-inferred stance labels. ContentFuzz aims to route posts beyond their original cocoons. Our method guides a large language model (LLM) to generate meaning-preserving rewrites using confidence feedback from stance detection models. Evaluated on four…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.