ColoDiff: Integrating Dynamic Consistency With Content Awareness for Colonoscopy Video Generation
Junhu Fu, Shuyu Liang, Wutong Li, Chen Ma, Peng Huang, Kehao Wang, Ke Chen, Shengli Lin, Pinghong Zhou, Zeju Li, Yuanyuan Wang, Yi Guo

TL;DR
ColoDiff is a diffusion-based framework that generates dynamic, content-aware colonoscopy videos with high temporal consistency and precise clinical attribute control, addressing data scarcity and aiding clinical diagnosis.
Contribution
It introduces novel inter-frame and intra-frame modules for dynamic modeling and content control, along with a non-Markovian sampling strategy for real-time video generation.
Findings
Generates smooth, dynamic colonoscopy videos with high quality.
Improves downstream clinical tasks like disease diagnosis and lesion segmentation.
Reduces sampling steps by over 90% for real-time synthesis.
Abstract
Colonoscopy video generation delivers dynamic, information-rich data critical for diagnosing intestinal diseases, particularly in data-scarce scenarios. High-quality video generation demands temporal consistency and precise control over clinical attributes, but faces challenges from irregular intestinal structures, diverse disease representations, and various imaging modalities. To this end, we propose ColoDiff, a diffusion-based framework that generates dynamic-consistent and content-aware colonoscopy videos, aiming to alleviate data shortage and assist clinical analysis. At the inter-frame level, our TimeStream module decouples temporal dependency from video sequences through a cross-frame tokenization mechanism, enabling intricate dynamic modeling despite irregular intestinal structures. At the intra-frame level, our Content-Aware module incorporates noise-injected embeddings and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Generative Adversarial Networks and Image Synthesis · AI in cancer detection
