Disco: Densely-overlapping Cell Instance Segmentation via Adjacency-aware Collaborative Coloring
Rui Sun, Yiwen Yang, Kaiyu Guo, Chen Jiang, Dongli Xu, Zhaonan Liu, Tan Pan, Limei Han, Xue Jiang, Wu Wei, Yuan Cheng

TL;DR
Disco introduces a novel adjacency-aware framework for dense cell instance segmentation, addressing complex overlaps by leveraging topological graph properties and a collaborative coloring approach, validated on a new large-scale dataset.
Contribution
The paper presents the first systematic analysis of cell adjacency graph chromaticity and proposes a new method combining topological labeling with deep learning for complex tissue segmentation.
Findings
Most real-world cell graphs are non-bipartite with many odd cycles.
Disco effectively resolves complex adjacency conflicts in dense cellular regions.
The approach outperforms existing methods on complex tissue datasets.
Abstract
Accurate cell instance segmentation is foundational for digital pathology analysis. Existing methods based on contour detection and distance mapping still face significant challenges in processing complex and dense cellular regions. Graph coloring-based methods provide a new paradigm for this task, yet the effectiveness of this paradigm in real-world scenarios with dense overlaps and complex topologies has not been verified. Addressing this issue, we release a large-scale dataset GBC-FS 2025, which contains highly complex and dense sub-cellular nuclear arrangements. We conduct the first systematic analysis of the chromatic properties of cell adjacency graphs across four diverse datasets and reveal an important discovery: most real-world cell graphs are non-bipartite, with a high prevalence of odd-length cycles (predominantly triangles). This makes simple 2-coloring theory insufficient…
Peer Reviews
Decision·ICLR 2026 Poster
- This is the first study revealing that non-bipartite is actually more prevalent in cell datasets, and hence the need for more sophisticated methods aside from 2-coloring or 4-coloring approaches. The authors provide statistics on percentage of bipartite nodes, conflict nodes, and that 3-cycles are more prevalent among the odd-length cycles. - The authors perform adequate experiments, in terms of 4 datasets, several recent and relevant baselines and ablation studies. Their method is usually eit
- The actual method doesn't seem very novel, as BFS exists in literature, and forcing adjacent nodes to have different feature representation is common - The authors do not provide standard deviation in the results. The mean numbers seem very close to baselines. t-test needs to be done to determine if the results are statistically significant or not. - Authors need to provide inference run-times. Because constructing the graph and performing BFS can be time-consuming. - Hyperparameter tuning: Au
Compelling topology evidence. Cross‑dataset analysis shows frequent odd cycles with >90% of odd cycles being triangles in non‑bipartite graphs; conflict/secondary‑conflict ratios quantify difficulty (Table 1, p. 2; Fig. 3b, p. 5). Clean “2 + 1” design. Explicit Marking extracts a large bipartite backbone and pools the rest into a conflict set; Implicit Disambiguation resolves label ambiguity via 𝐿_adj over CAG edges (Sec. 4.2–4.3; Eqs. 1–3, pp. 6–7). Graph‑aware loss that works. Ablations show
1. Is there a formal definition of "dense" and "complex" cells for segmentation analysis? If no, can the authors quantify this aspect of the data to define "highly dense" cell segmentation? 2. "First Topological Analysis" claim is overstated: The repeated assertion of conducting the "first systematic topological analysis" of cell adjacency graphs appears throughout the paper: This claim is incorrect given extensive prior work: Topological Tumor Graphs (Failmezger et al., Cancer Research 2020),
1. The paper provides the first systematic quantitative topological analysis of CAGs. Its key finding (that real-world cell graphs are non-bipartite, 3-cycle dominant, and empirically have $\chi(G)=3$) provides a solid empirical foundation for the graph-coloring paradigm in this field. This analysis moves beyond simple theoretical assumptions (e.g., bipartiteness or the 4-color theorem) and points toward designing more efficient, targeted models. 2. The release of the GBC-FS 2025 dataset is a c
1. The (2+1) "Explicit Marking" strategy of Disco is logically flawed. The authors' own analysis (Appendix A.4.2) clearly states that an empirical chromatic number of $\chi(G)=3$ is almost always sufficient. However, instead of generating an unambiguous 3-color label (as shown in Appendix A.1(d)), the authors' strategy lumps all conflicting nodes—regardless of adjacency—into the same conflict class ($c=t=3$). This artificially creates "secondary conflicts" at the label level (i.e., two adjacent
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Single-cell and spatial transcriptomics · Cell Image Analysis Techniques
