TL;DR
SCRWKV is an ultra-compact, structure-calibrated vision model designed for precise topological crack segmentation, balancing high accuracy with low resource consumption.
Contribution
It introduces a novel Structure-Field Encoder backbone with modules like AMCM and SCIU, achieving high-precision modeling with linear complexity.
Findings
Outperforms state-of-the-art methods on multiple benchmarks.
Achieves an F1 score of 0.8428 and mIoU of 0.8512 on TUT dataset.
Uses only 1.22 million parameters, demonstrating efficiency.
Abstract
Achieving pixel-level accurate segmentation of structural cracks across diverse scenarios remains a formidable challenge. Existing methods face significant bottlenecks in balancing crack topology modeling with computational efficiency, often failing to reconcile high segmentation quality with low resource demands. To address these limitations, we propose the Ultra-Compact Structure-Calibrated Vision RWKV (SCRWKV), a network that achieves high-precision modeling via a novel Structure-Field Encoder (SFE) backbone while maintaining linear complexity. The SFE integrates the Adaptive Multi-scale Cascaded Modulator (AMCM) to enhance texture representation and utilizes the Structure-Calibrated Insight Unit (SCIU) as its core engine. Specifically, the SCIU employs the Geometry-guided Bidirectional Structure Transformation (GBST) to capture topological correlations and integrates the Dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
