LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning
Liulei Li, Wenguan Wang, Yi Yang

TL;DR
LOGICSEG introduces a neural-logic framework for semantic segmentation that incorporates structured hierarchical knowledge and logical reasoning, improving prediction coherence and interpretability.
Contribution
It presents a novel neural-logic approach that integrates symbolic reasoning with deep learning for semantic segmentation, filling a gap in structured visual understanding.
Findings
Effective across multiple datasets and models
Enhances hierarchy-coherent predictions
Demonstrates generality and robustness
Abstract
Current high-performance semantic segmentation models are purely data-driven sub-symbolic approaches and blind to the structured nature of the visual world. This is in stark contrast to human cognition which abstracts visual perceptions at multiple levels and conducts symbolic reasoning with such structured abstraction. To fill these fundamental gaps, we devise LOGICSEG, a holistic visual semantic parser that integrates neural inductive learning and logic reasoning with both rich data and symbolic knowledge. In particular, the semantic concepts of interest are structured as a hierarchy, from which a set of constraints are derived for describing the symbolic relations and formalized as first-order logic rules. After fuzzy logic-based continuous relaxation, logical formulae are grounded onto data and neural computational graphs, hence enabling logic-induced network training. During…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications
