Model-based inexact graph matching on top of CNNs for semantic scene understanding
J\'er\'emy Chopin, Jean-Baptiste Fasquel, Harold Mouch\`ere and, Rozenn Dahyot, Isabelle Bloch

TL;DR
This paper introduces a novel post-processing module that leverages structural information through inexact graph matching to enhance CNN-based semantic segmentation, demonstrating significant improvements especially with limited training data.
Contribution
The paper presents a new graph matching-based post-processing approach that enforces structural constraints to improve segmentation accuracy in deep learning models.
Findings
Improves CNN segmentation accuracy by about 6.3% on face data.
Reduces Hausdorff distance by 51% on brain MRI data.
More effective with smaller training datasets.
Abstract
Deep learning based pipelines for semantic segmentation often ignore structural information available on annotated images used for training. We propose a novel post-processing module enforcing structural knowledge about the objects of interest to improve segmentation results provided by deep learning. This module corresponds to a "many-to-one-or-none" inexact graph matching approach, and is formulated as a quadratic assignment problem. Our approach is compared to a CNN-based segmentation (for various CNN backbones) on two public datasets, one for face segmentation from 2D RGB images (FASSEG), and the other for brain segmentation from 3D MRIs (IBSR). Evaluations are performed using two types of structural information (distances and directional relations, , this choice being a hyper-parameter of our generic framework). On FASSEG data, results show that our module improves accuracy of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
