Model-based inexact graph matching on top of CNNs for semantic scene   understanding

J\'er\'emy Chopin; Jean-Baptiste Fasquel; Harold Mouch\`ere and; Rozenn Dahyot; Isabelle Bloch

arXiv:2301.07468·cs.CV·August 2, 2023·1 cites

Model-based inexact graph matching on top of CNNs for semantic scene understanding

J\'er\'emy Chopin, Jean-Baptiste Fasquel, Harold Mouch\`ere and, Rozenn Dahyot, Isabelle Bloch

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel post-processing module that leverages structural information through inexact graph matching to enhance CNN-based semantic segmentation, demonstrating significant improvements especially with limited training data.

Contribution

The paper presents a new graph matching-based post-processing approach that enforces structural constraints to improve segmentation accuracy in deep learning models.

Findings

01

Improves CNN segmentation accuracy by about 6.3% on face data.

02

Reduces Hausdorff distance by 51% on brain MRI data.

03

More effective with smaller training datasets.

Abstract

Deep learning based pipelines for semantic segmentation often ignore structural information available on annotated images used for training. We propose a novel post-processing module enforcing structural knowledge about the objects of interest to improve segmentation results provided by deep learning. This module corresponds to a "many-to-one-or-none" inexact graph matching approach, and is formulated as a quadratic assignment problem. Our approach is compared to a CNN-based segmentation (for various CNN backbones) on two public datasets, one for face segmentation from 2D RGB images (FASSEG), and the other for brain segmentation from 3D MRIs (IBSR). Evaluations are performed using two types of structural information (distances and directional relations, , this choice being a hyper-parameter of our generic framework). On FASSEG data, results show that our module improves accuracy of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeremy-chopin/apacosi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications