Localizing Occluders with Compositional Convolutional Networks

Adam Kortylewski; Qing Liu; Huiyu Wang; Zhishuai Zhang; Alan Yuille

arXiv:1911.08571·cs.CV·November 21, 2019

Localizing Occluders with Compositional Convolutional Networks

Adam Kortylewski, Qing Liu, Huiyu Wang, Zhishuai Zhang, Alan Yuille

PDF

Open Access

TL;DR

This paper improves compositional convolutional networks' ability to localize occluders in images by modeling feature activations with von-Mises-Fisher distributions, enhancing occlusion handling without requiring occluded training data.

Contribution

The paper introduces a novel modeling approach using von-Mises-Fisher distributions for compositional networks, enabling end-to-end training and better occluder localization.

Findings

01

Enhanced occluder localization accuracy

02

Improved classification of partially occluded objects

03

Effective end-to-end training of the model

Abstract

Compositional convolutional networks are generative compositional models of neural network features, that achieve state of the art results when classifying partially occluded objects, even when they have not been exposed to occluded objects during training. In this work, we study the performance of CompositionalNets at localizing occluders in images. We show that the original model is not able to localize occluders well. We propose to overcome this limitation by modeling the feature activations as a mixture of von-Mises-Fisher distributions, which also allows for an end-to-end training of CompositionalNets. Our experimental results demonstrate that the proposed extensions increase the model's performance at localizing occluders as well as at classifying partially occluded objects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · 3D Surveying and Cultural Heritage · Human Pose and Action Recognition