Inference for Generative Capsule Models

Alfredo Nazabal; Nikolaos Tsagkas; Christopher K.I. Williams

arXiv:2103.06676·cs.LG·March 16, 2022

Inference for Generative Capsule Models

Alfredo Nazabal, Nikolaos Tsagkas, Christopher K.I. Williams

PDF

Open Access 2 Repos

TL;DR

This paper introduces a generative model for capsule networks, along with a variational inference algorithm, demonstrating superior performance over previous methods on geometric and face data.

Contribution

It formulates a generative capsule model and derives a variational inference algorithm, advancing the understanding and capability of capsule networks.

Findings

01

Outperforms stacked capsule autoencoders on constellation data

02

Successfully infers object transformations and part assignments

03

Demonstrates effectiveness on geometric and face datasets

Abstract

Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge and reason about the relationship between an object and its parts. In this paper we specify a \emph{generative} model for such data, and derive a variational algorithm for inferring the transformation of each object and the assignments of observed parts to the objects. We apply this model to (i) data generated from multiple geometric objects like squares and triangles ("constellations"), and (ii) data from a parts-based model of faces. Recent work by Kosiorek et al. [2019] has used amortized inference via stacked capsule autoencoders (SCAEs) to tackle this problem -- our results show that we significantly outperform them where we can make comparisons (on the constellations data).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing and 3D Reconstruction · Handwritten Text Recognition Techniques