Effective and Interpretable Information Aggregation with Capacity Networks
Markus Zopf

TL;DR
Capacity networks offer a novel approach to multiple instance learning by generating interpretable intermediate results, improving aggregation performance, and enabling semantic inspection and regularization of the model internals.
Contribution
This work introduces Capacity networks, a new neural architecture inspired by Choquet capacities, that produces interpretable intermediate outputs for better aggregation and interpretability.
Findings
Outperform encoder-decoder architectures in various experiments.
Provide interpretable intermediate results for semantic inspection.
Enable regularization through interpretability.
Abstract
How to aggregate information from multiple instances is a key question multiple instance learning. Prior neural models implement different variants of the well-known encoder-decoder strategy according to which all input features are encoded a single, high-dimensional embedding which is then decoded to generate an output. In this work, inspired by Choquet capacities, we propose Capacity networks. Unlike encoder-decoders, Capacity networks generate multiple interpretable intermediate results which can be aggregated in a semantically meaningful space to obtain the final output. Our experiments show that implementing this simple inductive bias leads to improvements over different encoder-decoder architectures in a wide range of experiments. Moreover, the interpretable intermediate results make Capacity networks interpretable by design, which allows a semantically meaningful inspection,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Text and Document Classification Technologies · Machine Learning and Data Classification
