Set Aggregation Network as a Trainable Pooling Layer

{\L}ukasz Maziarka; Marek \'Smieja; Aleksandra Nowak; Jacek Tabor,; {\L}ukasz Struski; Przemys{\l}aw Spurek

arXiv:1810.01868·cs.LG·January 23, 2020

Set Aggregation Network as a Trainable Pooling Layer

{\L}ukasz Maziarka, Marek \'Smieja, Aleksandra Nowak, Jacek Tabor,, {\L}ukasz Struski, Przemys{\l}aw Spurek

PDF

1 Repo

TL;DR

This paper introduces the Set Aggregation Network (SAN), a trainable pooling layer that can embed sets of features into fixed-size vectors, improving classification accuracy and reducing overfitting in neural networks.

Contribution

SAN provides a flexible, trainable pooling mechanism that preserves input information and enhances model performance compared to traditional pooling methods.

Findings

01

SAN improves classification accuracy.

02

SAN reduces overfitting and acts as a regularizer.

03

SAN can embed sets into vectors of arbitrary size.

Abstract

Global pooling, such as max- or sum-pooling, is one of the key ingredients in deep neural networks used for processing images, texts, graphs and other types of structured data. Based on the recent DeepSets architecture proposed by Zaheer et al. (NIPS 2017), we introduce a Set Aggregation Network (SAN) as an alternative global pooling layer. In contrast to typical pooling operators, SAN allows to embed a given set of features to a vector representation of arbitrary size. We show that by adjusting the size of embedding, SAN is capable of preserving the whole information from the input. In experiments, we demonstrate that replacing global pooling layer by SAN leads to the improvement of classification accuracy. Moreover, it is less prone to overfitting and can be used as a regularizer.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gmum/set-aggregation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.