Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs

Greyson Brothers

arXiv:2506.09215·cs.LG·June 12, 2025

Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs

Greyson Brothers

PDF

Open Access 1 Video

TL;DR

This paper introduces an attention-based adaptive pooling method for transformer outputs that maintains robust performance across varying signal-to-noise ratios, outperforming standard pooling techniques in noisy environments.

Contribution

The work proposes a novel adaptive pooling approach that approximates signal-optimal vector quantization, improving transformer robustness to noise in diverse applications.

Findings

01

Adaptive pooling outperforms AvgPool, MaxPool, and ClsToken in noisy conditions.

02

Theoretical bounds show the method approximates optimal vector quantization.

03

Experimental results confirm robustness across vision, reinforcement learning, and reasoning tasks.

Abstract

We investigate the design of pooling methods used to summarize the outputs of transformer embedding models, primarily motivated by reinforcement learning and vision applications. This work considers problems where a subset of the input vectors contains requisite information for a downstream task (signal) while the rest are distractors (noise). By framing pooling as vector quantization with the goal of minimizing signal loss, we demonstrate that the standard methods used to aggregate transformer outputs, AvgPool, MaxPool, and ClsToken, are vulnerable to performance collapse as the signal-to-noise ratio (SNR) of inputs fluctuates. We then show that an attention-based adaptive pooling method can approximate the signal-optimal vector quantizer within derived error bounds for any SNR. Our theoretical results are first validated by supervised experiments on a synthetic dataset designed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Advanced Neural Network Applications