Exploring Linear Feature Disentanglement For Neural Networks

Tiantian He; Zhibin Li; Yongshun Gong; Yazhou Yao; Xiushan Nie; Yilong; Yin

arXiv:2203.11700·cs.LG·March 23, 2022

Exploring Linear Feature Disentanglement For Neural Networks

Tiantian He, Zhibin Li, Yongshun Gong, Yazhou Yao, Xiushan Nie, Yilong, Yin

PDF

Open Access

TL;DR

This paper investigates whether certain features in neural networks become linearly separable earlier and can be detached, proposing a learnable mask to identify and prune such features with minimal performance impact.

Contribution

It introduces a learnable mask module to distinguish linear from non-linear features, enabling partial feature detachment and a pruning strategy in neural networks.

Findings

01

Some features reach linear separability earlier than others.

02

Partial feature detachment minimally affects model performance.

03

The method is validated on four datasets with promising results.

Abstract

Non-linear activation functions, e.g., Sigmoid, ReLU, and Tanh, have achieved great success in neural networks (NNs). Due to the complex non-linear characteristic of samples, the objective of those activation functions is to project samples from their original feature space to a linear separable feature space. This phenomenon ignites our interest in exploring whether all features need to be transformed by all non-linear functions in current typical NNs, i.e., whether there exists a part of features arriving at the linear separable feature space in the intermediate layers, that does not require further non-linear variation but an affine transformation instead. To validate the above hypothesis, we explore the problem of linear feature disentanglement for neural networks in this paper. Specifically, we devise a learnable mask module to distinguish between linear and non-linear features.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM

MethodsPruning