Kernelized Classification in Deep Networks

Sadeep Jayasumana; Srikumar Ramalingam; Sanjiv Kumar

arXiv:2012.09607 · cs.LG · March 22, 2021

Open Access

TL;DR

This paper introduces a kernelized classification layer for deep networks that learns its kernel function automatically during training, replacing the usual linear classifier on learned features with a nonlinear one.

Contribution

It proposes a theoretically grounded method to optimize over all positive definite kernels within deep networks, enabling automatic kernel selection during training.

Findings

01. Improved classification performance on multiple datasets

02. Automatic kernel learning enhances nonlinear decision boundaries

03. Theoretical proof of kernel optimization feasibility

Abstract

We propose a kernelized classification layer for deep networks. Although conventional deep networks introduce an abundance of nonlinearity for representation (feature) learning, they almost universally use a linear classifier on the learned feature vectors. We advocate a nonlinear classification layer by using the kernel trick on the softmax cross-entropy loss function during training and the scorer function during testing. However, the choice of the kernel remains a challenge. To tackle this, we theoretically show the possibility of optimizing over all possible positive definite kernels applicable to our problem setting. This theory is then used to devise a new kernelized classification layer that learns the optimal kernel function for a given problem automatically within the deep network itself. We show the usefulness of the proposed nonlinear classification layer on several datasets…
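
To make the idea of applying the kernel trick to the softmax cross-entropy loss concrete, here is a minimal NumPy sketch. It is not the authors' implementation: it assumes a fixed Gaussian RBF kernel (the paper instead learns the kernel), and the names `rbf_kernel`, `kernel_softmax_cross_entropy`, `predict`, `prototypes`, and `gamma` are hypothetical choices made for illustration. The only change from a standard linear classifier is that each class score is a kernel value k(feature, class vector) rather than a dot product.

```python
import numpy as np


def rbf_kernel(features, prototypes, gamma=1.0):
    """Gaussian RBF kernel between features (n, d) and class vectors (c, d).

    Assumption: a fixed RBF kernel stands in for the learned kernel
    described in the abstract.
    """
    sq_dists = (
        np.sum(features ** 2, axis=1, keepdims=True)
        - 2.0 * features @ prototypes.T
        + np.sum(prototypes ** 2, axis=1)
    )
    # Clamp tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def kernel_softmax_cross_entropy(features, prototypes, labels, gamma=1.0):
    """Mean softmax cross-entropy where the logits are kernel values."""
    scores = rbf_kernel(features, prototypes, gamma)  # shape (n, c)
    # Numerically stable log-softmax over the class axis.
    shifted = scores - scores.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def predict(features, prototypes, gamma=1.0):
    """Test-time scorer: choose the class with the largest kernel value."""
    return rbf_kernel(features, prototypes, gamma).argmax(axis=1)
```

In a real network the class vectors (and, in the paper's setting, the kernel itself) would be trained by backpropagation together with the feature extractor; this sketch covers only the forward computation of the loss and the test-time scorer.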

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Face and Expression Recognition · Machine Learning and Data Classification

Methods: Softmax