Generalized Bilinear Deep Convolutional Neural Networks for Multimodal   Biometric Identification

Sobhan Soleymani; Amirsina Torfi; Jeremy Dawson; Nasser M. Nasrabadi

arXiv:1807.01298·cs.LG·July 4, 2018

Generalized Bilinear Deep Convolutional Neural Networks for Multimodal Biometric Identification

Sobhan Soleymani, Amirsina Torfi, Jeremy Dawson, Nasser M. Nasrabadi

PDF

TL;DR

This paper introduces a multimodal biometric identification system using modality-specific CNNs fused at the feature level, employing generalized compact bilinear fusion for improved accuracy and efficiency across multiple challenging datasets.

Contribution

It proposes a novel generalized compact bilinear fusion method for multimodal CNNs, enhancing biometric identification performance with reduced parameters.

Findings

01

Significant performance improvement over unimodal systems.

02

Effective fusion at the fully-connected layer without performance loss.

03

Robust results across three challenging biometric datasets.

Abstract

In this paper, we propose to employ a bank of modality-dedicated Convolutional Neural Networks (CNNs), fuse, train, and optimize them together for person classification tasks. A modality-dedicated CNN is used for each modality to extract modality-specific features. We demonstrate that, rather than spatial fusion at the convolutional layers, the fusion can be performed on the outputs of the fully-connected layers of the modality-specific CNNs without any loss of performance and with significant reduction in the number of parameters. We show that, using multiple CNNs with multimodal fusion at the feature-level, we significantly outperform systems that use unimodal representation. We study weighted feature, bilinear, and compact bilinear feature-level fusion algorithms for multimodal biometric person identification. Finally, We propose generalized compact bilinear fusion algorithm to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.