Multi-Level Feature Abstraction from Convolutional Neural Networks for Multimodal Biometric Identification
Sobhan Soleymani, Ali Dabouei, Hadi Kazemi, Jeremy Dawson, Nasser, M. Nasrabadi

TL;DR
This paper introduces a deep multimodal fusion network that combines features from multiple CNN layers across face, iris, and fingerprint modalities, significantly improving person identification accuracy over unimodal methods.
Contribution
The paper presents a novel multi-level feature fusion approach in deep CNNs for multimodal biometric identification, optimizing multiple abstraction levels jointly for enhanced performance.
Findings
Multi-level feature fusion outperforms single-layer fusion.
Joint optimization improves identification accuracy.
Significant parameter reduction with multi-level features.
Abstract
In this paper, we propose a deep multimodal fusion network to fuse multiple modalities (face, iris, and fingerprint) for person identification. The proposed deep multimodal fusion algorithm consists of multiple streams of modality-specific Convolutional Neural Networks (CNNs), which are jointly optimized at multiple feature abstraction levels. Multiple features are extracted at several different convolutional layers from each modality-specific CNN for joint feature fusion, optimization, and classification. Features extracted at different convolutional layers of a modality-specific CNN represent the input at several different levels of abstract representations. We demonstrate that an efficient multimodal classification can be accomplished with a significant reduction in the number of network parameters by exploiting these multi-level abstract representations extracted from all the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
