Deep Complex Networks

Chiheb Trabelsi; Olexa Bilaniuk; Ying Zhang; Dmitriy Serdyuk; Sandeep; Subramanian; Jo\~ao Felipe Santos; Soroush Mehri; Negar Rostamzadeh; Yoshua; Bengio; Christopher J Pal

arXiv:1705.09792·cs.NE·February 27, 2018·169 cites

Deep Complex Networks

Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep, Subramanian, Jo\~ao Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua, Bengio, Christopher J Pal

PDF

Open Access 5 Repos

TL;DR

This paper introduces the foundational components for deep complex-valued neural networks, demonstrating their competitive performance across vision, music transcription, and speech prediction tasks, and achieving state-of-the-art results in audio applications.

Contribution

It provides essential algorithms and techniques for constructing complex-valued deep neural networks, enabling their application in various domains.

Findings

01

Complex models are competitive with real-valued counterparts.

02

Achieved state-of-the-art results on audio-related tasks.

03

Demonstrated effectiveness in vision, music, and speech domains.

Abstract

At present, the vast majority of building blocks, techniques, and architectures for deep learning are based on real-valued operations and representations. However, recent work on recurrent neural networks and older fundamental theoretical analysis suggests that complex numbers could have a richer representational capacity and could also facilitate noise-robust memory retrieval mechanisms. Despite their attractive properties and potential for opening up entirely new neural architectures, complex-valued deep neural networks have been marginalized due to the absence of the building blocks required to design such models. In this work, we provide the key atomic components for complex-valued deep neural networks and apply them to convolutional feed-forward networks and convolutional LSTMs. More precisely, we rely on complex convolutions and present algorithms for complex batch-normalization,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Speech and Audio Processing