Audio Classification of Bit-Representation Waveform
Masaki Okawa, Takuya Saito, Naoki Sawada, Hiromitsu Nishizaki

TL;DR
This paper introduces a novel bit-sequence waveform representation for audio classification, demonstrating superior performance over traditional methods like raw waveforms and spectral analysis in neural network-based tasks.
Contribution
The study proposes a new bit-based waveform representation method for audio classification, bypassing traditional frequency analysis techniques.
Findings
Bit representation waveform outperformed other representations in classification accuracy.
The method was effective for both acoustic event and sound/music classification tasks.
Experimental results confirmed the superiority of the proposed approach.
Abstract
This study investigated the waveform representation for audio signal classification. Recently, many studies on audio waveform classification such as acoustic event detection and music genre classification have been published. Most studies on audio waveform classification have proposed the use of a deep learning (neural network) framework. Generally, a frequency analysis method such as Fourier transform is applied to extract the frequency or spectral information from the input audio waveform before inputting the raw audio waveform into the neural network. In contrast to these previous studies, in this paper, we propose a novel waveform representation method, in which audio waveforms are represented as a bit sequence, for audio classification. In our experiment, we compare the proposed bit representation waveform, which is directly given to a neural network, to other representations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
