Knock-Knock: Acoustic Object Recognition by using Stacked Denoising   Autoencoders

Shan Luo; Leqi Zhu; Kaspar Althoefer; Hongbin Liu

arXiv:1708.04432·cs.CV·August 16, 2017

Knock-Knock: Acoustic Object Recognition by using Stacked Denoising Autoencoders

Shan Luo, Leqi Zhu, Kaspar Althoefer, Hongbin Liu

PDF

TL;DR

This paper demonstrates that deep learning with stacked denoising autoencoders significantly improves acoustic object recognition accuracy and speed compared to traditional handcrafted feature methods.

Contribution

The study introduces a deep learning approach using stacked denoising autoencoders for acoustic object recognition, eliminating the need for handcrafted features and achieving higher accuracy and efficiency.

Findings

01

Achieved 91.50% recognition accuracy with deep learning.

02

Deep learning classification is over 6 times faster than traditional methods.

03

Traditional handcrafted features yielded only 58.22% accuracy.

Abstract

This paper presents a successful application of deep learning for object recognition based on acoustic data. The shortcomings of previously employed approaches where handcrafted features describing the acoustic data are being used, include limiting the capability of the found representation to be widely applicable and facing the risk of capturing only insignificant characteristics for a task. In contrast, there is no need to define the feature representation format when using multilayer/deep learning architecture methods: features can be learned from raw sensor data without defining discriminative characteristics a-priori. In this paper, stacked denoising autoencoders are applied to train a deep learning model. Knocking each object in our test set 120 times with a marker pen to obtain the auditory data, thirty different objects were successfully classified in our experiment and each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.