Automat Parsing of Audio Recordings. Testing Children with Dyslalia.   Theoretical Background

Ovidiu-Andrei Schipor; Titus-Marian Nestor

arXiv:1406.4879·cs.CY·June 20, 2014

Automat Parsing of Audio Recordings. Testing Children with Dyslalia. Theoretical Background

Ovidiu-Andrei Schipor, Titus-Marian Nestor

PDF

Open Access

TL;DR

This paper introduces an automated method for parsing audio recordings of children with dyslalia, utilizing the ADM algorithm and specialized audio processing techniques to improve speech problem identification.

Contribution

The paper presents the ADM algorithm and a novel marker placement method for accurate parsing of audio recordings from children with speech disorders.

Findings

01

Developed the ADM algorithm for audio parsing.

02

Analyzed the complexity of the proposed solution.

03

Created a C# software toolkit for audio stream handling.

Abstract

In this paper we present our researches regarding automat parsing of audio recordings. These recordings are obtained from children with dyslalia and are necessary for an accurate identification of speech problems. We develop the ADM algorithm and we analyze the complexity of this solution. We utilize a digital voice recorder in High Quality mode and with VCVA (Variable Control Voice Actuator) activated. The record format is IMA-ADPCM, 16KHz and 4bits (16 bits PCM). A microphone was placed at 10 cm from mouth in order to minimize environment noise. A software set of classes (C#) was created for handling audio stream (read, conversion between different format, write). We also propose an original solution for placing markers in audio stream. These markers are needed for a correct parsing af full recoding.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Advanced Data Compression Techniques