Modality-Order Matters! A Novel Hierarchical Feature Fusion Method for CoSAm: A Code-Switched Autism Corpus
Mohd Mujtaba Akhtar, Girish, Muskaan Singh, Orchid Chetia Phukan

TL;DR
This paper presents a hierarchical feature fusion method using Transformer Encoders to improve early ASD detection from code-switched speech, achieving high accuracy with a novel bilingual speech corpus.
Contribution
It introduces a new hierarchical fusion strategy for multimodal speech features and applies it to a novel code-switched ASD speech dataset, enhancing classification accuracy.
Findings
Achieved 98.75% classification accuracy.
Developed a novel hierarchical feature fusion approach.
Created the CoSAm code-switched speech corpus.
Abstract
Autism Spectrum Disorder (ASD) is a complex neuro-developmental challenge, presenting a spectrum of difficulties in social interaction, communication, and the expression of repetitive behaviors in different situations. This increasing prevalence underscores the importance of ASD as a major public health concern and the need for comprehensive research initiatives to advance our understanding of the disorder and its early detection methods. This study introduces a novel hierarchical feature fusion method aimed at enhancing the early detection of ASD in children through the analysis of code-switched speech (English and Hindi). Employing advanced audio processing techniques, the research integrates acoustic, paralinguistic, and linguistic information using Transformer Encoders. This innovative fusion strategy is designed to improve classification robustness and accuracy, crucial for early…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Residual Connection · Adam · Dropout · Hierarchical Feature Fusion · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax
