Beyond Deep Learning: Speech Segmentation and Phone Classification with Neural Assemblies

Trevor Adelson; Vidhyasaharan Sethu; Ting Dang

arXiv:2603.16923·eess.AS·March 19, 2026

Beyond Deep Learning: Speech Segmentation and Phone Classification with Neural Assemblies

Trevor Adelson, Vidhyasaharan Sethu, Ting Dang

PDF

Open Access

TL;DR

This paper presents a biologically inspired neural assembly framework for speech segmentation and classification that operates directly on continuous speech, achieving competitive results without traditional deep learning training.

Contribution

It introduces an AC-based speech processing system combining neural encoding, hierarchical architecture, and cross-area updates, enabling boundary detection and classification without weight training.

Findings

01

Detects phone boundaries with F1=0.69

02

Achieves 47.5% accuracy on phone recognition

03

Operates without weight training

Abstract

Deep learning dominates speech processing but relies on massive datasets, global backpropagation-guided weight updates, and produces entangled representations. Assembly Calculus (AC), which models sparse neuronal assemblies via Hebbian plasticity and winner-take-all competition, offers a biologically grounded alternative, yet prior work focused on discrete symbolic inputs. We introduce an AC-based speech processing framework that operates directly on continuous speech by combining three key contributions:(i) neural encoding that converts speech into assembly-compatible spike patterns using probabilistic mel binarisation and population-coded MFCCs; (ii) a multi-area architecture organising assemblies across hierarchical timescales and classes; and (iii) cross-area update schemes for downstream tasks. Applied to two core tasks of boundary detection and segment classification, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Language and cultural evolution · Neural dynamics and brain function