Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications
Kemal Altwlkany, Had\v{z}em Had\v{z}i\'c, Amar Kuri\'c, Emanuel Lacic

TL;DR
This paper presents a novel, resource-efficient approach using knowledge distillation and gradient-boosted trees for real-time classification of early media in voice calls, achieving faster performance with comparable accuracy.
Contribution
It introduces a low-resource, gradient-boosted tree-based method with knowledge distillation for early media classification, outperforming CNN-based models in speed while maintaining accuracy.
Findings
Significant runtime improvement over CNN models.
Comparable classification accuracy with the new approach.
Effective performance in a regional data center case study.
Abstract
This paper investigates the industrial setting of real-time classification of early media exchanged during the initialization phase of voice calls. We explore the application of state-of-the-art audio tagging models and highlight some limitations when applied to the classification of early media. While most existing approaches leverage convolutional neural networks, we propose a novel approach for low-resource requirements based on gradient-boosted trees. Our approach not only demonstrates a substantial improvement in runtime performance, but also exhibits a comparable accuracy. We show that leveraging knowledge distillation and class aggregation techniques to train a simpler and smaller model accelerates the classification of early media in voice calls. We provide a detailed analysis of the results on a proprietary and publicly available dataset, regarding accuracy and runtime…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis
MethodsKnowledge Distillation
