3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection
Rina Carines Cabral, Siwen Luo, Josiah Poon, Soyeon Caren Han

TL;DR
This paper introduces a novel multimodal, multi-teacher knowledge distillation approach for mental health detection that leverages cross-modal insights to improve classification accuracy on social media data.
Contribution
It proposes a multi-teacher architecture that effectively integrates diverse modalities, addressing limitations of simple feature concatenation and computational complexity.
Findings
Enhanced classification performance demonstrated experimentally
Effective integration of text and sound modalities
Multi-teacher approach outperforms single-model baselines
Abstract
The significance of mental health classification is paramount in contemporary society, where digital platforms serve as crucial sources for monitoring individuals' well-being. However, existing social media mental health datasets primarily consist of text-only samples, potentially limiting the efficacy of models trained on such data. Recognising that humans utilise cross-modal information to comprehend complex situations or issues, we present a novel approach to address the limitations of current methodologies. In this work, we introduce a Multimodal and Multi-Teacher Knowledge Distillation model for Mental Health Classification, leveraging insights from cross-modal human understanding. Unlike conventional approaches that often rely on simple concatenation to integrate diverse features, our model addresses the challenge of appropriately representing inputs of varying natures (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
MethodsKnowledge Distillation
