Using Sensory Time-cue to enable Unsupervised Multimodal Meta-learning
Qiong Liu, Yanxia Zhang

TL;DR
This paper introduces STUM, a novel unsupervised meta-learning method that leverages temporal cues in IoT sensor data to organize multimodal features without manual labels, improving cross-modal understanding.
Contribution
STUM uniquely uses time relations in sensor data to guide unsupervised feature organization across modalities, advancing meta-learning for IoT applications.
Findings
Effective organization of multimodal features using temporal cues.
Improved cross-modal feature alignment in audiovisual data.
Promising evaluation results demonstrating method's potential.
Abstract
As data from IoT (Internet of Things) sensors become ubiquitous, state-of-the-art machine learning algorithms face many challenges on directly using sensor data. To overcome these challenges, methods must be designed to learn directly from sensors without manual annotations. This paper introduces Sensory Time-cue for Unsupervised Meta-learning (STUM). Different from traditional learning approaches that either heavily depend on labels or on time-independent feature extraction assumptions, such as Gaussian distribution features, the STUM system uses time relation of inputs to guide the feature space formation within and across modalities. The fact that STUM learns from a variety of small tasks may put this method in the camp of Meta-Learning. Different from existing Meta-Learning approaches, STUM learning tasks are composed within and across multiple modalities based on time-cue co-exist…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Music and Audio Processing · Multimodal Machine Learning Applications
