HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities
Devanshu Arya, Stevan Rudinac, Marcel Worring

TL;DR
HyperLearn introduces a scalable, distributed hypergraph-based framework using Graph Convolutional Networks for multimodal representation learning, efficiently handling complex intra- and inter-relations and seamlessly incorporating new modalities.
Contribution
It presents a novel distributed hypergraph and GCN-based framework that scales to many modalities and data streams, enabling efficient multimodal representation learning.
Findings
Effective in capturing complex intra- and inter-modal relations
Scales efficiently with additional modalities without increasing computational time
Demonstrated on multimedia datasets with higher-order relations
Abstract
Multimodal datasets contain an enormous amount of relational information, which grows exponentially with the introduction of new modalities. Learning representations in such a scenario is inherently complex due to the presence of multiple heterogeneous information channels. These channels can encode both (a) inter-relations between the items of different modalities and (b) intra-relations between the items of the same modality. Encoding multimedia items into a continuous low-dimensional semantic space such that both types of relations are captured and preserved is extremely challenging, especially if the goal is a unified end-to-end learning framework. The two key challenges that need to be addressed are: 1) the framework must be able to merge complex intra and inter relations without losing any valuable information and 2) the learning model should be invariant to the addition of new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Topic Modeling
MethodsGraph Convolutional Networks
