HyperFormer: Learning Expressive Sparse Feature Representations via Hypergraph Transformer
Kaize Ding, Albert Jiongqian Liang, Bryan Perrozi, Ting Chen, Ruoxi, Wang, Lichan Hong, Ed H. Chi, Huan Liu, Derek Zhiyuan Cheng

TL;DR
HyperFormer introduces a hypergraph transformer model that captures correlations among instances and features to improve sparse feature representation learning in high-dimensional data.
Contribution
The paper proposes HyperFormer, a hypergraph transformer that explicitly models relational correlations among instances and features for better sparse feature representations.
Findings
Improves representation learning on sparse high-dimensional data
Effectively captures correlations among instances and features
Enhances performance in information retrieval tasks
Abstract
Learning expressive representations for high-dimensional yet sparse features has been a longstanding problem in information retrieval. Though recent deep learning methods can partially solve the problem, they often fail to handle the numerous sparse features, particularly those tail feature values with infrequent occurrences in the training data. Worse still, existing methods cannot explicitly leverage the correlations among different instances to help further improve the representation learning on sparse features since such relational prior knowledge is not provided. To address these challenges, in this paper, we tackle the problem of representation learning on feature-sparse data from a graph learning perspective. Specifically, we propose to model the sparse features of different instances using hypergraphs where each node represents a data instance and each hyperedge denotes a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Text and Document Classification Technologies · Topic Modeling
MethodsAttention Is All You Need · fail · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Residual Connection · Position-Wise Feed-Forward Layer
