HyperFormer: Learning Expressive Sparse Feature Representations via   Hypergraph Transformer

Kaize Ding; Albert Jiongqian Liang; Bryan Perrozi; Ting Chen; Ruoxi; Wang; Lichan Hong; Ed H. Chi; Huan Liu; Derek Zhiyuan Cheng

arXiv:2305.17386·cs.IR·May 30, 2023·1 cites

HyperFormer: Learning Expressive Sparse Feature Representations via Hypergraph Transformer

Kaize Ding, Albert Jiongqian Liang, Bryan Perrozi, Ting Chen, Ruoxi, Wang, Lichan Hong, Ed H. Chi, Huan Liu, Derek Zhiyuan Cheng

PDF

Open Access

TL;DR

HyperFormer introduces a hypergraph transformer model that captures correlations among instances and features to improve sparse feature representation learning in high-dimensional data.

Contribution

The paper proposes HyperFormer, a hypergraph transformer that explicitly models relational correlations among instances and features for better sparse feature representations.

Findings

01

Improves representation learning on sparse high-dimensional data

02

Effectively captures correlations among instances and features

03

Enhances performance in information retrieval tasks

Abstract

Learning expressive representations for high-dimensional yet sparse features has been a longstanding problem in information retrieval. Though recent deep learning methods can partially solve the problem, they often fail to handle the numerous sparse features, particularly those tail feature values with infrequent occurrences in the training data. Worse still, existing methods cannot explicitly leverage the correlations among different instances to help further improve the representation learning on sparse features since such relational prior knowledge is not provided. To address these challenges, in this paper, we tackle the problem of representation learning on feature-sparse data from a graph learning perspective. Specifically, we propose to model the sparse features of different instances using hypergraphs where each node represents a data instance and each hyperedge denotes a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Text and Document Classification Technologies · Topic Modeling

MethodsAttention Is All You Need · fail · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Residual Connection · Position-Wise Feed-Forward Layer