Can Language Models Capture Graph Semantics? From Graphs to Language Model and Vice-Versa
Tarun Garg, Kaushik Roy, Amit Sheth

TL;DR
This paper investigates whether Transformer-based deep learning models can effectively encode and reconstruct the full semantics of knowledge graphs, revealing limitations in their ability to preserve complex graph information.
Contribution
The study demonstrates that current Transformer models struggle to fully capture and reproduce the semantics of knowledge graphs due to structural differences.
Findings
Transformers cannot fully encode knowledge graph semantics
Disparity between graph structure and Transformer attention limits expressiveness
Knowledge graphs' directed and relationship-based info is not well-preserved
Abstract
Knowledge Graphs are a great resource to capture semantic knowledge in terms of entities and relationships between the entities. However, current deep learning models takes as input distributed representations or vectors. Thus, the graph is compressed in a vectorized representation. We conduct a study to examine if the deep learning model can compress a graph and then output the same graph with most of the semantics intact. Our experiments show that Transformer models are not able to express the full semantics of the input knowledge graph. We find that this is due to the disparity between the directed, relationship and type based information contained in a Knowledge Graph and the fully connected token-token undirected graphical interpretation of the Transformer Attention matrix.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Softmax · Dropout · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Byte Pair Encoding · Label Smoothing
