Towards Pattern-aware Data Augmentation for Temporal Knowledge Graph Completion
Jiasheng Zhang, Deqiang Ouyang, Shuang Liang, Jie Shao

TL;DR
This paper introduces Booster, a novel data augmentation method for temporal knowledge graph completion that addresses data imbalance and model preferences, leading to significant performance improvements.
Contribution
It is the first to propose a pattern-aware data augmentation strategy specifically designed for TKGs, considering complex semantic and temporal patterns.
Findings
Booster achieves up to 8.7% performance improvement.
The hierarchical scoring algorithm effectively identifies hard-learning samples.
Pattern-aware validation enhances sample quality and model training.
Abstract
Predicting missing facts for temporal knowledge graphs (TKGs) is a fundamental task, called temporal knowledge graph completion (TKGC). One key challenge in this task is the imbalance in data distribution, where facts are unevenly spread across entities and timestamps. This imbalance can lead to poor completion performance or long-tail entities and timestamps, and unstable training due to the introduction of false negative samples. Unfortunately, few previous studies have investigated how to mitigate these effects. Moreover, for the first time, we found that existing methods suffer from model preferences, revealing that entities with specific properties (e.g., recently active) are favored by different models. Such preferences will lead to error accumulation and further exacerbate the effects of imbalanced data distribution, but are overlooked by previous studies. To alleviate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Semantic Web and Ontologies · Graph Theory and Algorithms
