PHAGE: Patent Heterogeneous Attention-Guided Graph Encoder for Representation Learning

Yongmin Yoo; Qiongkai Xu; Zhangkai Wu; Longbing Cao

arXiv:2605.10073·cs.CL·May 12, 2026

PHAGE: Patent Heterogeneous Attention-Guided Graph Encoder for Representation Learning

Yongmin Yoo, Qiongkai Xu, Zhangkai Wu, Longbing Cao

PDF

TL;DR

PHAGE introduces a novel graph-based patent claim encoder that leverages hierarchical claim dependencies and heterogeneous relations to improve representation learning for patent analysis.

Contribution

It proposes a deterministic graph construction and token-level attention mechanism that incorporate claim hierarchy and relation types into a Transformer-based encoder.

Findings

01

PHAGE outperforms baseline models on classification, retrieval, and clustering tasks.

02

Intra-document claim topology is a stronger inductive bias than inter-document structure.

03

Claim topology representations persist in encoder weights after training.

Abstract

Patent claims form a directed dependency structure in which dependent claims inherit and refine the scope of earlier claims; however, existing patent encoders linearize claims as text and discard this hierarchy. Directly encoding this structure into self-attention poses two challenges: claim dependencies mix relation types that differ in semantics and extraction reliability, and the dependency graph is defined over claims while Transformers attend over tokens. PHAGE addresses the first challenge through a deterministic graph construction pipeline that separates near-deterministic legal citations from noisier rule-based technical relations, preserving type distinctions as heterogeneous edges. It addresses the second through a connectivity mask and learnable relation-aware biases that lift claim-level topology into token-level attention, allowing the encoder to differentially weight each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.