TAG-HGT: A Scalable and Cost-Effective Framework for Inductive Cold-Start Academic Recommendation

Zhexiang Li

arXiv:2601.02381·cs.IR·January 7, 2026

TAG-HGT: A Scalable and Cost-Effective Framework for Inductive Cold-Start Academic Recommendation

Zhexiang Li

PDF

Open Access

TL;DR

This paper introduces TAG-HGT, a scalable, cost-effective framework for inductive cold-start academic recommendation that combines semantic and structural information, achieving state-of-the-art accuracy with vastly reduced inference latency and costs.

Contribution

It proposes a neuro-symbolic framework that distills large language model knowledge into a lightweight graph transformer, enabling real-time, large-scale academic recommendations.

Findings

01

Achieves SOTA System Recall@10 of 91.97% on OpenAlex dataset.

02

Reduces inference latency by 500,000 times compared to generative models.

03

Cuts inference costs by over 99.9%, enabling democratized access.

Abstract

Inductive cold-start recommendation remains the "Achilles' Heel" of industrial academic platforms, where thousands of new scholars join daily without historical interaction records. While recent Generative Graph Models (e.g., HiGPT, OFA) demonstrate promising semantic capabilities, their prohibitive inference latency (often exceeding 13 minutes per 1,000 requests) and massive computational costs render them practically undeployable for real-time, million-scale applications. To bridge this gap between generative quality and industrial scalability, we propose TAG-HGT, a cost-effective neuro-symbolic framework. Adopting a decoupled "Semantics-First, Structure-Refined" paradigm, TAG-HGT utilizes a frozen Large Language Model (DeepSeek-V3) as an offline semantic factory and distills its knowledge into a lightweight Heterogeneous Graph Transformer (HGT) via Cross-View Contrastive Learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Machine Learning in Materials Science