Knowledge Bases in Support of Large Language Models for Processing Web   News

Yihe Zhang; Nabin Pakka; Nian-Feng Tzeng

arXiv:2411.08278·cs.CL·November 15, 2024

Knowledge Bases in Support of Large Language Models for Processing Web News

Yihe Zhang, Nabin Pakka, Nian-Feng Tzeng

PDF

Open Access

TL;DR

This paper presents a framework combining rule-based extraction and graph convolution to enhance large language models' ability to process web news by building specialized knowledge bases for improved classification.

Contribution

It introduces a novel framework that integrates explicit knowledge extraction with implicit LLM knowledge for better news processing and classification.

Findings

01

Effective news category classification demonstrated

02

Framework outperforms baseline models

03

Promising results on multiple datasets

Abstract

Large Language Models (LLMs) have received considerable interest in wide applications lately. During pre-training via massive datasets, such a model implicitly memorizes the factual knowledge of trained datasets in its hidden parameters. However, knowledge held implicitly in parameters often makes its use by downstream applications ineffective due to the lack of common-sense reasoning. In this article, we introduce a general framework that permits to build knowledge bases with an aid of LLMs, tailored for processing Web news. The framework applies a rule-based News Information Extractor (NewsIE) to news items for extracting their relational tuples, referred to as knowledge bases, which are then graph-convoluted with the implicit knowledge facts of news items obtained by LLMs, for their classification. It involves two lightweight components: 1) NewsIE: for extracting the structural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Semantic Web and Ontologies