Leveraging Large Language Models for Information Verification -- an Engineering Approach

Nguyen Nang Hung; Nguyen Thanh Trong; Vuong Thanh Toan; Nguyen An Phuoc; Dao Minh Tu; Nguyen Manh Duc Tuan; Nguyen Dinh Mau

arXiv:2506.18274·cs.LG·June 24, 2025

Leveraging Large Language Models for Information Verification -- an Engineering Approach

Nguyen Nang Hung, Nguyen Thanh Trong, Vuong Thanh Toan, Nguyen An Phuoc, Dao Minh Tu, Nguyen Manh Duc Tuan, Nguyen Dinh Mau

PDF

TL;DR

This paper presents an engineering pipeline that leverages Large Language Models like GPT-4o for multimedia news source verification, automating metadata generation, frame analysis, and discrepancy detection with minimal human intervention.

Contribution

It introduces a practical, automated approach using LLMs for multimedia verification, integrating multiple data modalities and prompt engineering to streamline the process.

Findings

01

Automated pipeline effectively verifies multimedia news sources.

02

Integration of LLMs improves efficiency and accuracy.

03

Minimal human intervention required for validation.

Abstract

For the ACMMM25 challenge, we present a practical engineering approach to multimedia news source verification, utilizing Large Language Models (LLMs) like GPT-4o as the backbone of our pipeline. Our method processes images and videos through a streamlined sequence of steps: First, we generate metadata using general-purpose queries via Google tools, capturing relevant content and links. Multimedia data is then segmented, cleaned, and converted into frames, from which we select the top-K most informative frames. These frames are cross-referenced with metadata to identify consensus or discrepancies. Additionally, audio transcripts are extracted for further verification. Noticeably, the entire pipeline is automated using GPT-4o through prompt engineering, with human intervention limited to final validation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.