Vietnamese multi-document summary using subgraph selection approach --   VLSP 2022 AbMuSu Shared Task

Huu-Thin Nguyen; Tam Doan Thanh; Cam-Van Thi Nguyen

arXiv:2306.14827·cs.CL·June 27, 2023

Vietnamese multi-document summary using subgraph selection approach -- VLSP 2022 AbMuSu Shared Task

Huu-Thin Nguyen, Tam Doan Thanh, Cam-Van Thi Nguyen

PDF

Open Access

TL;DR

This paper presents a graph-based extractive multi-document summarization method for Vietnamese, transforming the problem into subgraph selection to better capture sentence relationships, achieving top results in VLSP 2022.

Contribution

It introduces a novel subgraph selection approach for multi-document summarization that leverages graph structures to improve summary quality.

Findings

01

Achieved top 10 ranking on ROUGH-2 F1 measure in VLSP 2022.

02

Demonstrated effectiveness of graph-based subgraph selection in Vietnamese MDS.

03

Outperformed several baseline methods in the shared task.

Abstract

Document summarization is a task to generate afluent, condensed summary for a document, andkeep important information. A cluster of documents serves as the input for multi-document summarizing (MDS), while the cluster summary serves as the output. In this paper, we focus on transforming the extractive MDS problem into subgraph selection. Approaching the problem in the form of graphs helps to capture simultaneously the relationship between sentences in the same document and between sentences in the same cluster based on exploiting the overall graph structure and selected subgraphs. Experiments have been implemented on the Vietnamese dataset published in VLSP Evaluation Campaign 2022. This model currently results in the top 10 participating teams reported on the ROUGH-2 $F_1$ measure on the public test set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsFocus