A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards
Zi-Yi Dou, Sachin Kumar, Yulia Tsvetkov

TL;DR
This paper introduces an end-to-end deep reinforcement learning model for zero-shot cross-lingual summarization, optimizing bilingual semantic similarity to improve summary quality without relying on machine translation pipelines.
Contribution
It presents a novel reinforcement learning approach that directly optimizes bilingual semantic similarity, enhancing cross-lingual summarization performance without extensive data or translation errors.
Findings
Reinforcement learning improves summary fluency and relevance.
The model performs well on English-Chinese and English-German tasks.
Bilingual semantic similarity rewards lead to better summaries.
Abstract
Cross-lingual text summarization aims at generating a document summary in one language given input in another language. It is a practically important but under-explored task, primarily due to the dearth of available data. Existing methods resort to machine translation to synthesize training data, but such pipeline approaches suffer from error propagation. In this work, we propose an end-to-end cross-lingual text summarization model. The model uses reinforcement learning to directly optimize a bilingual semantic similarity metric between the summaries generated in a target language and gold summaries in a source language. We also introduce techniques to pre-train the model leveraging monolingual summarization and machine translation objectives. Experimental results in both English--Chinese and English--German cross-lingual summarization settings demonstrate the effectiveness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
