Causal Intervention for Abstractive Related Work Generation

Jiachang Liu; Qi Zhang; Chongyang Shi; Usman Naseem; Shoujin Wang,; Ivor Tsang

arXiv:2305.13685·cs.CL·May 24, 2023·1 cites

Causal Intervention for Abstractive Related Work Generation

Jiachang Liu, Qi Zhang, Chongyang Shi, Usman Naseem, Shoujin Wang,, Ivor Tsang

PDF

Open Access

TL;DR

This paper introduces a causal intervention module for abstractive related work generation, enhancing the coherence and quality of generated texts by modeling and intervening in causal relations within the generation process.

Contribution

It proposes a novel CaM module that captures causal relations using do-calculus and integrates it with Transformers for improved related work generation.

Findings

01

Causal intervention improves generation coherence.

02

CaM outperforms baseline models on real-world datasets.

03

Enhanced causal modeling leads to higher quality related work texts.

Abstract

Abstractive related work generation has attracted increasing attention in generating coherent related work that better helps readers grasp the background in the current research. However, most existing abstractive models ignore the inherent causality of related work generation, leading to low quality of generated related work and spurious correlations that affect the models' generalizability. In this study, we argue that causal intervention can address these limitations and improve the quality and coherence of the generated related works. To this end, we propose a novel Causal Intervention Module for Related Work Generation (CaM) to effectively capture causalities in the generation process and improve the quality and coherence of the generated related works. Specifically, we first model the relations among sentence order, document relation, and transitional content in related work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Software Engineering Research

MethodsAttention Is All You Need · Absolute Position Encodings · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Multi-Head Attention · Adam