ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer   for Event-Centric Generation and Classification

Yucheng Zhou; Tao Shen; Xiubo Geng; Guodong Long; Daxin Jiang

arXiv:2203.02225·cs.CL·March 10, 2022·1 cites

ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification

Yucheng Zhou, Tao Shen, Xiubo Geng, Guodong Long, Daxin Jiang

PDF

Open Access 1 Repo

TL;DR

ClarET is a versatile pre-trained transformer model designed for event-centric reasoning tasks, effectively capturing event correlations for improved generation and classification across diverse scenarios.

Contribution

The paper introduces ClarET, a novel correlation-aware pre-training framework with three event-centric objectives, enhancing event reasoning capabilities in various applications.

Findings

01

Outperforms existing models on 9 diverse benchmarks

02

Effective in zero- and few-shot learning scenarios

03

Demonstrates strong generalization across multiple reasoning types

Abstract

Generating new events given context with correlated ones plays a crucial role in many event-centric reasoning tasks. Existing works either limit their scope to specific scenarios or overlook event-level correlations. In this paper, we propose to pre-train a general Correlation-aware context-to-Event Transformer (ClarET) for event-centric reasoning. To achieve this, we propose three novel event-centric objectives, i.e., whole event recovering, contrastive event-correlation encoding and prompt-based event locating, which highlight event-level correlations with effective training. The proposed ClarET is applicable to a wide range of event-centric reasoning scenarios, considering its versatility of (i) event-correlation types (e.g., causal, temporal, contrast), (ii) application formulations (i.e., generation and classification), and (iii) reasoning types (e.g., abductive, counterfactual and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yczhou001/ClarET
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Topic Modeling · Advanced Graph Neural Networks

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Dense Connections · Residual Connection · Label Smoothing · Softmax · Absolute Position Encodings · Position-Wise Feed-Forward Layer