Neural Contract Element Extraction Revisited: Letters from Sesame Street

Ilias Chalkidis; Manos Fergadiotis; Prodromos Malakasiotis; Ion; Androutsopoulos

arXiv:2101.04355·cs.CL·February 23, 2021·1 cites

Neural Contract Element Extraction Revisited: Letters from Sesame Street

Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Ion, Androutsopoulos

PDF

Open Access

TL;DR

This paper compares various neural models for contract element extraction, finding LSTM encoders outperform others, and emphasizes the importance of task-specific choices in model design.

Contribution

It provides a comprehensive evaluation of neural architectures and embeddings, highlighting the superiority of LSTMs and domain-specific embeddings for contract element extraction.

Findings

01

LSTM encoders outperform CNNs, Transformers, and BERT.

02

Domain-specific WORD2VEC embeddings are more effective than GloVe.

03

Recurrency in models is crucial for context-sensitive entity extraction.

Abstract

We investigate contract element extraction. We show that LSTM-based encoders perform better than dilated CNNs, Transformers, and BERT in this task. We also find that domain-specific WORD2VEC embeddings outperform generic pre-trained GLOVE embeddings. Morpho-syntactic features in the form of POS tag and token shape embeddings, as well as context-aware ELMO embeddings do not improve performance. Several of these observations contradict choices or findings of previous work on contract element extraction and generic sequence labeling tasks, indicating that contract element extraction requires careful task-specific choices. Analyzing the results of (i) plain TRANSFORMER-based and (ii) BERT-based models, we find that in the examined task, where the entities are highly context-sensitive, the lack of recurrency in TRANSFORMERs greatly affects their performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Attention Dropout · Dropout · Layer Normalization · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Long Short-Term Memory · Residual Connection