Will LLMs Replace the Encoder-Only Models in Temporal Relation   Classification?

Gabriel Roccabruna; Massimo Rizzoli; Giuseppe Riccardi

arXiv:2410.10476·cs.CL·November 1, 2024

Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification?

Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi

PDF

Open Access 1 Repo

TL;DR

This paper compares large language models and encoder-only models in temporal relation classification, finding that LLMs underperform due to their autoregressive nature and limited focus on sequence parts.

Contribution

It provides a comprehensive evaluation of LLMs versus encoder-only models in temporal relation classification, including explainability and embedding analysis.

Findings

01

LLMs with in-context learning underperform smaller encoder models

02

Autoregressive nature limits LLMs' focus on sequence parts

03

Pre-training differences affect model performance

Abstract

The automatic detection of temporal relations among events has been mainly investigated with encoder-only models such as RoBERTa. Large Language Models (LLM) have recently shown promising performance in temporal reasoning tasks such as temporal question answering. Nevertheless, recent studies have tested the LLMs' performance in detecting temporal relations of closed-source models only, limiting the interpretability of those results. In this work, we investigate LLMs' performance and decision process in the Temporal Relation Classification task. First, we assess the performance of seven open and closed-sourced LLMs experimenting with in-context learning and lightweight fine-tuning approaches. Results show that LLMs with in-context learning significantly underperform smaller encoder-only models based on RoBERTa. Then, we delve into the possible reasons for this gap by applying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brownfortress/llms-trc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Dense Connections · WordPiece · Residual Connection · Linear Warmup With Linear Decay · Dropout