PublishInCovid19 at WNUT 2020 Shared Task-1: Entity Recognition in Wet   Lab Protocols using Structured Learning Ensemble and Contextualised   Embeddings

Janvijay Singh; Anshul Wadhawan

arXiv:2010.02142·cs.CL·December 17, 2020

PublishInCovid19 at WNUT 2020 Shared Task-1: Entity Recognition in Wet Lab Protocols using Structured Learning Ensemble and Contextualised Embeddings

Janvijay Singh, Anshul Wadhawan

PDF

TL;DR

This paper presents a structured learning ensemble approach using contextualized embeddings for entity recognition in wet lab protocols, achieving top performance in the WNUT 2020 shared task.

Contribution

It introduces a two-phase method combining various embeddings and ensemble strategies, including structured learning ensembling, for improved entity recognition accuracy.

Findings

01

Achieved a micro F1-score of 0.8175 for partial match.

02

Ranked first in partial match and second in exact match.

03

Demonstrated effectiveness of ensemble and structured learning methods.

Abstract

In this paper, we describe the approach that we employed to address the task of Entity Recognition over Wet Lab Protocols -- a shared task in EMNLP WNUT-2020 Workshop. Our approach is composed of two phases. In the first phase, we experiment with various contextualised word embeddings (like Flair, BERT-based) and a BiLSTM-CRF model to arrive at the best-performing architecture. In the second phase, we create an ensemble composed of eleven BiLSTM-CRF models. The individual models are trained on random train-validation splits of the complete dataset. Here, we also experiment with different output merging schemes, including Majority Voting and Structured Learning Ensembling (SLE). Our final submission achieved a micro F1-score of 0.8175 and 0.7757 for the partial and exact match of the entity spans, respectively. We were ranked first and second, in terms of partial and exact match,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.