Stress Testing BERT Anaphora Resolution Models for Reaction Extraction in Chemical Patents
Chieling Yueh, Evangelos Kanoulas, Bruno Martins, Camilo Thorne, Saber, Akhondi

TL;DR
This paper evaluates the robustness of BERT-based anaphora resolution models for extracting chemical reactions from patents, comparing performance in noise-free and noisy conditions, and explores methods to enhance their noise resilience.
Contribution
It introduces an analysis of BERT-based anaphora resolution models' robustness in chemical patent reaction extraction and proposes strategies to improve their noise tolerance.
Findings
Models perform significantly worse in noisy environments.
Certain noise reduction techniques improve model robustness.
Anaphora resolution is critical for accurate reaction extraction.
Abstract
The high volume of published chemical patents and the importance of a timely acquisition of their information gives rise to automating information extraction from chemical patents. Anaphora resolution is an important component of comprehensive information extraction, and is critical for extracting reactions. In chemical patents, there are five anaphoric relations of interest: co-reference, transformed, reaction associated, work up, and contained. Our goal is to investigate how the performance of anaphora resolution models for reaction texts in chemical patents differs in a noise-free and noisy environment and to what extent we can improve the robustness against noise of the model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Advanced Text Analysis Techniques · Semantic Web and Ontologies
