BioNLI: Generating a Biomedical NLI Dataset Using Lexico-semantic Constraints for Adversarial Examples
Mohaddeseh Bastan, Mihai Surdeanu, and Niranjan Balasubramanian

TL;DR
This paper introduces BioNLI, a new biomedical NLI dataset created through a semi-supervised approach that generates challenging negative examples, and benchmarks classifiers showing varied performance across different negative example types.
Contribution
The paper presents a novel semi-supervised method for generating biomedical NLI datasets with diverse negative examples, addressing the lack of suitable datasets in the domain.
Findings
Best classifier achieved around 75% F1 score.
Performance varies significantly across negative example types.
Neuro-logic decoding negative examples are particularly challenging.
Abstract
Natural language inference (NLI) is critical for complex decision-making in biomedical domain. One key question, for example, is whether a given biomedical mechanism is supported by experimental evidence. This can be seen as an NLI problem but there are no directly usable datasets to address this. The main challenge is that manually creating informative negative examples for this task is difficult and expensive. We introduce a novel semi-supervised procedure that bootstraps an NLI dataset from existing biomedical dataset that pairs mechanisms with experimental evidence in abstracts. We generate a range of negative examples using nine strategies that manipulate the structure of the underlying mechanisms both with rules, e.g., flip the roles of the entities in the interaction, and, more importantly, as perturbations via logical constraints in a neuro-logical decoding system. We use this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsFLIP
