Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training
Dongha Choi, Hyunju Lee

TL;DR
This paper presents a novel deep neural network approach that incorporates calibration and uncertainty estimation to improve the extraction of chemical-protein interactions from biomedical texts, achieving state-of-the-art results.
Contribution
It introduces a calibration-based DNN method with self-training and uncertainty estimation for chemical-protein interaction extraction, enhancing reliability and performance.
Findings
Achieved state-of-the-art performance on the ChemProt task.
Maintained higher calibration abilities than previous methods.
Demonstrated the potential of uncertainty estimation for performance improvement.
Abstract
The extraction of interactions between chemicals and proteins from several biomedical articles is important in many fields of biomedical research such as drug development and prediction of drug side effects. Several natural language processing methods, including deep neural network (DNN) models, have been applied to address this problem. However, these methods were trained with hard-labeled data, which tend to become over-confident, leading to degradation of the model reliability. To estimate the data uncertainty and improve the reliability, "calibration" techniques have been applied to deep learning models. In this study, to extract chemical--protein interactions, we propose a DNN-based approach incorporating uncertainty information and calibration techniques. Our model first encodes the input sequence using a pre-trained language-understanding model, following which it is trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMixup
