A Linear Classifier Based on Entity Recognition Tools and a Statistical Approach to Method Extraction in the Protein-Protein Interaction Literature
An\'alia Louren\c{c}o, Michael Conover, Andrew Wong, Azadeh, Nematzadeh, Fengxia Pan, Hagit Shatkay, Luis M. Rocha

TL;DR
This paper presents a linear classifier enhanced with entity recognition tools for classifying articles and a statistical approach for extracting methods in protein-protein interaction literature, achieving high performance in BioCreative III.
Contribution
It introduces a novel linear classifier leveraging entity recognition and a statistical method extraction approach, outperforming existing systems in PPI article classification.
Findings
Superior classification performance in ACT task
Comparable results in IMT task to other approaches
Effective detection of evidence for PPI detection methods
Abstract
We participated, in the Article Classification and the Interaction Method subtasks (ACT and IMT, respectively) of the Protein-Protein Interaction task of the BioCreative III Challenge. For the ACT, we pursued an extensive testing of available Named Entity Recognition and dictionary tools, and used the most promising ones to extend our Variable Trigonometric Threshold linear classifier. For the IMT, we experimented with a primarily statistical approach, as opposed to employing a deeper natural language processing strategy. Finally, we also studied the benefits of integrating the method extraction approach that we have used for the IMT into the ACT pipeline. For the ACT, our linear article classifier leads to a ranking and classification performance significantly higher than all the reported submissions. For the IMT, our results are comparable to those of other systems, which took very…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
