Effective Slot Filling Based on Shallow Distant Supervision Methods
Benjamin Roth, Tassilo Barth, Michael Wiegand, Mittul Singh, Dietrich Klakow

TL;DR
This paper presents RelationFactory, a modular end-to-end relation extraction system for slot filling that leverages shallow distant supervision techniques, surface skip n-grams, and query expansion, achieving top performance in TAC KBP 2013.
Contribution
The paper introduces RelationFactory, a relation extraction system that improves slot filling accuracy using shallow techniques, surface skip n-gram features, and Wikipedia-based query expansion, with significant gains over LSV's system from the previous year.
Findings
Achieved top-ranked F1-score of 37.3% in TAC KBP 2013.
Improved relation extraction by focusing on surface skip n-grams.
Enhanced distant supervision classifiers with optimized training and tuning schemes.
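To make the surface skip n-gram idea concrete, here is a minimal sketch in Python. It uses a generic definition (n tokens kept in order, with a bounded number of skipped tokens between them); the exact feature definition used in RelationFactory is not given in this summary, so the function name `skip_ngrams`, the `max_skip` parameter, and the `ARG1`/`ARG2` placeholders are illustrative assumptions.

```python
from itertools import combinations


def skip_ngrams(tokens, n=2, max_skip=1):
    """Collect order-preserving n-grams that skip at most `max_skip` tokens in total.

    Generic illustration only; not necessarily the paper's feature definition.
    """
    grams = set()
    for idx in combinations(range(len(tokens)), n):
        # Tokens covered by the window minus tokens kept = tokens skipped.
        skipped = (idx[-1] - idx[0] + 1) - n
        if skipped <= max_skip:
            grams.add(tuple(tokens[i] for i in idx))
    return grams


# Hypothetical context between two entity placeholders.
context = ["ARG1", "was", "born", "in", "ARG2"]
features = skip_ngrams(context, n=2, max_skip=1)
```

With `max_skip=1`, the pair `("ARG1", "born")` is a feature because only one token is skipped, while `("ARG1", "ARG2")` is not. Features of this kind let a classifier match patterns such as "ARG1 ... born in ARG2" even when intervening words vary.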
Abstract
Spoken Language Systems at Saarland University (LSV) participated this year with 5 runs at the TAC KBP English slot filling track. Effective algorithms for all parts of the pipeline, from document retrieval to relation prediction and response post-processing, are bundled in a modular end-to-end relation extraction system called RelationFactory. The main run solely focuses on shallow techniques and achieved significant improvements over LSV's last year's system, while using the same training data and patterns. Improvements mainly have been obtained by a feature representation focusing on surface skip n-grams and improved scoring for extracted distant supervision patterns. Important factors for effective extraction are the training and tuning scheme for distant supervision classifiers, and the query expansion by a translation model based on Wikipedia links. In the TAC KBP 2013 English…
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
