Recognizing Extended Spatiotemporal Expressions by Actively Trained Average Perceptron Ensembles
Wei Zhang, Yang Yu, Osho Gupta, Judith Gelernter

TL;DR
This paper introduces a new dataset and an ensemble learning approach using active training and average perceptron models to improve recognition of complex spatiotemporal expressions in text.
Contribution
It presents a novel dataset, a comparison of inference algorithms, a new active learning re-weighting method, and a joint parser for spatiotemporal expressions and named entities.
Findings
Ensemble model with Belief Propagation outperforms Viterbi decoding.
Active learning reduces annotation costs effectively.
The joint parser improves recognition of expanded spatiotemporal expressions.
Abstract
Precise geocoding and time normalization for text requires that location and time phrases be identified. Many state-of-the-art geoparsers and temporal parsers suffer from low recall. Categories commonly missed by parsers are: nouns used in a non- spatiotemporal sense, adjectival and adverbial phrases, prepositional phrases, and numerical phrases. We collected and annotated data set by querying commercial web searches API with such spatiotemporal expressions as were missed by state-of-the- art parsers. Due to the high cost of sentence annotation, active learning was used to label training data, and a new strategy was designed to better select training examples to reduce labeling cost. For the learning algorithm, we applied an average perceptron trained Featurized Hidden Markov Model (FHMM). Five FHMM instances were used to create an ensemble, with the output phrase selected by voting.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Algorithms and Data Compression · Topic Modeling
