
TL;DR
Petrarch 2 is an advanced event-data coding system that leverages syntactic parsing and linguistic theories to improve the accuracy and efficiency of extracting event information from text.
Contribution
It introduces a new syntactic-level processing approach using Stanford CoreNLP and formal linguistic theories, enhancing previous versions' capabilities.
Findings
Faster and smarter core logic for event coding
Utilizes syntactic parse trees for improved accuracy
Incorporates linguistic theories like Generative Grammar and Lambda-Calculus
Abstract
PETRARCH 2 is the fourth generation of a series of Event-Data coders stemming from research by Phillip Schrodt. Each iteration has brought new functionality and usability, and this is no exception.Petrarch 2 takes much of the power of the original Petrarch's dictionaries and redirects it into a faster and smarter core logic. Earlier iterations handled sentences largely as a list of words, incorporating some syntactic information here and there. Petrarch 2 now views the sentence entirely on the syntactic level. It receives the syntactic parse of a sentence from the Stanford CoreNLP software, and stores this data as a tree structure of linked nodes, where each node is a Phrase object. Prepositional, noun, and verb phrases each have their own version of this Phrase class, which deals with the logic particular to those kinds of phrases. Since this is an event coder, the core of the logic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
