POTATO: exPlainable infOrmation exTrAcTion framewOrk

Ádám Kovács; Kinga Gémes; Eszter Iklódi; Gábor Recski

arXiv:2201.13230·cs.CL·October 18, 2022

TL;DR

POTATO is a versatile, human-in-the-loop framework that enables rule-based text classification across languages and domains by leveraging graph-based features and interpretable machine learning.

Contribution

It introduces a novel, language- and task-independent system for rule extraction using graph representations and real-time user interaction.

Findings

01. Supports multiple graph formats, including AMR, UD, and 4lang

02. Applied to German legal text and English social media data

03. Provides real-time rule suggestions and refinements

Abstract

We present POTATO, a task- and language-independent framework for human-in-the-loop (HITL) learning of rule-based text classifiers using graph-based features. POTATO handles any type of directed graph and supports parsing text into Abstract Meaning Representations (AMR), Universal Dependencies (UD), and 4lang semantic graphs. A Streamlit-based user interface allows users to build rule systems from graph patterns, provides real-time evaluation based on ground truth data, and suggests rules by ranking graph features using interpretable machine learning models. Users can also provide patterns over graphs using regular expressions, and POTATO can recommend refinements of such rules. POTATO is applied in projects across domains and languages, including classification tasks on German legal text and English social media data. All components of our system are written in Python, can be installed…
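
The workflow the abstract describes (rules as graph patterns with regular expressions, evaluation against ground truth, and rule suggestion by ranking graph features) can be illustrated with a small sketch. It does not use POTATO's actual API: the names `matches`, `evaluate`, and `suggest_rules` and the toy UD-style edges are hypothetical, each pattern edge is matched independently rather than as a full subgraph, and ranking single edges by precision only stands in for the interpretable machine learning models the paper mentions.

```python
import re
from typing import List, Set, Tuple

# A graph is a set of labeled directed edges: (head, relation, dependent).
Edge = Tuple[str, str, str]
Graph = Set[Edge]
# A pattern is a list of edges whose three fields are regular expressions.
Pattern = List[Edge]


def matches(graph: Graph, pattern: Pattern) -> bool:
    """True if every pattern edge is matched by at least one graph edge.

    Simplification: edges are matched independently, so shared nodes are
    not required to bind consistently as a real subgraph matcher would.
    """
    return all(
        any(
            re.fullmatch(head, g_head)
            and re.fullmatch(rel, g_rel)
            and re.fullmatch(dep, g_dep)
            for g_head, g_rel, g_dep in graph
        )
        for head, rel, dep in pattern
    )


def evaluate(graphs: List[Graph], gold: List[bool], rules: List[Pattern]):
    """Precision and recall of a rule system (a disjunction of patterns)."""
    predicted = [any(matches(g, rule) for rule in rules) for g in graphs]
    tp = sum(p and y for p, y in zip(predicted, gold))
    fp = sum(p and not y for p, y in zip(predicted, gold))
    fn = sum(y and not p for p, y in zip(predicted, gold))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def suggest_rules(graphs: List[Graph], gold: List[bool]):
    """Rank every observed edge as a one-edge rule by (precision, support)."""
    scored = []
    for edge in {e for g in graphs for e in g}:
        hits = [y for g, y in zip(graphs, gold) if edge in g]
        scored.append((sum(hits) / len(hits), len(hits), edge))
    scored.sort(key=lambda item: (-item[0], -item[1], item[2]))
    return scored


# Hypothetical toy data: four parsed sentences and their gold labels.
graphs = [
    {("hate", "nsubj", "I"), ("hate", "obj", "spam")},
    {("hates", "nsubj", "she"), ("hates", "obj", "rain")},
    {("like", "nsubj", "I"), ("like", "obj", "rain")},
    {("rain", "nsubj", "it")},
]
gold = [True, True, False, False]

# One rule: any object edge whose head is "hate" or "hates".
rules = [[("hates?", "obj", ".*")]]

print(evaluate(graphs, gold, rules))  # (1.0, 1.0)
for precision, support, edge in suggest_rules(graphs, gold)[:3]:
    print(precision, support, edge)
```

In this toy setting the single regular-expression rule generalizes over two literal edges, which is the kind of refinement a user might reach after inspecting the ranked suggestions.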

Code & Models

Repositories

adaamko/potato (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies