Background Knowledge Injection for Interpretable Sequence Classification

Severin Gsponer; Luca Costabello; Chan Le Van; Sumit Pai; Christophe; Gueret; Georgiana Ifrim; Freddy Lecue

arXiv:2006.14248·cs.LG·June 26, 2020

Background Knowledge Injection for Interpretable Sequence Classification

Severin Gsponer, Luca Costabello, Chan Le Van, Sumit Pai, Christophe, Gueret, Georgiana Ifrim, Freddy Lecue

PDF

Open Access

TL;DR

This paper presents a novel sequence classification method that combines linear classifiers with background knowledge embeddings, enhancing interpretability without sacrificing accuracy, demonstrated on activity recognition and amino acid sequences.

Contribution

Introduces a new sequence learning algorithm integrating background knowledge embeddings with linear classifiers for improved interpretability.

Findings

01

Maintains predictive accuracy comparable to existing methods.

02

Produces more interpretable models through knowledge-augmented features.

03

Proposes a new interpretability measure based on symbol embeddings.

Abstract

Sequence classification is the supervised learning task of building models that predict class labels of unseen sequences of symbols. Although accuracy is paramount, in certain scenarios interpretability is a must. Unfortunately, such trade-off is often hard to achieve since we lack human-independent interpretability metrics. We introduce a novel sequence learning algorithm, that combines (i) linear classifiers - which are known to strike a good balance between predictive power and interpretability, and (ii) background knowledge embeddings. We extend the classic subsequence feature space with groups of symbols which are generated by background knowledge injected via word or graph embeddings, and use this new feature space to learn a linear classifier. We also present a new measure to evaluate the interpretability of a set of symbolic features based on the symbol embeddings. Experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Time Series Analysis and Forecasting

MethodsInterpretability