RuDi: Explaining Behavior Sequence Models by Automatic Statistics   Generation and Rule Distillation

Yao Zhang; Yun Xiong; Yiheng Sun; Caihua Shan; Tian Lu; Hui Song,; Yangyong Zhu

arXiv:2208.07211·cs.LG·August 17, 2022

RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation

Yao Zhang, Yun Xiong, Yiheng Sun, Caihua Shan, Tian Lu, Hui Song,, Yangyong Zhu

PDF

Open Access 1 Repo

TL;DR

RuDi is a two-stage method that automatically generates informative statistics and distills black-box behavior sequence models into transparent rule-based models, enhancing explainability in risk scoring.

Contribution

It introduces a novel Monte Carlo tree search-based statistics generation and neural logical networks for rule distillation from black-box models.

Findings

01

Effective on multiple real-world datasets

02

Produces interpretable rule-based models

03

Maintains high predictive performance

Abstract

Risk scoring systems have been widely deployed in many applications, which assign risk scores to users according to their behavior sequences. Though many deep learning methods with sophisticated designs have achieved promising results, the black-box nature hinders their applications due to fairness, explainability, and compliance consideration. Rule-based systems are considered reliable in these sensitive scenarios. However, building a rule system is labor-intensive. Experts need to find informative statistics from user behavior sequences, design rules based on statistics and assign weights to each rule. In this paper, we bridge the gap between effective but black-box models and transparent rule models. We propose a two-stage method, RuDi, that distills the knowledge of black-box teacher models into rule-based student models. We design a Monte Carlo tree search-based statistics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yzhang1918/cikm2022rudi
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Software Engineering Research · Machine Learning and Data Classification