Learning with Changing Features

Amit Dhurandhar; Steve Hanneke; Liu Yang

arXiv:1705.00219·cs.LG·May 2, 2017·1 cites

Learning with Changing Features

Amit Dhurandhar, Steve Hanneke, Liu Yang

PDF

Open Access

TL;DR

This paper introduces a method to detect when features start to influence an output variable in dynamic settings, with applications in retail and manufacturing, providing theoretical guarantees and real-world case studies.

Contribution

It presents the first formal approach for change point detection in a distribution-independent, supervised learning context, with provable performance and practical efficiency.

Findings

01

Successfully identified change points in retail data linked to news events.

02

Detected feature relevance shifts in manufacturing plant data.

03

Method is efficient and applicable to multiple change points.

Abstract

In this paper we study the setting where features are added or change interpretation over time, which has applications in multiple domains such as retail, manufacturing, finance. In particular, we propose an approach to provably determine the time instant from which the new/changed features start becoming relevant with respect to an output variable in an agnostic (supervised) learning setting. We also suggest an efficient version of our approach which has the same asymptotic performance. Moreover, our theory also applies when we have more than one such change point. Independent post analysis of a change point identified by our method for a large retailer revealed that it corresponded in time with certain unflattering news stories about a brand that resulted in the change in customer behavior. We also applied our method to data from an advanced manufacturing plant identifying the time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Fuzzy Systems and Optimization · Time Series Analysis and Forecasting