Detecting and Identifying Selection Structure in Sequential Data
Yujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Sch\"olkopf, Kun Zhang

TL;DR
This paper investigates the causal structure of selection in sequential data, demonstrating its identifiability without parametric assumptions and proposing an algorithm to detect such structures, with applications to music sequences.
Contribution
It introduces a nonparametric method to identify selection mechanisms in sequential data, advancing understanding of underlying causal processes beyond bias correction.
Findings
Selection structure is identifiable without parametric assumptions.
The proposed algorithm accurately detects selection in synthetic and real data.
Understanding selection improves modeling of sequential data.
Abstract
We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportunity to provide a deeper insight into the hidden generation process, as it is a fundamental mechanism underlying what we observe. In particular, overlooking selection in sequential data can lead to an incomplete or overcomplicated inductive bias in modeling, such as assuming a universal autoregressive structure for all dependencies. Therefore, rather than merely viewing it as a bias, we explore the causal structure of selection in sequential data to delve deeper into the complete causal process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovation Diffusion and Forecasting · Big Data and Business Intelligence · Data Mining Algorithms and Applications
