Metric Learning for Temporal Sequence Alignment

Damien Garreau (INRIA Paris - Rocquencourt; DI-ENS); R\'emi Lajugie; (INRIA Paris - Rocquencourt; DI-ENS); Sylvain Arlot (INRIA Paris -; Rocquencourt; DI-ENS); Francis Bach (INRIA Paris - Rocquencourt; DI-ENS)

arXiv:1409.3136·cs.LG·September 11, 2014·37 cites

Metric Learning for Temporal Sequence Alignment

Damien Garreau (INRIA Paris - Rocquencourt, DI-ENS), R\'emi Lajugie, (INRIA Paris - Rocquencourt, DI-ENS), Sylvain Arlot (INRIA Paris -, Rocquencourt, DI-ENS), Francis Bach (INRIA Paris - Rocquencourt, DI-ENS)

PDF

Open Access

TL;DR

This paper introduces a method to learn a Mahalanobis distance for aligning multivariate time series, improving alignment accuracy by optimizing a learned similarity measure and combining features.

Contribution

It presents a novel metric learning approach for temporal sequence alignment, casting it as a structured prediction problem with tractable optimization.

Findings

01

Improved alignment performance on real audio data.

02

Effective feature selection enhances alignment accuracy.

Abstract

In this paper, we propose to learn a Mahalanobis distance to perform alignment of multivariate time series. The learning examples for this task are time series for which the true alignment is known. We cast the alignment problem as a structured prediction task, and propose realistic losses between alignments for which the optimization is tractable. We provide experiments on real data in the audio to audio context, where we show that the learning of a similarity measure leads to improvements in the performance of the alignment task. We also propose to use this metric learning framework to perform feature selection and, from basic audio features, build a combination of these with better performance for the alignment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Time Series Analysis and Forecasting · Speech and Audio Processing