Revisiting CNNs for Trajectory Similarity Learning
Zhihao Chang, Linzhu Yu, Huan Li, Sai Wu, Gang Chen, Dongxiang Zhang

TL;DR
This paper revisits CNNs for trajectory similarity learning, showing they outperform RNNs and Transformers in accuracy and speed by focusing on local features, with theoretical justification and large-scale experiments.
Contribution
Introducing ConvTraj, a CNN-based model that captures local features for trajectory similarity, providing theoretical analysis and demonstrating superior performance and efficiency.
Findings
ConvTraj achieves state-of-the-art accuracy on large-scale datasets.
Training and inference speeds are significantly improved, by 240x and 2.16x respectively.
ConvTraj outperforms RNNs and Transformers in trajectory similarity tasks.
Abstract
Similarity search is a fundamental but expensive operator in querying trajectory data, due to its quadratic complexity of distance computation. To mitigate the computational burden for long trajectories, neural networks have been widely employed for similarity learning and each trajectory is encoded as a high-dimensional vector for similarity search with linear complexity. Given the sequential nature of trajectory data, previous efforts have been primarily devoted to the utilization of RNNs or Transformers. In this paper, we argue that the common practice of treating trajectory as sequential data results in excessive attention to capturing long-term global dependency between two sequences. Instead, our investigation reveals the pivotal role of local similarity, prompting a revisit of simple CNNs for trajectory similarity learning. We introduce ConvTraj, incorporating both 1D and 2D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Automated Road and Building Extraction · Data Management and Algorithms
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
