Learning event representation: As sparse as possible, but not sparser
Tuan Do, James Pustejovsky

TL;DR
This paper explores the use of qualitative spatial reasoning features for classifying human-object interactions in 3D space, demonstrating significant performance improvements over quantitative features and proposing a simple, visualizable representation method.
Contribution
It introduces a modified QSRLib for feature extraction in 3D activity classification, showing qualitative features outperform quantitative ones and enabling a visualizable 2D action composition approach.
Findings
Qualitative spatial features improve classification accuracy.
Sequential QSR features yield the best performance.
Proposed method allows simple visualization of 3D activities.
Abstract
Selecting an optimal event representation is essential for event classification in real world contexts. In this paper, we investigate the application of qualitative spatial reasoning (QSR) frameworks for classification of human-object interaction in three dimensional space, in comparison with the use of quantitative feature extraction approaches for the same purpose. In particular, we modify QSRLib, a library that allows computation of Qualitative Spatial Relations and Calculi, and employ it for feature extraction, before inputting features into our neural network models. Using an experimental setup involving motion captures of human-object interaction as three dimensional inputs, we observe that the use of qualitative spatial features significantly improves the performance of our machine learning algorithm against our baseline, while quantitative features of similar kinds fail to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · AI-based Problem Solving and Planning · Multimodal Machine Learning Applications
