From 'Here' to 'There': Exploring Proximity Semantics in Multimodal Data Exploration
Dennis Bromley, Diana Wang, Vidya Setlur

TL;DR
This paper introduces a multimodal data exploration system that combines sketching, language, and annotations, leveraging proximity semantics to improve pattern discovery in complex datasets.
Contribution
It presents a novel hybrid architecture and conceptual framework for understanding proximity semantics in multimodal interactions, supported by a user study.
Findings
Participants used spatial, temporal, and visual proximity to disambiguate meaning.
Proximity semantics influence how users relate multimodal elements during exploration.
The system supports pattern matching and semantic constraints through hybrid architecture.
Abstract
Modern data exploration tools often struggle to capture the subtleties of analytical intent, especially when users seek patterns that are difficult to specify using traditional query methods or natural language alone. We introduce a multimodal research probe for querying time-series and geospatial data that integrates free-form sketching, natural language, and visual annotations within a unified interaction space. Users articulate queries by sketching trends or spatial paths and augmenting them with annotations and analytical directives grounded in shared spatial and temporal context. The system employs a hybrid architecture combining geometric sketch matching and visual language models (VLMs) to support queries that interleave pattern matching and semantic constraints. Through a preliminary study with 20 participants, we observed recurring interaction patterns in which participants…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
