What Is the Difference Between a Mountain and a Molehill? Quantifying Semantic Labeling of Visual Features in Line Charts
Dennis Bromley, Vidya Setlur

TL;DR
This paper introduces a crowdsourced dataset of labeled visual features in line charts and a novel method that combines feature-word distributions with visual and data domain information to improve semantic labeling and summarization of charts.
Contribution
It presents a new dataset of visual feature labels in line charts and a novel feature-word-topic modeling approach that captures subtle semantic differences in chart descriptions.
Findings
The dataset effectively captures visual feature semantics.
The proposed model identifies nuanced semantic differences.
Application to chart annotation and summarization demonstrated.
Abstract
Relevant language describing visual features in charts can be useful for authoring captions and summaries about the charts to help with readers' takeaways. To better understand the interplay between concepts that describe visual features and the semantic relationships among those concepts (e.g., 'sharp increase' vs. 'gradual rise'), we conducted a crowdsourced study to collect labels and visual feature pairs for univariate line charts. Using this crowdsourced dataset of labeled visual signatures, this paper proposes a novel method for labeling visual chart features based on combining feature-word distributions with the visual features and the data domain of the charts. These feature-word-topic models identify word associations with similar yet subtle differences in semantics, such as 'flat,' 'plateau,' and 'stagnant,' and descriptors of the visual features, such as 'sharp increase,'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Management and Algorithms · Advanced Text Analysis Techniques
