Connecting the Dots: Surfacing Structure in Documents through AI-Generated Cross-Modal Links
Alyssa Hwang, Hita Kambhamettu, Yue Yang, Ajay Patel, Joseph Chee Chang, Andrew Head

TL;DR
This paper introduces a framework and interactive tool that enhances understanding of complex, information-dense documents by surfacing cross-modal links, leading to improved comprehension without increasing reading time.
Contribution
The paper presents a novel framework and an augmented reading interface that integrates information across media types in complex documents, improving comprehension.
Findings
Participants scored higher on reading quizzes using the tool.
No increase in reading time or cognitive load observed.
Supports engagement with complex materials.
Abstract
Understanding information-dense documents like recipes and scientific papers requires readers to find, interpret, and connect details scattered across text, figures, tables, and other visual elements. These documents are often long and filled with specialized terminology, hindering the ability to locate relevant information or piece together related ideas. Existing tools offer limited support for synthesizing information across media types. As a result, understanding complex material remains cognitively demanding. This paper presents a framework for fine-grained integration of information in complex documents. We instantiate the framework in an augmented reading interface, which populates a scientific paper with clickable points on figures, interactive highlights in the body text, and a persistent reference panel for accessing consolidated details without manual scrolling. In a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Interactive and Immersive Displays · Visual and Cognitive Learning Processes
