Document Navigability: A Need for Print-Impaired

Anukriti Kumar; Tanuja Ganu; Saikat Guha

arXiv:2206.10253·cs.CV·June 27, 2022

Document Navigability: A Need for Print-Impaired

Anukriti Kumar, Tanuja Ganu, Saikat Guha

PDF

Open Access

TL;DR

This paper addresses the challenge of navigating internal references in printed scientific documents for print-disabled individuals by proposing a vision-based technique to locate and extract metadata of referenced content, improving accessibility.

Contribution

The paper introduces a novel vision-based method to identify and extract metadata of internal references in printed documents, aiding print-disabled users' navigation.

Findings

01

Effective on both digital and scanned documents

02

Works well for citations in scientific papers

03

Improves navigation for print-disabled users

Abstract

Printed documents continue to be a challenge for blind, low-vision, and other print-disabled (BLV) individuals. In this paper, we focus on the specific problem of (in-)accessibility of internal references to citations, footnotes, figures, tables and equations. While sighted users can flip to the referenced content and flip back in seconds, linear audio narration that BLV individuals rely on makes following these references extremely hard. We propose a vision based technique to locate the referenced content and extract metadata needed to (in subsequent work) inline a content summary into the audio narration. We apply our technique to citations in scientific documents and find it works well both on born-digital as well as scanned documents.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Web Data Mining and Analysis · Video Analysis and Summarization

MethodsFLIP