Classroom Slide Narration System
Jobin K.V., Ajoy Mondal, and C. V. Jawahar

TL;DR
This paper introduces a Classroom Slide Narration System (CSNS) that automatically generates audio descriptions for slide content, aiding visually impaired students by accurately segmenting and extracting slide information using a novel architecture and multiple modules.
Contribution
The paper proposes a new segmentation architecture, CSSN, tailored for slide images, and integrates OCR, figure classification, and other modules to improve accessibility for VI students.
Findings
9.54% segmentation accuracy improvement on WiSe dataset
Better user feedback compared to Facebook's AAT and Tesseract
Effective extraction of slide content for narration
Abstract
Slide presentations are an effective and efficient tool used by the teaching community for classroom communication. However, this teaching model can be challenging for blind and visually impaired (VI) students. The VI student required personal human assistance for understand the presented slide. This shortcoming motivates us to design a Classroom Slide Narration System (CSNS) that generates audio descriptions corresponding to the slide content. This problem poses as an image-to-markup language generation task. The initial step is to extract logical regions such as title, text, equation, figure, and table from the slide image. In the classroom slide images, the logical regions are distributed based on the location of the image. To utilize the location of the logical regions for slide image segmentation, we propose the architecture, Classroom Slide Segmentation Network (CSSN). The unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Subtitles and Audiovisual Media · Tactile and Sensory Interactions
