Automatic Text Line Segmentation Directly in JPEG Compressed Document Images
Bulla Rajesh, Mohammed Javed, P Nagabhushan

TL;DR
This paper introduces methods for directly performing text line segmentation on JPEG compressed images by analyzing DCT coefficients, eliminating the need for full decompression and improving efficiency in processing large volumes of document images.
Contribution
It presents two novel approaches for text line segmentation directly in JPEG compressed domain, one with partial decompression and another using analysis of AC coefficients without decompression.
Findings
Effective segmentation across various font sizes and styles
Avoids full decompression, reducing computational cost
Achieves good performance in diverse document conditions
Abstract
JPEG is one of the popular image compression algorithms that provide efficient storage and transmission capabilities in consumer electronics, and hence it is the most preferred image format over the internet world. In the present digital and Big-data era, a huge volume of JPEG compressed document images are being archived and communicated through consumer electronics on daily basis. Though it is advantageous to have data in the compressed form on one side, however, on the other side processing with off-the-shelf methods becomes computationally expensive because it requires decompression and recompression operations. Therefore, it would be novel and efficient, if the compressed data are processed directly in their respective compressed domains of consumer electronics. In the present research paper, we propose to demonstrate this idea taking the case study of printed text line…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Advanced Image and Video Retrieval Techniques · Vehicle License Plate Recognition
MethodsDiscrete Cosine Transform
