A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions
Anh Le, Thanh Lam, Dung Nguyen

TL;DR
This survey reviews Vietnamese document analysis and recognition, highlighting challenges like diacritics and data scarcity, and explores how recent large language and vision-language models can advance the field.
Contribution
It provides a comprehensive overview of current techniques, identifies key limitations, and discusses future directions including dataset creation and multimodal approaches.
Findings
Deep learning shows promise but is limited by data scarcity.
LLMs and vision-language models improve Vietnamese text recognition.
Future research should focus on dataset development and model optimization.
Abstract
Vietnamese document analysis and recognition (DAR) is a crucial field with applications in digitization, information retrieval, and automation. Despite advancements in OCR and NLP, Vietnamese text recognition faces unique challenges due to its complex diacritics, tonal variations, and lack of large-scale annotated datasets. Traditional OCR methods often struggle with real-world document variations, while deep learning approaches have shown promise but remain limited by data scarcity and generalization issues. Recently, large language models (LLMs) and vision-language models have demonstrated remarkable improvements in text recognition and document understanding, offering a new direction for Vietnamese DAR. However, challenges such as domain adaptation, multimodal learning, and computational efficiency persist. This survey provide a comprehensive review of existing techniques in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Topic Modeling · Text and Document Classification Technologies
