HalalBench: A Multilingual OCR Benchmark for Food Packaging Ingredient Extraction
Hasan Arief

TL;DR
HalalBench is the first multilingual OCR benchmark for food packaging, addressing unique challenges like curved surfaces and dense multilingual text, with evaluations of four OCR engines and a post-processing improvement.
Contribution
It introduces HalalBench, a comprehensive multilingual dataset and benchmark for food packaging OCR, filling a gap in standardized evaluation tools.
Findings
DocTR achieves the highest F1 score of 0.193 among tested engines.
All engines perform poorly on Japanese text, with F1=0.000.
Post-processing improves F1 score by 36%.
Abstract
No standardized benchmark exists for evaluating OCR on food packaging, despite its critical role in automated halal food verification. Existing benchmarks target documents or scene text, missing the unique challenges of ingredient labels: curved surfaces, dense multilingual text, and sub-8pt fonts. We present HalalBench, the first open multilingual benchmark for food packaging OCR, comprising 1,043 images (50 real, 993 synthetic) with 36,438 annotations in COCO format spanning 14 languages. We evaluate four engines: docTR achieves F1=0.193, ML Kit 0.180, EasyOCR 0.167, while all fail on Japanese (F1=0.000). A clustering ablation shows 36% F1 improvement from our post-processing algorithm. We validate findings through HalalLens (https://halallens.no), a production halal scanner serving 20+ countries. Dataset and code are released under open licenses.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
