Context-Informed Machine Translation of Manga using Multimodal Large Language Models
Philip Lippmann, Konrad Skublicki, Joshua Tanner, Shonosuke, Ishiwatari, Jie Yang

TL;DR
This paper explores the use of multimodal large language models for manga translation, incorporating visual elements to improve quality, and introduces a new dataset and software tools for benchmarking in this domain.
Contribution
It proposes a novel methodology leveraging multimodal LLMs for manga translation, introduces a Japanese-Polish translation dataset, and provides open-source benchmarking tools.
Findings
Achieves state-of-the-art results for Japanese-English manga translation.
Sets a new standard for Japanese-Polish translation.
Demonstrates the effectiveness of visual context in translation quality.
Abstract
Due to the significant time and effort required for handcrafting translations, most manga never leave the domestic Japanese market. Automatic manga translation is a promising potential solution. However, it is a budding and underdeveloped field and presents complexities even greater than those found in standard translation due to the need to effectively incorporate visual elements into the translation process to resolve ambiguities. In this work, we investigate to what extent multimodal large language models (LLMs) can provide effective manga translation, thereby assisting manga authors and publishers in reaching wider audiences. Specifically, we propose a methodology that leverages the vision component of multimodal LLMs to improve translation quality and evaluate the impact of translation unit size, context length, and propose a token efficient approach for manga translation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Translation Studies and Practices
MethodsSparse Evolutionary Training
